File size: 704 Bytes
9140e55
 
 
 
 
5903f80
2ad37af
 
0835723
 
 
545a921
 
 
 
 
 
 
d102a50
 
5eafee4
 
 
d102a50
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
# Models Converted to fp16 
- LLama2-chat-hf-fp16
- LLama3-7b-Instruct Model with fp16
- LLama3-70B-Instruct Model with fp16

# Quantized models:
https://fossies.org/linux/llama.cpp/examples/imatrix/README.md

https://www.databricks.com/sites/default/files/2024-04/Databricks-Big-Book-Of-GenAI-FINAL.pdf

## Vectordb
https://medium.com/@zilliz_learn/how-to-evaluate-a-vector-database-86dfdcc67d9b

## Chunk Visualization
https://chunkviz.up.railway.app/

## Prompting
https://www.promptingguide.ai/
https://learnprompting.org/docs/intro

##MLOPs
https://www.databricks.com/sites/default/files/2024-06/2023-10-EB-Big-Book-of-MLOps-2nd-Edition.pdf

## OpenAI Tokenizer
https://platform.openai.com/tokenizer