# Models Converted to fp16
- Llama2-chat-hf-fp16
- Llama3-8B-Instruct model with fp16
- Llama3-70B-Instruct model with fp16
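
As a rough illustration of what an fp16 conversion does to individual weights, the sketch below round-trips a Python float through IEEE 754 half precision using only the standard library. The helper names `to_fp16` and `fp16_size_bytes` and the sample values are hypothetical; this is not the procedure used to produce the models listed above.

```python
import struct


def to_fp16(value: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    # 'e' is the struct format code for a 2-byte half-precision float.
    return struct.unpack("<e", struct.pack("<e", value))[0]


def fp16_size_bytes(num_params: int) -> int:
    """Storage needed for num_params weights at 2 bytes each."""
    return num_params * struct.calcsize("<e")


if __name__ == "__main__":
    # Hypothetical sample weights, chosen only to show rounding.
    for w in (0.1, 1.0, 3.14159):
        print(w, "->", to_fp16(w))
    print(fp16_size_bytes(1_000_000), "bytes for 1M weights")
```

Values that are exactly representable, such as 1.0, survive unchanged, while values such as 0.1 are rounded to the nearest half-precision number.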
# Quantized models
- https://fossies.org/linux/llama.cpp/examples/imatrix/README.md
- https://www.databricks.com/sites/default/files/2024-04/Databricks-Big-Book-Of-GenAI-FINAL.pdf
## Vectordb
https://medium.com/@zilliz_learn/how-to-evaluate-a-vector-database-86dfdcc67d9b
## Chunk Visualization
https://chunkviz.up.railway.app/
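
The kind of splitting such a visualizer displays can be sketched as a fixed-size character chunker with overlap. The function name `chunk_text` and the `chunk_size` and `overlap` parameters are assumptions made for illustration and are not taken from the linked tool.

```python
def chunk_text(text: str, chunk_size: int = 20, overlap: int = 5) -> list[str]:
    """Split text into character chunks of at most chunk_size,
    where consecutive chunks share `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


if __name__ == "__main__":
    for piece in chunk_text("abcdefghij", chunk_size=4, overlap=1):
        print(repr(piece))
```

A larger overlap keeps more shared context between neighbouring chunks at the cost of producing more chunks.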
## Prompting
- https://www.promptingguide.ai/
- https://learnprompting.org/docs/intro
## MLOps
https://www.databricks.com/sites/default/files/2024-06/2023-10-EB-Big-Book-of-MLOps-2nd-Edition.pdf
## OpenAI Tokenizer
https://platform.openai.com/tokenizer