Make fast embed and default embed mutually exclusive. (#4121) 541b2f3 Kevin Hu committed on Dec 19, 2024
When the Qwen rerank model does not return OK, raise an exception to notify the user (#3593) 8c3fb63 liwenju0 committed on Nov 22, 2024
Add api for sessions and add max_tokens for tenant_llm (#3472) 99ac12c liuhua liuhua committed on Nov 19, 2024
Move settings initialization after module init phase (#3438) 6101699 jinhai-2012 committed on Nov 15, 2024
Use consistent log file names, introduced initLogger (#3403) 8bc2fc9 zhichyu committed on Nov 14, 2024
Fix the value issue of anthropic (#3351) 9ef0b16 shijiefengjun chenhaodong Kevin Hu committed on Nov 13, 2024
Extract model dir from model's full name (#3368) 3256beb roc king 王志鹏 Kevin Hu committed on Nov 13, 2024
fix: TypeError: only length-1 arrays can be converted to Python scalars (#3211) 24b9cdf ksztone-huanggonghao committed on Nov 6, 2024
[Bug]: unnecessary auto-increment calculations in the tokens statistics of the chat model (#2969) c31ab66 Yinquan WANG Kevin Hu committed on Oct 22, 2024
[Bug]: When using the OpenAI chat model, raises ERROR: 'CompletionUsage' object has no attribute 'get' #2948 (#2949) 8efa7c5 Yinquan WANG Kevin Hu committed on Oct 22, 2024
Resolves #2905 openai compatible model provider add llama.cpp rerank support (#2906) 27aa4e5 ziyu4huang committed on Oct 21, 2024
Fix keys of Xinference-deployed models, especially those that have the same model name as publicly hosted models. (#2832) 13b2570 0000sir 0000sir Kevin Hu committed on Oct 16, 2024
support api-version and change default-model in adding azure-openai and openai (#2799) fa680e0 JobSmithManipulation Kevin Hu committed on Oct 11, 2024