jxke-MBA-M3-13i
commited on
Commit
·
fef7ac9
1
Parent(s):
9b7f617
Convert Qwen1.5-4B-Chat to RKLLM format for Rockchip devices
Browse files
README.md
ADDED
@@ -0,0 +1,63 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- zh
|
4 |
+
- en
|
5 |
+
pipeline_tag: text-generation
|
6 |
+
tags:
|
7 |
+
- qwen
|
8 |
+
- Qwen1.5-4B-Chat
|
9 |
+
- rkllm
|
10 |
+
- rk3588
|
11 |
+
---
|
12 |
+
|
13 |
+
# Qwen1.5-4B-Chat-rkllm
|
14 |
+
This is a conversion from [Qwen/Qwen1.5-4B-Chat](https://huggingface.co/Qwen/Qwen1.5-4B-Chat)
|
15 |
+
to the RKLLM format for chat in Rockchip devices.
|
16 |
+
|
17 |
+
|
18 |
+
- [Qwen1.5-4B-Chat-rkllm](#qwen15-4b-chat-rkllm)
|
19 |
+
- [Support Devices](#support-devices)
|
20 |
+
- [Convert tools](#convert-tools)
|
21 |
+
- [Converted with RKLLM runtime](#converted-with-rkllm-runtime)
|
22 |
+
- [License](#license)
|
23 |
+
- [Trouble shot](#trouble-shot)
|
24 |
+
- [Reference](#reference)
|
25 |
+
|
26 |
+
|
27 |
+
## Support Devices
|
28 |
+
* RK3588/RK3588s
|
29 |
+
|
30 |
+
## Convert tools
|
31 |
+
To Converting LLMs for Rockchip's NPUs, please see the artical<sup>1,2</sup> for model details.
|
32 |
+
|
33 |
+
## Converted with RKLLM runtime
|
34 |
+
* RKLLM runtime `1.0.1`
|
35 |
+
|
36 |
+
## License
|
37 |
+
Same as the original [Qwen/Qwen1.5-4B-Chat](https://huggingface.co/Qwen/Qwen1.5-4B-Chat)
|
38 |
+
|
39 |
+
|
40 |
+
## Trouble shot
|
41 |
+
* `E RKNN: [10:48:59.683] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory`
|
42 |
+
```shell
|
43 |
+
firefly@firefly:~/Documents/rknn-llm$ rkllm ./chatglm3-6b.rkllm
|
44 |
+
rkllm init start
|
45 |
+
rkllm-runtime version: 1.0.1, rknpu driver version: 0.8.2, platform: RK3588
|
46 |
+
Warning: Your rknpu driver version is too low, please upgrade to 0.9.6.
|
47 |
+
E RKNN: [10:48:59.683] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory
|
48 |
+
|
49 |
+
can not create weight memory for domain1
|
50 |
+
E RKNN: [10:49:00.480] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory
|
51 |
+
|
52 |
+
can not create weight memory for domain2
|
53 |
+
E RKNN: [10:49:05.216] failed to convert handle(1020) to fd, ret: -1, errno: 24, errstr: Too many open files
|
54 |
+
|
55 |
+
# Solution
|
56 |
+
firefly@firefly:~/Documents/rknn-llm$ ulimit -n 102400
|
57 |
+
```
|
58 |
+
|
59 |
+
## Reference
|
60 |
+
1. [airockchip/rknn-llm](https://github.com/airockchip/rknn-llm)
|
61 |
+
1. [Pelochus/ezrknn-llm](https://github.com/Pelochus/ezrknn-llm)
|
62 |
+
2. [Qwen/Qwen1.5-4B-Chat](https://huggingface.co/Qwen/Qwen1.5-4B-Chat)
|
63 |
+
3. [跑大模型遇到问题 #62](https://github.com/airockchip/rknn-llm/issues/62)
|