hoangus0303
/

paraformer-large-clone-from-funasr

Automatic Speech Recognition

Model card Files Files and versions

paraformer-large-clone-from-funasr / README.md

hoangus0303's picture

Update README.md

e9d5a5b verified 7 months ago

|

history blame contribute delete

1.53 kB

	---
	license: apache-2.0
	language:
	- zh
	metrics:
	- accuracy
	- cer
	pipeline_tag: automatic-speech-recognition
	tags:
	- Paraformer
	- FunASR
	- ASR
	---
	## Introduce

	This repo cloned from https://huggingface.co/funasr/Paraformer-large

	## Install funasr_onnx

	```shell
	pip install -U funasr_onnx
	# For the users in China, you could install with the command:
	# pip install -U funasr_onnx -i https://mirror.sjtu.edu.cn/pypi/web/simple
	```

	## Download the model

	```shell
	git clone https://huggingface.co/hoangus0303/paraformer-large-clone-from-funasr
	```

	## Inference with runtime

	### Speech Recognition
	#### Paraformer
	```python
	from funasr_onnx import Paraformer

	model_dir = "./paraformer-large"
	model = Paraformer(model_dir, batch_size=1, quantize=True)

	wav_path = ['./funasr/paraformer-large/asr_example.wav']

	result = model(wav_path)
	print(result)
	```
	- `model_dir`: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn`
	- `batch_size`: `1` (Default), the batch size duration inference
	- `device_id`: `-1` (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu)
	- `quantize`: `False` (Default), load the model of `model.onnx` in `model_dir`. If set `True`, load the model of `model_quant.onnx` in `model_dir`
	- `intra_op_num_threads`: `4` (Default), sets the number of threads used for intraop parallelism on CPU

	Input: wav formt file, support formats: `str, np.ndarray, List[str]`

	Output: `List[str]`: recognition result