Add some TODOs, update readme.md, update evaluation.ipynb

- Dockerfile +1 -0
- README.md +13 -2
- commafixer/src/fixer.py +2 -1
- notebooks/evaluation.ipynb +0 -0
Dockerfile CHANGED

```diff
@@ -27,6 +27,7 @@ COPY --chown=user . .
 FROM base as test
 
 RUN pip install .[test]
+# TODO don't run all at once because of memory errors?
 RUN python -m pytest tests
 
 FROM python:3.10-slim as deploy
```
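The `# TODO don't run all at once because of memory errors?` note concerns peak memory in the single `RUN python -m pytest tests` build step. One way to split the run, sketched here as a hypothetical Python helper (not part of the repo), is to invoke pytest once per test module, each in a fresh process, so only one model is loaded at a time:

```python
import pathlib
import subprocess
import sys

def run_tests_one_by_one(test_dir="tests"):
    """Run each test_*.py module in its own pytest process.

    Hypothetical helper for the Dockerfile TODO: separate processes keep
    peak memory bounded to one test module's model loads at a time.
    Returns 0 on success, or the first non-zero pytest exit code.
    """
    for path in sorted(pathlib.Path(test_dir).glob("test_*.py")):
        result = subprocess.run([sys.executable, "-m", "pytest", str(path)])
        if result.returncode != 0:
            return result.returncode
    return 0
```

The same effect can be had directly in the Dockerfile by replacing the single `RUN` with one `RUN python -m pytest tests/<module>.py` line per module, at the cost of more image layers.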
    	
README.md CHANGED

````diff
@@ -27,6 +27,7 @@ Note that you might have to
 `sudo service docker start`
 first.
 
+
 The application should then be available at http://localhost:8000.
 For the API, see the `openapi.yaml` file.
 Docker-compose mounts a volume and listens to changes in the source code, so the application will be reloaded and
@@ -35,7 +36,15 @@ reflect them.
 We use multi-stage builds to reduce the image size, ensure flexibility in requirements and that tests are run before
 each deployment.
 However, while it does reduce the size by nearly 3GB, the resulting image still contains deep learning libraries and
-pre-downloaded models, and will take around 
+pre-downloaded models, and will take around 9GB of disk space.
+
+NOTE: Since the service is hosting two large deep learning models, there might be memory issues depending on your
+machine, where the terminal running
+docker would simply crash.
+Should that happen, you can try increasing resources allocated to docker, or splitting commands in the docker file,
+e.g., running tests one by one.
+If everything fails, you can still use the hosted huggingface hub demo, or follow the steps below and run the app
+locally without Docker.
 
 Alternatively, you can setup a python environment by hand. It is recommended to use a virtualenv. Inside one, run
 ```bash
@@ -43,6 +52,8 @@ pip install -e .[test]
 ```
 the `[test]` option makes sure to install test dependencies.
 
+Then, run `python app.py` or `uvicorn --host 0.0.0.0 --port 8000 "app:app" --reload` to run the application.
+
 If you intend to perform training and evaluation of deep learning models, install also using the `[training]` option.
 
 ### Running tests
@@ -91,7 +102,7 @@ dataset are as follows:
 | Model    | precision | recall | F1   | support |
 |----------|-----------|--------|------|---------|
 | baseline | 0.79      | 0.72   | 0.75 | 10079   |
-| ours*    | 0.
+| ours*    | 0.84      | 0.84   | 0.84 | 10079   |
 *details of the fine-tuning process in the next section.
 
 We treat each comma as one token instance, as opposed to the original paper, which NER-tags the whole multiple-token
````
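For context on the metrics table in the last hunk: since each comma is treated as one token instance, precision, recall, and F1 reduce to a binary per-token classification score. A library such as scikit-learn would normally compute this; the pure-Python helper below is a hypothetical, illustrative equivalent for binary labels (1 = comma belongs after this token):

```python
def prf1(y_true, y_pred):
    """Precision, recall, and F1 for binary token labels.

    Hypothetical stand-in for a library metric function, shown only to
    illustrate how the README's evaluation table is computed.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1
```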
    	
commafixer/src/fixer.py CHANGED

```diff
@@ -52,7 +52,8 @@ class CommaFixer:
         tokenizer = AutoTokenizer.from_pretrained(config.base_model_name_or_path)
         model = PeftModel.from_pretrained(inference_model, model_name)
         model = model.merge_and_unload()  # Join LoRa matrices with the main model for faster inference
-
+        # TODO batch, and move to CUDA if available
+        return model.eval(), tokenizer
 
 
 def _fix_commas_based_on_labels_and_offsets(
```
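The new `# TODO batch, and move to CUDA if available` comment points at two standard follow-ups: moving the model to GPU (in PyTorch, typically something like `model.to("cuda" if torch.cuda.is_available() else "cpu")` before `eval()`), and feeding the tokenizer and model lists of sentences rather than one sentence at a time. The batching half can be sketched framework-free; this is a hypothetical helper, not code from the repo:

```python
def batched(items, batch_size):
    """Yield successive batch_size-sized chunks of items.

    Hypothetical helper for the TODO above: group input sentences so the
    tokenizer and model process several at once per forward pass.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]
```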
    	
notebooks/evaluation.ipynb CHANGED

The diff for this file is too large to render. See raw diff.