Spaces:
Runtime error
Runtime error
chore: updating md files and credits
Browse files- README.md +22 -13
- main.py +1 -1
- public/about.md +7 -8
- public/credits_dataprotection_license.md +7 -10
README.md
CHANGED
@@ -9,34 +9,43 @@ app_file: main.py
|
|
9 |
pinned: true
|
10 |
license: mit
|
11 |
app_port: 8080
|
|
|
|
|
|
|
12 |
---
|
13 |
|
14 |
-
# Bachelor Thesis
|
15 |
-
## Webapp
|
16 |
-
#### This is the ui showcase for my thesis about interpretability in a LLM based chatbot application and applications of XAI.
|
17 |
|
18 |
-
##
|
|
|
|
|
|
|
19 |
|
20 |
**[Github Repository](https://github.com/LennardZuendorf/thesis-webapp)**
|
21 |
-
**[Huggingface Spaces Showcase](https://huggingface.co/spaces/lennardzuendorf/thesis-webapp-docker**
|
22 |
|
23 |
-
|
|
|
|
|
24 |
|
25 |
**Language and Framework:** Python
|
26 |
|
27 |
**Noteable Packages:** 🤗 Transformers, FastAPI, Gradio, SHAP, BERTViz
|
28 |
|
29 |
-
## 👨💻 Author and Credits
|
30 |
|
31 |
-
**Author
|
32 |
|
33 |
-
**Thesis Supervisor
|
34 |
<br> Second Corrector: [Prof. Dr. Hochstein](https://www.htw-berlin.de/hochschule/personen/person/?eid=10628)
|
35 |
|
36 |
-
See code for in detailed credits, work is based on
|
|
|
|
|
|
|
|
|
37 |
|
38 |
-
|
39 |
-
-
|
40 |
-
-
|
41 |
|
42 |
This Project was part of my studies of Business Computing at University of Applied Science for Technology and Business Berlin (HTW Berlin).
|
|
|
9 |
pinned: true
|
10 |
license: mit
|
11 |
app_port: 8080
|
12 |
+
models: microsoft/GODEL-v1_1-large-seq2seq
|
13 |
+
tags: ["CHAT", "XAI", "SHAP", "GODEL", "Gradio"]
|
14 |
+
disable_embedding: true
|
15 |
---
|
16 |
|
17 |
+
# Bachelor Thesis Webapp
|
|
|
|
|
18 |
|
19 |
+
## 📜 Info:
|
20 |
+
This is the ui showcase for my thesis about interpretability in a LLM based chatbot application and applications of XAI.
|
21 |
+
|
22 |
+
### 🔗 Links:
|
23 |
|
24 |
**[Github Repository](https://github.com/LennardZuendorf/thesis-webapp)**
|
|
|
25 |
|
26 |
+
**[Huggingface Spaces Showcase](https://huggingface.co/spaces/lennardzuendorf/thesis-webapp-docker)**
|
27 |
+
|
28 |
+
### 🏗️ Tech Stack:
|
29 |
|
30 |
**Language and Framework:** Python
|
31 |
|
32 |
**Noteable Packages:** 🤗 Transformers, FastAPI, Gradio, SHAP, BERTViz
|
33 |
|
34 |
+
## 👨💻 Author and Credits:
|
35 |
|
36 |
+
**Author: [@LennardZuendorf](https://github.com/LennardZuendorf)**
|
37 |
|
38 |
+
**Thesis Supervisor: [Prof. Dr. Simbeck](https://www.htw-berlin.de/hochschule/personen/person/?eid=9862)**
|
39 |
<br> Second Corrector: [Prof. Dr. Hochstein](https://www.htw-berlin.de/hochschule/personen/person/?eid=10628)
|
40 |
|
41 |
+
See code for in detailed credits, work is strongly based on:
|
42 |
+
#### GODEL
|
43 |
+
- [HGF Model Page](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq?text=Hey+my+name+is+Mariama%21+How+are+you%3F)
|
44 |
+
- [Paper on HGF](https://huggingface.co/papers/2206.11309)
|
45 |
+
- [Paper Print](https://arxiv.org/abs/2206.11309)
|
46 |
|
47 |
+
#### SHAP
|
48 |
+
- [Github](https://github.com/shap/shap)
|
49 |
+
- [Inital Paper](https://arxiv.org/abs/1705.07874)
|
50 |
|
51 |
This Project was part of my studies of Business Computing at University of Applied Science for Technology and Business Berlin (HTW Berlin).
|
main.py
CHANGED
@@ -60,7 +60,7 @@ with gr.Blocks(
|
|
60 |
# markdown component to display the header
|
61 |
gr.Markdown("""
|
62 |
# Thesis Demo - AI Chat Application with GODEL
|
63 |
-
|
64 |
### Select between tabs below for the different views.
|
65 |
""")
|
66 |
# ChatBot tab used to chat with the AI chatbot
|
|
|
60 |
# markdown component to display the header
|
61 |
gr.Markdown("""
|
62 |
# Thesis Demo - AI Chat Application with GODEL
|
63 |
+
Interpretability powered by shap and attention visualization,
|
64 |
### Select between tabs below for the different views.
|
65 |
""")
|
66 |
# ChatBot tab used to chat with the AI chatbot
|
public/about.md
CHANGED
@@ -8,14 +8,13 @@ This research tackles the rise of LLM based applications such a chatbots and exp
|
|
8 |
|
9 |
- [GitHub Repository](https://github.com/LennardZuendorf/thesis-webapp) - The GitHub repository of this project.
|
10 |
- [HTW Berlin](https://www.htw-berlin.de/) - The University I have built this project for, as part of my thesis.
|
11 |
-
- [Thesis Print]() - Link to the thesis pdf (in English), containing more information about the project. And a full list of sources for this work as well as additional evaluations and fundamental information for the project.
|
12 |
|
13 |
|
14 |
## Implementation
|
15 |
|
16 |
-
This project is an implementation of PartitionSHAP
|
17 |
|
18 |
-
The UI is build with Gradio.
|
19 |
|
20 |
### Usage
|
21 |
|
@@ -39,13 +38,13 @@ Please credit the original authors of this project (Lennard Zündorf) and the cr
|
|
39 |
### Author
|
40 |
|
41 |
- Lennard Zündorf
|
42 | |
43 |
-
- [GitHub](https://
|
44 |
-
- [LinkedIn](https://
|
45 |
|
46 |
|
47 |
### University
|
48 |
Hochschule für Technik und Wirtschaft Berlin (HTW Berlin) - University of Applied Sciences for Engineering and Economics Berlin
|
49 |
|
50 |
-
|
51 |
-
2.
|
|
|
8 |
|
9 |
- [GitHub Repository](https://github.com/LennardZuendorf/thesis-webapp) - The GitHub repository of this project.
|
10 |
- [HTW Berlin](https://www.htw-berlin.de/) - The University I have built this project for, as part of my thesis.
|
|
|
11 |
|
12 |
|
13 |
## Implementation
|
14 |
|
15 |
+
This project is an implementation of PartitionSHAP into GODEL by Microsoft - [GODEL Model](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq) which is a generative seq2seq transformer fine-tuned for goal directed dialog. It supports context and knowledge base inputs.
|
16 |
|
17 |
+
The UI is build with Gradio, utilizing some custom components and FastAPI.
|
18 |
|
19 |
### Usage
|
20 |
|
|
|
38 |
### Author
|
39 |
|
40 |
- Lennard Zündorf
|
41 |
+
- [[email protected]](mailto:[email protected]) or [[email protected]](mailto:[email protected])
|
42 |
+
- [GitHub](https://zuendorf.me/github)
|
43 |
+
- [LinkedIn](https://zuendorf.me/github)
|
44 |
|
45 |
|
46 |
### University
|
47 |
Hochschule für Technik und Wirtschaft Berlin (HTW Berlin) - University of Applied Sciences for Engineering and Economics Berlin
|
48 |
|
49 |
+
Supervisor: Prof. Dr. Katarina Simbeck
|
50 |
+
2. Corrector : Prof. Dr. Axel Hochstein
|
public/credits_dataprotection_license.md
CHANGED
@@ -1,7 +1,4 @@
|
|
1 |
-
|
2 |
-
|
3 |
# Credits
|
4 |
-
For full credits, please refer to the [thesis print]()
|
5 |
|
6 |
### Models
|
7 |
This implementation is build on GODEL by Microsoft, Inc.
|
@@ -10,7 +7,7 @@ This implementation is build on GODEL by Microsoft, Inc.
|
|
10 |
GODEL is an open source model by Microsoft. See [offical paper](https://arxiv.org/abs/2206.11309) for more information.
|
11 |
|
12 |
- the version used in this project is GODEL Large, see [huggingface model hub](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq?text=Hey+my+name+is+Thomas%21+How+are+you%3F)
|
13 |
-
- the model as is a generative seq2seq transformer fine
|
14 |
- it supports context and knowledge base inputs
|
15 |
|
16 |
### Libraries
|
@@ -19,14 +16,14 @@ This project uses a number of open source libraries, only the most important one
|
|
19 |
##### Shap
|
20 |
This application uses a custom version of the shap library, which is available at [GitHub](https://github.com/shap/shap).
|
21 |
|
22 |
-
-
|
23 |
-
-
|
24 |
|
25 |
-
#####
|
26 |
-
This application uses
|
27 |
|
28 |
-
-
|
29 |
-
-
|
30 |
|
31 |
|
32 |
# Data Protection
|
|
|
|
|
|
|
1 |
# Credits
|
|
|
2 |
|
3 |
### Models
|
4 |
This implementation is build on GODEL by Microsoft, Inc.
|
|
|
7 |
GODEL is an open source model by Microsoft. See [offical paper](https://arxiv.org/abs/2206.11309) for more information.
|
8 |
|
9 |
- the version used in this project is GODEL Large, see [huggingface model hub](https://huggingface.co/microsoft/GODEL-v1_1-large-seq2seq?text=Hey+my+name+is+Thomas%21+How+are+you%3F)
|
10 |
+
- the model as is a generative seq2seq transformer fine-tuned for goal directed dialog
|
11 |
- it supports context and knowledge base inputs
|
12 |
|
13 |
### Libraries
|
|
|
16 |
##### Shap
|
17 |
This application uses a custom version of the shap library, which is available at [GitHub](https://github.com/shap/shap).
|
18 |
|
19 |
+
- Please refer to the [thesis-shap](https://github.com/LennardZuendorf/thesis-custom-shap) repository for more information about the changes made to the library, specifically the README file.
|
20 |
+
- The shap library and the used partition SHAP explainer are based on work by Lundberg et al. (2017), see [offical paper](https://arxiv.org/pdf/1705.07874.pdf) for more information.
|
21 |
|
22 |
+
##### Visualizations
|
23 |
+
This application uses attention visualization inspired by the bertviz library, which is available at[GitHub](https://github.com/jessevig/bertviz). It doesn't actually use BERTViz.
|
24 |
|
25 |
+
- The bertviz was introduced by Vig et al. (2019), see [offical paper](https://arxiv.org/pdf/1906.05714.pdf) for more information.
|
26 |
+
- This project only uses cross attention visualization with gradio and matplotlib.
|
27 |
|
28 |
|
29 |
# Data Protection
|