MostafaMSP's picture
Upload 86 files
6ecf14b verified

Part 2 - Creation of the Medical Dataset

back

In this part we are going to build the Datasets that will be used create the Medical Model

Once we have created our enviorment in the part 1. We will create our Dataset to create our model.

jupyter lab

image-20230820225439403

Let us go the the second folder called 2-data.

There we load the 2-Data.ipynb notebook

image-20230824182144129

This notebook will create the dataframes in csv format for each document that are int he folder Medical-Dialogue-System

C:.

├───data
│   ├───csv
│   ├───dialogue_0
│   ├───dialogue_1
│   ├───dialogue_2
│   ├───dialogue_3
│   ├───dialogue_4
│   
├───Medical-Dialogue-System
└───tools

and saved in the ./data./csv/

Then those csv will be cleaned and merged into single file called dialogues.csv

image-20230824232800691

This csv has 256916 dialogues between a Patient and Doctor.

In the following part we are going to build the model. 3-Modeling