diff --git "a/ipynb/llm/2. Hyper_Fine_tune_Llama_2_with_MCQ.ipynb" "b/ipynb/llm/2. Hyper_Fine_tune_Llama_2_with_MCQ.ipynb" --- "a/ipynb/llm/2. Hyper_Fine_tune_Llama_2_with_MCQ.ipynb" +++ "b/ipynb/llm/2. Hyper_Fine_tune_Llama_2_with_MCQ.ipynb" @@ -1 +1 @@ -{"cells":[{"cell_type":"markdown","source":["---"],"metadata":{"id":"LlkcVvTDs96X"}},{"cell_type":"markdown","source":["# MCQ Custom Dataset Creation\n","\n","## 1. Introduction\n","\n","High-quality data is fundamental to producing a good model: the better the data, the better the resulting model. The following steps outline the process of creating a dataset specifically for fine-tuning our Llama 2 model.\n","\n","\n","\n","![](https://i.imgur.com/IDNhAWH.png)\n","\n","\n","There are several types of datasets that can be used to fine-tune Large Language Models (LLMs):\n","\n","1. **Instruction Datasets:** These datasets contain direct instructions or prompts followed by the correct or expected outputs.\n","\n","2. **Raw Completion:** These datasets contain raw text without any instruction/response structure; the model is trained to predict the next token, as in continued pre-training.\n","\n","3. **Preference Datasets:** These datasets include human feedback in the form of preferences, where annotators compare pairs of model outputs to determine which is better.\n","\n","4. **Human Feedback Data:** This is specific to Reinforcement Learning from Human Feedback (RLHF) and involves direct feedback on the model's outputs from human annotators.\n","\n","5. **Demonstration Data:** Also used in RLHF, these datasets consist of examples showing ideal model outputs or actions, typically created by humans.\n","\n","6. **Reward Modeling Data:** Used to train a reward model in RLHF, this dataset pairs model outputs with actual human feedback so that the reward model learns to predict that feedback.\n","\n","7. 
**Dialogue Data:** Particularly relevant for conversational AI, this includes annotated conversations that indicate the quality of responses or provide corrections.\n","\n","\n","---\n","\n","\n","\n","* Typically, an instruction dataset is used for fine-tuning the Llama 2 model. Since we are focusing on supervised fine-tuning (SFT), the instruction dataset becomes our primary choice.\n","\n","Therefore, we have two options:\n","\n","1. Create our own instruction dataset.\n","2. Modify an existing instruction dataset, which involves filtering, modifying, and enriching it.\n","\n","We have decided to proceed with the first option: creating our own instruction dataset.\n","\n","* This will involve prompt engineering and incorporating sanity checks to ensure quality and relevance."],"metadata":{"id":"wAQMA1-DKZZ5"}},{"cell_type":"markdown","source":["## 2. Load and analyze the dataset"],"metadata":{"id":"hU_mUK-nol-t"}},{"cell_type":"code","execution_count":null,"metadata":{"id":"8P7g6eHuxxKe"},"outputs":[],"source":["# Install libraries\n","!pip install -q datasets transformers sentence_transformers faiss-gpu huggingface_hub"]},{"cell_type":"code","source":["# Import the required libraries\n","import json\n","import sys\n","import pandas as pd\n","from datasets import Dataset, DatasetDict, load_dataset\n","\n","from transformers import AutoTokenizer\n","import matplotlib.pyplot as plt\n","import seaborn as sns\n","\n","from sentence_transformers import SentenceTransformer\n","import faiss\n","from tqdm.autonotebook import tqdm\n","import numpy as np"],"metadata":{"id":"KKb-ikj4J-in"},"execution_count":null,"outputs":[]},{"cell_type":"code","source":["# Load JSON data from a file\n","with open(\"mcq_data.json\", \"r\") as f:\n","    data = json.load(f)\n","\n","# Create a pandas DataFrame from the list of dictionaries\n","df = pd.DataFrame(data)\n","\n","# Calculate the number of rows for each dataset split\n","num_rows = len(df)\n","train_end = int(num_rows * 0.8) # 80% for 
training\n","test_end = train_end + int(num_rows * 0.1) # 10% for testing\n","\n","# Split the DataFrame into training, testing, and validation sets\n","df_train = df[:train_end]\n","df_test = df[train_end:test_end]\n","df_val = df[test_end:] # Ensures the remainder is used for validation\n","\n","# Create Datasets from the DataFrames\n","dataset_train = Dataset.from_pandas(df_train)\n","dataset_test = Dataset.from_pandas(df_test)\n","dataset_val = Dataset.from_pandas(df_val)\n","\n","# Create a DatasetDict containing the split datasets\n","dataset = DatasetDict({\n"," 'train': dataset_train,\n"," 'test': dataset_test,\n"," 'val': dataset_val\n","})\n","\n","# Print the structure of the created DatasetDict\n","print(dataset)"],"metadata":{"id":"bGi9FdmhdBDg","colab":{"base_uri":"https://localhost:8080/"},"executionInfo":{"status":"ok","timestamp":1708322802114,"user_tz":-480,"elapsed":19,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"e9369555-5be7-4b43-a6c4-3defec1485d6"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["DatasetDict({\n"," train: Dataset({\n"," features: ['Instruction', 'Question', 'A', 'B', 'C', 'D', 'Correct Answer', 'Explanation'],\n"," num_rows: 334\n"," })\n"," test: Dataset({\n"," features: ['Instruction', 'Question', 'A', 'B', 'C', 'D', 'Correct Answer', 'Explanation'],\n"," num_rows: 41\n"," })\n"," val: Dataset({\n"," features: ['Instruction', 'Question', 'A', 'B', 'C', 'D', 'Correct Answer', 'Explanation'],\n"," num_rows: 43\n"," })\n","})\n"]}]},{"cell_type":"code","source":["# Read as pandas 
DataFrame\n","dataset['train'].to_pandas()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":719},"id":"-MOvcr5mD8li","executionInfo":{"status":"ok","timestamp":1708322802114,"user_tz":-480,"elapsed":18,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"5b69a1a1-5307-4ec8-9b04-2c5035176d98"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" Instruction \\\n","0 Create an MCQ on the structure of artificial n... \n","1 Create an MCQ on the training process of artif... \n","2 Create an MCQ on the role of artificial neuron... \n","3 Create an MCQ on the purpose of hidden layers ... \n","4 Create an MCQ on the basics of deep learning \n",".. ... \n","329 Create an MCQ on the hyperparameter 'Kernel' i... \n","330 Create an MCQ on the hyperparameter 'Gamma' in... \n","331 Create an MCQ on the hyperparameter 'learning_... \n","332 Create an MCQ on the hyperparameter 'n_estimat... \n","333 Create an MCQ on the application of deep learn... \n","\n"," Question \\\n","0 What is the structure of an artificial neural ... \n","1 What is the purpose of the training process in... \n","2 What is the role of artificial neurons in neur... \n","3 What is the purpose of hidden layers in artifi... \n","4 What is deep learning? \n",".. ... \n","329 What does the hyperparameter 'Kernel' define i... \n","330 What does the hyperparameter 'Gamma' control i... \n","331 What does the hyperparameter 'learning_rate' d... \n","332 What does the hyperparameter 'n_estimators' de... \n","333 Which of the following is an application of de... \n","\n"," A \\\n","0 It consists of input layers and hidden layers ... \n","1 To adjust the weights of the connections betwe... \n","2 To receive input from external sources \n","3 To receive input from external sources \n","4 A branch of machine learning based on artifici... \n",".. ... \n","329 The step size taken by the optimizer during ea... 
\n","330 The step size taken by the optimizer during ea... \n","331 The step size taken by the optimizer during ea... \n","332 The step size taken by the optimizer during ea... \n","333 Analyzing sensor data in autonomous vehicles \n","\n"," B \\\n","0 It consists of input layers, hidden layers, an... \n","1 To propagate input data forward through the la... \n","2 To compute the weighted total of inputs \n","3 To compute the weighted total of inputs \n","4 A programming technique to explicitly define c... \n",".. ... \n","329 The trade-off between the margin and the numbe... \n","330 The trade-off between the margin and the numbe... \n","331 The trade-off between the margin and the numbe... \n","332 The trade-off between the margin and the numbe... \n","333 Recognizing objects and scenes in images \n","\n"," C \\\n","0 It consists of input layers, hidden layers, ou... \n","1 To calculate the error between the output and ... \n","2 To transfer information to the next layer \n","3 To transfer information to the next layer \n","4 A method to process large datasets using deep ... \n",".. ... \n","329 The similarity between data points \n","330 The similarity between data points \n","331 The similarity between data points \n","332 The number of boosting trees to be trained \n","333 Transcribing spoken words into text \n","\n"," D Correct Answer \\\n","0 It consists of input layers, hidden layers, ou... C \n","1 To achieve the desired level of performance A \n","2 All of the above D \n","3 To process and transform the input data D \n","4 A type of data structure inspired by the human... A \n",".. ... ... \n","329 The maximum depth of each tree in the ensemble C \n","330 The influence of support vectors on the decisi... D \n","331 The maximum depth of each tree in the ensemble A \n","332 The maximum depth of each tree in the ensemble C \n","333 Making personalized recommendations based on u... 
B \n","\n"," Explanation \n","0 An artificial neural network consists of input... \n","1 The purpose of the training process in artific... \n","2 The role of artificial neurons in neural netwo... \n","3 The purpose of hidden layers in artificial neu... \n","4 Deep learning is a branch of machine learning ... \n",".. ... \n","329 The hyperparameter 'Kernel' in Support Vector ... \n","330 The hyperparameter 'Gamma' in Support Vector M... \n","331 The hyperparameter 'learning_rate' in XGBoost ... \n","332 The hyperparameter 'n_estimators' in XGBoost d... \n","333 Deep learning algorithms are used in image and... \n","\n","[334 rows x 8 columns]"],"text/html":["\n","
\n"],"application/vnd.google.colaboratory.intrinsic+json":{"type":"dataframe","summary":"{\n \"name\": \"dataset['train']\",\n \"rows\": 334,\n \"fields\": [\n {\n \"column\": \"Instruction\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"Create an MCQ on the parameter gamma in Support Vector Machines (SVMs)\",\n \"Create an MCQ on the disadvantages of Artificial Neural Networks (ANNs)\",\n \"Create an MCQ on the role of machine learning in recommendation systems\"\n ],\n \"num_unique_values\": 165,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Question\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"What does the hyperparameter 'Kernel' determine in Support Vector Machines (SVMs)?\",\n \"Which of the following are types of deep learning architectures?\",\n \"Which of the following is NOT an application of deep learning in reinforcement learning?\"\n ],\n \"num_unique_values\": 221,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"A\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"Requires large amounts of labeled data\",\n \"AI is the broader family consisting of ML and DL as its components\",\n \"Increased computational cost\"\n ],\n \"num_unique_values\": 162,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"B\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Data clustering, dimensionality reduction, and anomaly detection\",\n \"Analyzing medical images to assist doctors in making diagnoses\",\n \"The reliance on manual feature engineering\"\n ],\n \"num_unique_values\": 193,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"C\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"It may result in overfitting\",\n \"Reduced overfitting and underfitting\",\n \"To automatically learn features from visual data\"\n ],\n \"num_unique_values\": 
211,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"D\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Overfitting\",\n \"Evaluating all possible combinations of hyperparameter values\",\n \"A branch of machine learning that uses linear regression\"\n ],\n \"num_unique_values\": 211,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Correct Answer\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"A\",\n \"B\",\n \"C\"\n ],\n \"num_unique_values\": 4,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Explanation\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"A key difference between machine learning and deep learning is the type of algorithms used. Machine learning applies statistical algorithms, while deep learning utilizes artificial neural network architecture to learn patterns and relationships.\",\n \"Hyperparameter tuning helps reduce overfitting and underfitting, leading to improved model performance and generalizability.\",\n \"Artificial Intelligence consists of the components: Artificial Intelligence, Machine Learning, and Deep Learning.\"\n ],\n \"num_unique_values\": 302,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}"}},"metadata":{},"execution_count":19}]},{"cell_type":"code","source":["# Read as pandas DataFrame\n","dataset['test'].to_pandas()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":1000},"id":"7WkFWJSQhKUV","executionInfo":{"status":"ok","timestamp":1708322802115,"user_tz":-480,"elapsed":17,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"8fd77b53-8311-42f7-a84d-00e396647a72"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" Instruction \\\n","0 Create an MCQ on the use of deep learning in g... \n","1 Create an MCQ on the use of deep learning in a... 
\n","2 Create an MCQ on the application of deep learn... \n","3 Create an MCQ on the use of deep learning in r... \n","4 Create an MCQ on the structure of artificial n... \n","5 Create an MCQ on the purpose of adjusting weig... \n","6 Create an MCQ on the role of artificial neuron... \n","7 Create an MCQ on the complexities of neural ne... \n","8 Create an MCQ on the difference between machin... \n","9 Create an MCQ on the definition of deep learning \n","10 Create an MCQ on the key characteristic of dee... \n","11 Create an MCQ on the applications of deep lear... \n","12 Create an MCQ on the training requirements of ... \n","13 Create an MCQ on the types of machine learning... \n","14 Create an MCQ on the types of neural networks ... \n","15 Create an MCQ on the applications of deep lear... \n","16 Create an MCQ on the applications of deep lear... \n","17 Create an MCQ on the applications of deep lear... \n","18 Create an MCQ on the main types of neural netw... \n","19 Create an MCQ on the definition of Artificial ... \n","20 Create an MCQ on the difference between Machin... \n","21 Create an MCQ on the components of Artificial ... \n","22 Create an MCQ on the aim of Machine Learning \n","23 Create an MCQ on the aim of Deep Learning \n","24 Create an MCQ on the difference between AI, Ma... \n","25 Create an MCQ on the application of AI in spee... \n","26 Create an MCQ on the application of AI in pers... \n","27 Create an MCQ on the application of AI in pred... \n","28 Create an MCQ on the application of AI in medi... \n","29 Create an MCQ on the difference between AI, ML... \n","30 Create an MCQ on the responsibilities of an AI... \n","31 Create an MCQ on the skills required for a Mac... \n","32 Create an MCQ on the tasks of a Deep Learning ... \n","33 Create an MCQ on the difference between ML and DL \n","34 Create an MCQ on the advantages of Artificial ... \n","35 Create an MCQ on the disadvantages of Artifici... 
\n","36 Create an MCQ on the advantages of Biological ... \n","37 Create an MCQ on the disadvantages of Biologic... \n","38 Create an MCQ on the differences between Artif... \n","39 Create an MCQ on hyperparameter tuning in mach... \n","40 Create an MCQ on the types of hyperparameters ... \n","\n"," Question \\\n","0 What is the role of deep learning in generativ... \n","1 How is deep learning used in autonomous vehicles? \n","2 What is the role of deep learning in speech re... \n","3 What is the application of deep learning in re... \n","4 What is the structure of an artificial neural ... \n","5 What is the purpose of adjusting weights in ar... \n","6 What is the role of artificial neurons in neur... \n","7 What determines the complexities of neural net... \n","8 What is a key difference between machine learn... \n","9 What is the definition of deep learning? \n","10 What is the key characteristic of deep learning? \n","11 Which of the following are applications of dee... \n","12 What are the training requirements for deep ne... \n","13 Which types of machine learning tasks can be p... \n","14 Which type of neural network is specifically d... \n","15 Which application of deep learning in computer... \n","16 Which application of deep learning in NLP invo... \n","17 Which application of deep learning in reinforc... \n","18 Which of the following are the main types of n... \n","19 Which of the following best defines Artificial... \n","20 What is the main difference between Machine Le... \n","21 Which of the following components are part of ... \n","22 What is the aim of Machine Learning? \n","23 What is the aim of Deep Learning? \n","24 Which of the following best describes the diff... \n","25 Which of the following is an example of AI app... \n","26 Which of the following is an example of AI app... \n","27 Which of the following is an example of AI app... \n","28 Which of the following is an example of AI app... 
\n","29 Which of the following statements accurately d... \n","30 Which of the following is a key responsibility... \n","31 Which of the following skills is essential for... \n","32 Which of the following is a key task of a Deep... \n","33 What distinguishes Deep Learning (DL) from Mac... \n","34 Which of the following is an advantage of Arti... \n","35 Which of the following is a disadvantage of Ar... \n","36 Which of the following is an advantage of Biol... \n","37 Which of the following is a disadvantage of Bi... \n","38 Which of the following is a difference between... \n","39 What is the purpose of hyperparameter tuning i... \n","40 Which of the following is a type of hyperparam... \n","\n"," A \\\n","0 Analyzing sensor data in autonomous vehicles \n","1 Analyzing sensor data in autonomous vehicles \n","2 Analyzing sensor data in autonomous vehicles \n","3 Analyzing sensor data in autonomous vehicles \n","4 It consists of input layers, hidden layers, an... \n","5 To increase the speed of training models \n","6 To receive input from external sources \n","7 The number of layers in the network \n","8 The type of algorithms used \n","9 A branch of machine learning that uses artific... \n","10 The use of shallow neural networks with a sing... \n","11 Image recognition, natural language processing... \n","12 Small datasets and limited computational resou... \n","13 Supervised machine learning only \n","14 Feedforward Neural Networks (FNNs) \n","15 Object detection and recognition \n","16 Automatic Text Generation \n","17 Game playing \n","18 Feedforward Neural Networks (FNNs) \n","19 The study of training machines to mimic human ... \n","20 Machine Learning uses statistical methods, whi... \n","21 Machine Learning and Deep Learning \n","22 To increase chances of success \n","23 To increase chances of success \n","24 AI is a subset of Machine Learning, which is a... \n","25 Analyzing users' browsing and viewing history ... 
\n","26 Analyzing users' browsing and viewing history ... \n","27 Analyzing users' browsing and viewing history ... \n","28 Analyzing users' browsing and viewing history ... \n","29 AI, ML, and DL are interchangeable terms that ... \n","30 Design and development of AI algorithms \n","31 Strong background in computer science, mathema... \n","32 Design and development of DL algorithms \n","33 DL is a more advanced form of ML that can perf... \n","34 Ability to learn irrespective of the type of data \n","35 Ability to learn irrespective of the type of data \n","36 Ability to learn irrespective of the type of data \n","37 Ability to learn irrespective of the type of data \n","38 Both ANNs and BNNs have complex and diverse ne... \n","39 To adjust the weights and biases of the model \n","40 Weights \n","\n"," B \\\n","0 Creating new content based on existing data \n","1 Recognizing objects and scenes in images \n","2 Recognizing objects and scenes in images \n","3 Recognizing objects and scenes in images \n","4 It consists of input layers and output layers ... \n","5 To prevent overfitting by validating the model... \n","6 To compute the weighted total of inputs \n","7 The number of units in each layer \n","8 The amount of data required \n","9 A type of programming that explicitly defines ... \n","10 The requirement for manual feature engineering \n","11 Data clustering, dimensionality reduction, and... \n","12 Large amounts of data and computational resources \n","13 Unsupervised machine learning only \n","14 Convolutional Neural Networks (CNNs) \n","15 Image classification \n","16 Language translation \n","17 Robotics \n","18 Convolutional Neural Networks (CNNs) \n","19 The study of statistical methods enabling mach... \n","20 Machine Learning focuses on learning from expe... \n","21 Machine Learning and Decision Trees \n","22 To increase accuracy \n","23 To increase accuracy \n","24 Machine Learning is a subset of AI, which is a... 
\n","25 Analyzing medical images to assist doctors in ... \n","26 Analyzing medical images to assist doctors in ... \n","27 Analyzing medical images to assist doctors in ... \n","28 Analyzing medical images to assist doctors in ... \n","29 AI focuses on creating intelligent machines, M... \n","30 Analysis and interpretation of data \n","31 Experience in developing AI algorithms and sol... \n","32 Analysis and interpretation of data \n","33 DL focuses on developing algorithms that enabl... \n","34 Simple architecture that makes it easy to expl... \n","35 Simple architecture that makes it easy to expl... \n","36 Simple architecture that makes it easy to expl... \n","37 Simple architecture that makes it easy to expl... \n","38 ANNs have fixed connections between neurons, w... \n","39 To select the optimal values for the model's h... \n","40 Biases \n","\n"," C \\\n","0 Transcribing spoken words into text \n","1 Transcribing spoken words into text \n","2 Transcribing spoken words into text \n","3 Making personalized recommendations based on u... \n","4 It consists of input layers, hidden layers, ou... \n","5 To enhance the model's performance on the trai... \n","6 To transfer information to the next layer \n","7 The type of activation function used \n","8 The complexity of the models \n","9 A technique that requires manual feature engin... \n","10 The use of deep neural networks with multiple ... \n","11 Supervised machine learning and unsupervised m... \n","12 Manual feature engineering and domain expertise \n","13 Reinforcement machine learning only \n","14 Recurrent Neural Networks (RNNs) \n","15 Image segmentation \n","16 Sentiment analysis \n","17 Control systems \n","18 Recurrent Neural Networks (RNNs) \n","19 The study that uses neural networks to imitate... 
\n","20 Machine Learning is a subset of Deep Learning \n","21 Artificial Intelligence and Machine Learning \n","22 To improve system efficiency \n","23 To improve system efficiency \n","24 Deep Learning is a subset of AI, which is a su... \n","25 Recognizing and classifying images and speech \n","26 Recognizing and classifying images and speech \n","27 Analyzing sensor data to predict equipment fai... \n","28 Recognizing and classifying images and speech \n","29 AI is a subset of ML that uses neural networks... \n","30 Training and evaluation of ML models \n","31 Familiarity with programming languages such as... \n","32 Training and evaluation of ML models \n","33 DL is a subset of ML that uses neural networks... \n","34 Dependence on hardware for functioning \n","35 Dependence on hardware for functioning \n","36 No controlling mechanism \n","37 No controlling mechanism \n","38 Both ANNs and BNNs have simple and predetermin... \n","39 To preprocess the input data before training t... \n","40 Learning rate \n","\n"," D Correct Answer \\\n","0 Making personalized recommendations based on u... B \n","1 Making personalized recommendations based on u... A \n","2 Making personalized recommendations based on u... C \n","3 Transcribing spoken words into text C \n","4 It consists of input layers and artificial neu... A \n","5 To reduce the computational cost of training C \n","6 All of the above D \n","7 The size of the dataset B \n","8 The performance on complex tasks B \n","9 A method of machine learning that only works w... A \n","10 The reliance on labeled datasets for training C \n","11 Data visualization and exploratory data analysis A \n","12 Pre-trained models and transfer learning B \n","13 Supervised, unsupervised, and reinforcement ma... D \n","14 None of the above B \n","15 None of the above A \n","16 Speech recognition C \n","17 None of the above B \n","18 All of the above D \n","19 The study of incorporating human intelligence ... 
D \n","20 Machine Learning requires human intervention, ... A \n","21 Artificial Intelligence and Deep Learning C \n","22 To analyze data and provide output B \n","23 To analyze data and provide output A \n","24 AI, Machine Learning, and Deep Learning are co... B \n","25 Analyzing sensor data to make decisions about ... C \n","26 Analyzing sensor data to make decisions about ... A \n","27 Recognizing and classifying images and speech C \n","28 Analyzing sensor data to make decisions about ... B \n","29 AI focuses on developing algorithms that enabl... B \n","30 Deployment and maintenance of DL models A \n","31 All of the above D \n","32 Deployment and maintenance of AI models A \n","33 ML is a more advanced form of DL that can perf... C \n","34 High speed of processing A \n","35 The simplest architecture makes it difficult t... D \n","36 Ability to process highly complex parallel inputs D \n","37 Speed of processing is slow D \n","38 ANNs and BNNs have the same processing speed B \n","39 To evaluate the performance of the model on a ... B \n","40 Activation function C \n","\n"," Explanation \n","0 Deep learning algorithms are used in generativ... \n","1 Deep learning algorithms are used in autonomou... \n","2 Deep learning algorithms are used in speech re... \n","3 Deep learning algorithms are used in recommend... \n","4 An artificial neural network consists of input... \n","5 The purpose of adjusting weights in artificial... \n","6 The role of artificial neurons in neural netwo... \n","7 The complexities of neural networks are determ... \n","8 A key difference between machine learning and ... \n","9 Deep learning is a branch of machine learning ... \n","10 The key characteristic of deep learning is the... \n","11 Deep learning has achieved significant success... \n","12 Training deep neural networks typically requir... \n","13 Deep learning can be used for supervised, unsu... \n","14 Convolutional Neural Networks (CNNs) are speci... 
\n","15 Object detection and recognition is the applic... \n","16 Sentiment analysis is the application of deep ... \n","17 Robotics is the application of deep learning i... \n","18 The main types of neural networks used in deep... \n","19 Artificial Intelligence is the mechanism to in... \n","20 The main difference between Machine Learning a... \n","21 Artificial Intelligence is the broader family ... \n","22 The aim of Machine Learning is to increase acc... \n","23 The aim of Deep Learning is to increase chance... \n","24 AI is the broader concept that encompasses the... \n","25 Speech recognition is an example of AI applica... \n","26 Personalized recommendations, as an AI applica... \n","27 AI-powered predictive maintenance systems anal... \n","28 AI-powered medical diagnosis systems analyze m... \n","29 AI, ML, and DL are related but distinct concep... \n","30 One of the key responsibilities of an AI Engin... \n","31 A Machine Learning Engineer should have a stro... \n","32 One of the key tasks of a Deep Learning Engine... \n","33 Deep Learning (DL) is a subset of Machine Lear... \n","34 One of the advantages of Artificial Neural Net... \n","35 One of the disadvantages of Artificial Neural ... \n","36 One of the advantages of Biological Neural Net... \n","37 One of the disadvantages of Biological Neural ... \n","38 One of the differences between Artificial Neur... \n","39 Hyperparameter tuning is the process of select... \n","40 In neural networks, the learning rate is a hyp... "],"text/html":["\n","
<table>\n","
<thead>\n","
<tr><th></th><th>Instruction</th><th>Question</th><th>A</th><th>B</th><th>C</th><th>D</th><th>Correct Answer</th><th>Explanation</th></tr>\n","
</thead>\n","
<tbody>\n","
<tr><th>0</th><td>Create an MCQ on the use of deep learning in g...</td><td>What is the role of deep learning in generativ...</td><td>Analyzing sensor data in autonomous vehicles</td><td>Creating new content based on existing data</td><td>Transcribing spoken words into text</td><td>Making personalized recommendations based on u...</td><td>B</td><td>Deep learning algorithms are used in generativ...</td></tr>\n","
<tr><th>1</th><td>Create an MCQ on the use of deep learning in a...</td><td>How is deep learning used in autonomous vehicles?</td><td>Analyzing sensor data in autonomous vehicles</td><td>Recognizing objects and scenes in images</td><td>Transcribing spoken words into text</td><td>Making personalized recommendations based on u...</td><td>A</td><td>Deep learning algorithms are used in autonomou...</td></tr>\n","
<tr><th>2</th><td>Create an MCQ on the application of deep learn...</td><td>What is the role of deep learning in speech re...</td><td>Analyzing sensor data in autonomous vehicles</td><td>Recognizing objects and scenes in images</td><td>Transcribing spoken words into text</td><td>Making personalized recommendations based on u...</td><td>C</td><td>Deep learning algorithms are used in speech re...</td></tr>\n","
<tr><th>3</th><td>Create an MCQ on the use of deep learning in r...</td><td>What is the application of deep learning in re...</td><td>Analyzing sensor data in autonomous vehicles</td><td>Recognizing objects and scenes in images</td><td>Making personalized recommendations based on u...</td><td>Transcribing spoken words into text</td><td>C</td><td>Deep learning algorithms are used in recommend...</td></tr>\n","
<tr><th>4</th><td>Create an MCQ on the structure of artificial n...</td><td>What is the structure of an artificial neural ...</td><td>It consists of input layers, hidden layers, an...</td><td>It consists of input layers and output layers ...</td><td>It consists of input layers, hidden layers, ou...</td><td>It consists of input layers and artificial neu...</td><td>A</td><td>An artificial neural network consists of input...</td></tr>\n","
<tr><th>5</th><td>Create an MCQ on the purpose of adjusting weig...</td><td>What is the purpose of adjusting weights in ar...</td><td>To increase the speed of training models</td><td>To prevent overfitting by validating the model...</td><td>To enhance the model's performance on the trai...</td><td>To reduce the computational cost of training</td><td>C</td><td>The purpose of adjusting weights in artificial...</td></tr>\n","
<tr><th>6</th><td>Create an MCQ on the role of artificial neuron...</td><td>What is the role of artificial neurons in neur...</td><td>To receive input from external sources</td><td>To compute the weighted total of inputs</td><td>To transfer information to the next layer</td><td>All of the above</td><td>D</td><td>The role of artificial neurons in neural netwo...</td></tr>\n","
<tr><th>7</th><td>Create an MCQ on the complexities of neural ne...</td><td>What determines the complexities of neural net...</td><td>The number of layers in the network</td><td>The number of units in each layer</td><td>The type of activation function used</td><td>The size of the dataset</td><td>B</td><td>The complexities of neural networks are determ...</td></tr>\n","
<tr><th>8</th><td>Create an MCQ on the difference between machin...</td><td>What is a key difference between machine learn...</td><td>The type of algorithms used</td><td>The amount of data required</td><td>The complexity of the models</td><td>The performance on complex tasks</td><td>B</td><td>A key difference between machine learning and ...</td></tr>\n","
<tr><th>9</th><td>Create an MCQ on the definition of deep learning</td><td>What is the definition of deep learning?</td><td>A branch of machine learning that uses artific...</td><td>A type of programming that explicitly defines ...</td><td>A technique that requires manual feature engin...</td><td>A method of machine learning that only works w...</td><td>A</td><td>Deep learning is a branch of machine learning ...</td></tr>\n","
<tr><th>10</th><td>Create an MCQ on the key characteristic of dee...</td><td>What is the key characteristic of deep learning?</td><td>The use of shallow neural networks with a sing...</td><td>The requirement for manual feature engineering</td><td>The use of deep neural networks with multiple ...</td><td>The reliance on labeled datasets for training</td><td>C</td><td>The key characteristic of deep learning is the...</td></tr>\n","
<tr><th>11</th><td>Create an MCQ on the applications of deep lear...</td><td>Which of the following are applications of dee...</td><td>Image recognition, natural language processing...</td><td>Data clustering, dimensionality reduction, and...</td><td>Supervised machine learning and unsupervised m...</td><td>Data visualization and exploratory data analysis</td><td>A</td><td>Deep learning has achieved significant success...</td></tr>\n","
<tr><th>12</th><td>Create an MCQ on the training requirements of ...</td><td>What are the training requirements for deep ne...</td><td>Small datasets and limited computational resou...</td><td>Large amounts of data and computational resources</td><td>Manual feature engineering and domain expertise</td><td>Pre-trained models and transfer learning</td><td>B</td><td>Training deep neural networks typically requir...</td></tr>\n","
<tr><th>13</th><td>Create an MCQ on the types of machine learning...</td><td>Which types of machine learning tasks can be p...</td><td>Supervised machine learning only</td><td>Unsupervised machine learning only</td><td>Reinforcement machine learning only</td><td>Supervised, unsupervised, and reinforcement ma...</td><td>D</td><td>Deep learning can be used for supervised, unsu...</td></tr>\n","
<tr><th>14</th><td>Create an MCQ on the types of neural networks ...</td><td>Which type of neural network is specifically d...</td><td>Feedforward Neural Networks (FNNs)</td><td>Convolutional Neural Networks (CNNs)</td><td>Recurrent Neural Networks (RNNs)</td><td>None of the above</td><td>B</td><td>Convolutional Neural Networks (CNNs) are speci...</td></tr>\n","
<tr><th>15</th><td>Create an MCQ on the applications of deep lear...</td><td>Which application of deep learning in computer...</td><td>Object detection and recognition</td><td>Image classification</td><td>Image segmentation</td><td>None of the above</td><td>A</td><td>Object detection and recognition is the applic...</td></tr>\n","
<tr><th>16</th><td>Create an MCQ on the applications of deep lear...</td><td>Which application of deep learning in NLP invo...</td><td>Automatic Text Generation</td><td>Language translation</td><td>Sentiment analysis</td><td>Speech recognition</td><td>C</td><td>Sentiment analysis is the application of deep ...</td></tr>\n","
<tr><th>17</th><td>Create an MCQ on the applications of deep lear...</td><td>Which application of deep learning in reinforc...</td><td>Game playing</td><td>Robotics</td><td>Control systems</td><td>None of the above</td><td>B</td><td>Robotics is the application of deep learning i...</td></tr>\n","
<tr><th>18</th><td>Create an MCQ on the main types of neural netw...</td><td>Which of the following are the main types of n...</td><td>Feedforward Neural Networks (FNNs)</td><td>Convolutional Neural Networks (CNNs)</td><td>Recurrent Neural Networks (RNNs)</td><td>All of the above</td><td>D</td><td>The main types of neural networks used in deep...</td></tr>\n","
<tr><th>19</th><td>Create an MCQ on the definition of Artificial ...</td><td>Which of the following best defines Artificial...</td><td>The study of training machines to mimic human ...</td><td>The study of statistical methods enabling mach...</td><td>The study that uses neural networks to imitate...</td><td>The study of incorporating human intelligence ...</td><td>D</td><td>Artificial Intelligence is the mechanism to in...</td></tr>\n","
<tr><th>20</th><td>Create an MCQ on the difference between Machin...</td><td>What is the main difference between Machine Le...</td><td>Machine Learning uses statistical methods, whi...</td><td>Machine Learning focuses on learning from expe...</td><td>Machine Learning is a subset of Deep Learning</td><td>Machine Learning requires human intervention, ...</td><td>A</td><td>The main difference between Machine Learning a...</td></tr>\n","
<tr><th>21</th><td>Create an MCQ on the components of Artificial ...</td><td>Which of the following components are part of ...</td><td>Machine Learning and Deep Learning</td><td>Machine Learning and Decision Trees</td><td>Artificial Intelligence and Machine Learning</td><td>Artificial Intelligence and Deep Learning</td><td>C</td><td>Artificial Intelligence is the broader family ...</td></tr>\n","
<tr><th>22</th><td>Create an MCQ on the aim of Machine Learning</td><td>What is the aim of Machine Learning?</td><td>To increase chances of success</td><td>To increase accuracy</td><td>To improve system efficiency</td><td>To analyze data and provide output</td><td>B</td><td>The aim of Machine Learning is to increase acc...</td></tr>\n","
<tr><th>23</th><td>Create an MCQ on the aim of Deep Learning</td><td>What is the aim of Deep Learning?</td><td>To increase chances of success</td><td>To increase accuracy</td><td>To improve system efficiency</td><td>To analyze data and provide output</td><td>A</td><td>The aim of Deep Learning is to increase chance...</td></tr>\n","
<tr><th>24</th><td>Create an MCQ on the difference between AI, Ma...</td><td>Which of the following best describes the diff...</td><td>AI is a subset of Machine Learning, which is a...</td><td>Machine Learning is a subset of AI, which is a...</td><td>Deep Learning is a subset of AI, which is a su...</td><td>AI, Machine Learning, and Deep Learning are co...</td><td>B</td><td>AI is the broader concept that encompasses the...</td></tr>\n","
<tr><th>25</th><td>Create an MCQ on the application of AI in spee...</td><td>Which of the following is an example of AI app...</td><td>Analyzing users' browsing and viewing history ...</td><td>Analyzing medical images to assist doctors in ...</td><td>Recognizing and classifying images and speech</td><td>Analyzing sensor data to make decisions about ...</td><td>C</td><td>Speech recognition is an example of AI applica...</td></tr>\n","
<tr><th>26</th><td>Create an MCQ on the application of AI in pers...</td><td>Which of the following is an example of AI app...</td><td>Analyzing users' browsing and viewing history ...</td><td>Analyzing medical images to assist doctors in ...</td><td>Recognizing and classifying images and speech</td><td>Analyzing sensor data to make decisions about ...</td><td>A</td><td>Personalized recommendations, as an AI applica...</td></tr>\n","
<tr><th>27</th><td>Create an MCQ on the application of AI in pred...</td><td>Which of the following is an example of AI app...</td><td>Analyzing users' browsing and viewing history ...</td><td>Analyzing medical images to assist doctors in ...</td><td>Analyzing sensor data to predict equipment fai...</td><td>Recognizing and classifying images and speech</td><td>C</td><td>AI-powered predictive maintenance systems anal...</td></tr>\n","
<tr><th>28</th><td>Create an MCQ on the application of AI in medi...</td><td>Which of the following is an example of AI app...</td><td>Analyzing users' browsing and viewing history ...</td><td>Analyzing medical images to assist doctors in ...</td><td>Recognizing and classifying images and speech</td><td>Analyzing sensor data to make decisions about ...</td><td>B</td><td>AI-powered medical diagnosis systems analyze m...</td></tr>\n","
<tr><th>29</th><td>Create an MCQ on the difference between AI, ML...</td><td>Which of the following statements accurately d...</td><td>AI, ML, and DL are interchangeable terms that ...</td><td>AI focuses on creating intelligent machines, M...</td><td>AI is a subset of ML that uses neural networks...</td><td>AI focuses on developing algorithms that enabl...</td><td>B</td><td>AI, ML, and DL are related but distinct concep...</td></tr>\n","
<tr><th>30</th><td>Create an MCQ on the responsibilities of an AI...</td><td>Which of the following is a key responsibility...</td><td>Design and development of AI algorithms</td><td>Analysis and interpretation of data</td><td>Training and evaluation of ML models</td><td>Deployment and maintenance of DL models</td><td>A</td><td>One of the key responsibilities of an AI Engin...</td></tr>\n","
<tr><th>31</th><td>Create an MCQ on the skills required for a Mac...</td><td>Which of the following skills is essential for...</td><td>Strong background in computer science, mathema...</td><td>Experience in developing AI algorithms and sol...</td><td>Familiarity with programming languages such as...</td><td>All of the above</td><td>D</td><td>A Machine Learning Engineer should have a stro...</td></tr>\n","
<tr><th>32</th><td>Create an MCQ on the tasks of a Deep Learning ...</td><td>Which of the following is a key task of a Deep...</td><td>Design and development of DL algorithms</td><td>Analysis and interpretation of data</td><td>Training and evaluation of ML models</td><td>Deployment and maintenance of AI models</td><td>A</td><td>One of the key tasks of a Deep Learning Engine...</td></tr>\n","
<tr><th>33</th><td>Create an MCQ on the difference between ML and DL</td><td>What distinguishes Deep Learning (DL) from Mac...</td><td>DL is a more advanced form of ML that can perf...</td><td>DL focuses on developing algorithms that enabl...</td><td>DL is a subset of ML that uses neural networks...</td><td>ML is a more advanced form of DL that can perf...</td><td>C</td><td>Deep Learning (DL) is a subset of Machine Lear...</td></tr>\n","
<tr><th>34</th><td>Create an MCQ on the advantages of Artificial ...</td><td>Which of the following is an advantage of Arti...</td><td>Ability to learn irrespective of the type of data</td><td>Simple architecture that makes it easy to expl...</td><td>Dependence on hardware for functioning</td><td>High speed of processing</td><td>A</td><td>One of the advantages of Artificial Neural Net...</td></tr>\n","
<tr><th>35</th><td>Create an MCQ on the disadvantages of Artifici...</td><td>Which of the following is a disadvantage of Ar...</td><td>Ability to learn irrespective of the type of data</td><td>Simple architecture that makes it easy to expl...</td><td>Dependence on hardware for functioning</td><td>The simplest architecture makes it difficult t...</td><td>D</td><td>One of the disadvantages of Artificial Neural ...</td></tr>\n","
<tr><th>36</th><td>Create an MCQ on the advantages of Biological ...</td><td>Which of the following is an advantage of Biol...</td><td>Ability to learn irrespective of the type of data</td><td>Simple architecture that makes it easy to expl...</td><td>No controlling mechanism</td><td>Ability to process highly complex parallel inputs</td><td>D</td><td>One of the advantages of Biological Neural Net...</td></tr>\n","
<tr><th>37</th><td>Create an MCQ on the disadvantages of Biologic...</td><td>Which of the following is a disadvantage of Bi...</td><td>Ability to learn irrespective of the type of data</td><td>Simple architecture that makes it easy to expl...</td><td>No controlling mechanism</td><td>Speed of processing is slow</td><td>D</td><td>One of the disadvantages of Biological Neural ...</td></tr>\n","
<tr><th>38</th><td>Create an MCQ on the differences between Artif...</td><td>Which of the following is a difference between...</td><td>Both ANNs and BNNs have complex and diverse ne...</td><td>ANNs have fixed connections between neurons, w...</td><td>Both ANNs and BNNs have simple and predetermin...</td><td>ANNs and BNNs have the same processing speed</td><td>B</td><td>One of the differences between Artificial Neur...</td></tr>\n","
<tr><th>39</th><td>Create an MCQ on hyperparameter tuning in mach...</td><td>What is the purpose of hyperparameter tuning i...</td><td>To adjust the weights and biases of the model</td><td>To select the optimal values for the model's h...</td><td>To preprocess the input data before training t...</td><td>To evaluate the performance of the model on a ...</td><td>B</td><td>Hyperparameter tuning is the process of select...</td></tr>\n","
<tr><th>40</th><td>Create an MCQ on the types of hyperparameters ...</td><td>Which of the following is a type of hyperparam...</td><td>Weights</td><td>Biases</td><td>Learning rate</td><td>Activation function</td><td>C</td><td>In neural networks, the learning rate is a hyp...</td></tr>\n","
</tbody>\n","
</table>
\n"],"application/vnd.google.colaboratory.intrinsic+json":{"type":"dataframe","summary":"{\n \"name\": \"dataset['test']\",\n \"rows\": 41,\n \"fields\": [\n {\n \"column\": \"Instruction\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Create an MCQ on the difference between AI, Machine Learning, and Deep Learning\",\n \"Create an MCQ on the types of machine learning tasks that can be performed using deep learning\",\n \"Create an MCQ on the difference between machine learning and deep learning\"\n ],\n \"num_unique_values\": 41,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Question\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Which of the following best describes the difference between AI, Machine Learning, and Deep Learning?\",\n \"Which types of machine learning tasks can be performed using deep learning?\",\n \"What is a key difference between machine learning and deep learning?\"\n ],\n \"num_unique_values\": 41,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"A\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Both ANNs and BNNs have complex and diverse neurons\",\n \"The study of training machines to mimic human behavior\",\n \"Strong background in computer science, mathematics, and statistics\"\n ],\n \"num_unique_values\": 30,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"B\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"ANNs have fixed connections between neurons, while BNNs have flexible connections\",\n \"Robotics\",\n \"Analysis and interpretation of data\"\n ],\n \"num_unique_values\": 30,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"C\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Both ANNs and BNNs have simple and predetermined neural pathways\",\n \"Control systems\",\n \"Training and evaluation of ML 
models\"\n ],\n \"num_unique_values\": 32,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"D\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"ANNs and BNNs have the same processing speed\",\n \"Machine Learning requires human intervention, while Deep Learning does not\",\n \"ML is a more advanced form of DL that can perform complex tasks.\"\n ],\n \"num_unique_values\": 32,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Correct Answer\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"A\",\n \"D\",\n \"B\"\n ],\n \"num_unique_values\": 4,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Explanation\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"AI is the broader concept that encompasses the development of computer systems that can perform tasks requiring human intelligence. Machine Learning is a subset of AI, focused on algorithms that can learn from data and make predictions or decisions. Deep Learning is a subset of Machine Learning, specifically using neural networks with multiple layers to learn and represent complex patterns.\",\n \"Deep learning can be used for supervised, unsupervised, as well as reinforcement machine learning tasks. It provides a versatile approach to process and learn from data in various learning scenarios.\",\n \"A key difference between machine learning and deep learning is the amount of data required. 
Machine learning can work with a smaller amount of data, while deep learning requires a larger volume of data to train the complex neural network architectures effectively.\"\n ],\n \"num_unique_values\": 41,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}"}},"metadata":{},"execution_count":20}]},{"cell_type":"code","source":["# Read as pandas DataFrame\n","dataset['val'].to_pandas()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":1000},"id":"03oFFWL4hM4S","executionInfo":{"status":"ok","timestamp":1708322802115,"user_tz":-480,"elapsed":16,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"284e74e0-5907-4982-cd82-31dfc59696e3"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" Instruction \\\n","0 Create an MCQ on the impact of learning rate o... \n","1 Create an MCQ on the impact of number of epoch... \n","2 Create an MCQ on the impact of architecture on... \n","3 Create an MCQ on the strategies for hyperparam... \n","4 Create an MCQ on the drawback of GridSearchCV \n","5 Create an MCQ on the strategy that selects val... \n","6 Create an MCQ on the advantage of RandomizedSe... \n","7 Create an MCQ on the strategy that treats hype... \n","8 Create an MCQ on the challenges in deep learning \n","9 Create an MCQ on the advantages of deep learning \n","10 Create an MCQ on the disadvantages of deep lea... \n","11 Create an MCQ on the challenges in interpretin... \n","12 Create an MCQ on the risk of overfitting in de... \n","13 Create an MCQ on the application of machine le... \n","14 Create an MCQ on the use of machine learning i... \n","15 Create an MCQ on the role of machine learning ... \n","16 Create an MCQ on the application of machine le... \n","17 Create an MCQ on the use of machine learning i... \n","18 Create an MCQ on the regularization parameter ... 
\n","19 Create an MCQ on the kernel function in SVMs \n","20 Create an MCQ on the parameter that controls t... \n","21 Create an MCQ on the learning rate hyperparame... \n","22 Create an MCQ on the max_depth hyperparameter ... \n","23 Create an MCQ on the structure of artificial n... \n","24 Create an MCQ on the training process of artif... \n","25 Create an MCQ on the differences between machi... \n","26 Create an MCQ on the complexity of interpretin... \n","27 Create an MCQ on the computing power requireme... \n","28 Create an MCQ on the definition of deep learning \n","29 Create an MCQ on the key characteristic of dee... \n","30 Create an MCQ on the success of deep learning ... \n","31 Create an MCQ on the requirements for training... \n","32 Create an MCQ on the types of machine learning... \n","33 Create an MCQ on the types of neural networks ... \n","34 Create an MCQ on the applications of deep lear... \n","35 Create an MCQ on the applications of deep lear... \n","36 Create an MCQ on the applications of deep lear... \n","37 Create an MCQ on the main purpose of deep lear... \n","38 Create an MCQ on the definition of Artificial ... \n","39 Create an MCQ on the definition of Machine Lea... \n","40 Create an MCQ on the definition of Deep Learning. \n","41 Create an MCQ on the relationship between Arti... \n","42 Create an MCQ on the aim of Deep Learning. \n","\n"," Question \\\n","0 What impact does the learning rate hyperparame... \n","1 How does the number of epochs hyperparameter a... \n","2 How does the architecture of a neural network ... \n","3 Which of the following strategies is considere... \n","4 What is a drawback of using GridSearchCV for h... \n","5 Which hyperparameter tuning strategy selects v... \n","6 What is an advantage of using RandomizedSearch... \n","7 Which hyperparameter tuning strategy treats th... \n","8 What is one of the challenges in deep learning? \n","9 What is one of the advantages of deep learning? 
\n","10 What is one of the disadvantages of deep learn... \n","11 What is one of the challenges in interpreting ... \n","12 What is the risk associated with overfitting i... \n","13 In which of the following applications are mac... \n","14 Which of the following applications utilize ma... \n","15 Which of the following applications involve th... \n","16 Which of the following applications utilize ma... \n","17 In which of the following applications are mac... \n","18 What is the role of the regularization paramet... \n","19 What is the purpose of the kernel function in ... \n","20 Which parameter controls the influence of supp... \n","21 What does the learning rate hyperparameter det... \n","22 What does the max_depth hyperparameter determi... \n","23 Which layer of an artificial neural network re... \n","24 What is adjusted during the training process o... \n","25 Which of the following requires a larger volum... \n","26 Which of the following is true regarding the i... \n","27 Which of the following requires a high-perform... \n","28 What is deep learning? \n","29 What is the key characteristic of deep learning? \n","30 In which fields has deep learning achieved sig... \n","31 What are the requirements for training deep ne... \n","32 Which types of machine learning are used in de... \n","33 Which type of neural network is specifically d... \n","34 What is one of the main applications of deep l... \n","35 What is one of the main applications of deep l... \n","36 What is one of the main applications of deep l... \n","37 What is the main purpose of deep learning mode... \n","38 What is the definition of Artificial Intellige... \n","39 What is the definition of Machine Learning? \n","40 What is the definition of Deep Learning? \n","41 What is the relationship between Artificial In... \n","42 What is the aim of Deep Learning? \n","\n"," A \\\n","0 It determines the number of epochs needed for ... \n","1 Increasing the number of epochs always improve... 
\n","2 The architecture determines the learning rate ... \n","3 GridSearchCV \n","4 It is computationally expensive \n","5 GridSearchCV \n","6 It is computationally faster \n","7 GridSearchCV \n","8 Limited computational resources \n","9 Low accuracy \n","10 Low computational requirements \n","11 Easy interpretability \n","12 Improved performance on new data \n","13 Self-driving cars \n","14 Virtual assistants like Siri and Alexa \n","15 Chatbots \n","16 E-commerce sites \n","17 Social media monitoring \n","18 To control the trade-off between the margin an... \n","19 To control the trade-off between the margin an... \n","20 Regularization parameter (C) \n","21 The step size taken by the optimizer during ea... \n","22 The step size taken by the optimizer during ea... \n","23 Output layer \n","24 Weights \n","25 Machine learning \n","26 Machine learning results are easy to interpret \n","27 Machine learning \n","28 A branch of machine learning that uses artific... \n","29 The use of deep neural networks with multiple ... \n","30 Image recognition, natural language processing... \n","31 A large amount of data and computational resou... \n","32 Supervised, unsupervised, and reinforcement le... \n","33 Feedforward Neural Networks (FNNs) \n","34 Speech recognition \n","35 Object detection and recognition \n","36 Sentiment analysis \n","37 To analyze the sentiment of text \n","38 The study of training machines to mimic human ... \n","39 The study of training machines to mimic human ... \n","40 The study of training machines to mimic human ... \n","41 AI is a subset of ML \n","42 To increase chances of success \n","\n"," B \\\n","0 It controls the step size taken by the optimiz... \n","1 Increasing the number of epochs can lead to ov... \n","2 The architecture controls the step size taken ... 
\n","3 RandomizedSearchCV \n","4 It requires expert knowledge \n","5 RandomizedSearchCV \n","6 It guarantees optimal performance \n","7 RandomizedSearchCV \n","8 Easy interpretability of results \n","9 Manual feature engineering \n","10 Small amount of labeled data \n","11 Clear decision-making process \n","12 No impact on model performance \n","13 Security systems \n","14 Call centers \n","15 Virtual assistants \n","16 Streaming services \n","17 Sentiment analysis systems \n","18 To define the similarity between data points \n","19 To define the similarity between data points \n","20 Kernel function \n","21 The number of boosting trees to be trained \n","22 The number of boosting trees to be trained \n","23 Hidden layer \n","24 Layers \n","25 Deep learning \n","26 Deep learning results are easy to interpret \n","27 Deep learning \n","28 A technique in machine learning that involves ... \n","29 The use of decision trees for modeling \n","30 Clustering, dimensionality reduction, and anom... \n","31 The availability of cloud computing and specia... \n","32 Supervised and unsupervised learning \n","33 Convolutional Neural Networks (CNNs) \n","34 Sentiment analysis \n","35 Image classification \n","36 Image segmentation \n","37 To translate text from one language to another \n","38 The study of improving machines with experienc... \n","39 The study of improving machines with experienc... \n","40 The study of improving machines with experienc... \n","41 ML is a subset of AI \n","42 To increase accuracy \n","\n"," C \\\n","0 It determines the depth of the neural network \n","1 The number of epochs does not have any impact ... \n","2 The architecture determines the depth and widt... 
\n","3 Bayesian Optimization \n","4 It may result in overfitting \n","5 Bayesian Optimization \n","6 It requires less expertise \n","7 Bayesian Optimization \n","8 Small amount of training data \n","9 Limited scalability \n","10 Easy interpretability \n","11 Limited complexity \n","12 Poor performance on new data \n","13 Medical imaging \n","14 Speech recognition systems \n","15 NLP systems \n","16 Recommendation systems \n","17 Spam filters \n","18 To determine the influence of support vectors ... \n","19 To determine the influence of support vectors ... \n","20 Gamma \n","21 The maximum depth of each tree in the ensemble \n","22 The maximum depth of each tree in the ensemble \n","23 Input layer \n","24 Neurons \n","25 Both require the same volume of dataset \n","26 Both machine learning and deep learning result... \n","27 Both require the same computing power \n","28 A type of unsupervised machine learning that c... \n","29 The use of reinforcement learning algorithms \n","30 Supervised machine learning tasks like image c... \n","31 Manual feature engineering \n","32 Unsupervised and reinforcement learning \n","33 Recurrent Neural Networks (RNNs) \n","34 Image classification \n","35 Language translation \n","36 Game playing \n","37 To identify and understand visual data \n","38 The study that uses neural networks to imitate... \n","39 The study that uses neural networks to imitate... \n","40 The study that uses neural networks to imitate... \n","41 AI and ML are independent of each other \n","42 To achieve high accuracy with a small amount o... \n","\n"," D Correct Answer \\\n","0 It controls the width of the neural network B \n","1 The number of epochs determines the learning r... B \n","2 The architecture affects the activation functi... C \n","3 None of the above A \n","4 It is not effective for high-dimensional hyper... A \n","5 None of the above B \n","6 It is more effective for high-dimensional hype... 
A \n","7 None of the above C \n","8 No risk of overfitting A \n","9 Continual improvement D \n","10 No risk of overfitting B \n","11 Black box nature D \n","12 Reduced computational requirements C \n","13 All of the above D \n","14 All of the above D \n","15 All of the above D \n","16 All of the above D \n","17 All of the above D \n","18 To determine the maximum depth of each tree in... A \n","19 To determine the maximum depth of each tree in... B \n","20 Learning rate C \n","21 The minimum sum of instance weight needed in a... A \n","22 The minimum sum of instance weight needed in a... C \n","23 Final layer C \n","24 Connections A \n","25 Neither requires a dataset B \n","26 Deep learning results are difficult to interpret D \n","27 Neither requires computing power B \n","28 A method in machine learning that uses decisio... A \n","29 The use of unsupervised learning techniques A \n","30 Reinforcement learning tasks like robotics and... A \n","31 Supervised labeled datasets A \n","32 Supervised and reinforcement learning A \n","33 Artificial Neural Networks (ANNs) B \n","34 Game playing C \n","35 Game playing C \n","36 Speech recognition C \n","37 To control complex systems C \n","38 The study that focuses on learning, reasoning,... A \n","39 The study that focuses on learning from data a... D \n","40 The study that focuses on using neural network... D \n","41 AI and ML are the same thing B \n","42 To achieve high accuracy with a large amount o... D \n","\n"," Explanation \n","0 The learning rate hyperparameter controls the ... \n","1 Increasing the number of epochs can improve th... \n","2 The architecture of a neural network determine... \n","3 GridSearchCV is considered a 'brute force' app... \n","4 GridSearchCV is computationally expensive as i... \n","5 RandomizedSearchCV selects values at random fo... \n","6 RandomizedSearchCV is computationally faster t... \n","7 Bayesian Optimization treats the search for op... 
\n","8 One of the challenges in deep learning is the ... \n","9 One of the advantages of deep learning is its ... \n","10 One of the disadvantages of deep learning is t... \n","11 One of the challenges in interpreting deep lea... \n","12 The risk associated with overfitting in deep l... \n","13 Machine learning algorithms are used in image ... \n","14 Machine learning algorithms are used in speech... \n","15 Machine learning algorithms are used in NLP sy... \n","16 Machine learning algorithms are used in recomm... \n","17 Machine learning algorithms are used in sentim... \n","18 The regularization parameter (C) in SVMs contr... \n","19 The kernel function in SVMs defines the simila... \n","20 The parameter Gamma controls the influence of ... \n","21 The learning rate hyperparameter in XGBoost de... \n","22 The max_depth hyperparameter in XGBoost determ... \n","23 The input layer of an artificial neural networ... \n","24 During the training process of an artificial n... \n","25 Deep learning requires a larger volume of data... \n","26 Deep learning results are more complex and dif... \n","27 Deep learning requires a high-performance comp... \n","28 Deep learning is a branch of machine learning ... \n","29 The key characteristic of deep learning is the... \n","30 Deep learning has achieved significant success... \n","31 Training deep neural networks typically requir... \n","32 Deep learning can be used for supervised, unsu... \n","33 Convolutional Neural Networks (CNNs) are speci... \n","34 One of the main applications of deep learning ... \n","35 One of the main applications of deep learning ... \n","36 One of the main applications of deep learning ... \n","37 The main purpose of deep learning models in co... \n","38 Artificial Intelligence is the mechanism to in... \n","39 Machine Learning is the study/process which pr... \n","40 Deep Learning is a sub-part of the broader fam... \n","41 Artificial Intelligence is the broader family ... 
\n","42 The aim of Deep Learning is to achieve high ac... "],"text/html":["\n","
<table>
<thead>
<tr><th></th><th>Instruction</th><th>Question</th><th>A</th><th>B</th><th>C</th><th>D</th><th>Correct Answer</th><th>Explanation</th></tr>
</thead>
<tbody>
<tr><th>0</th><td>Create an MCQ on the impact of learning rate o...</td><td>What impact does the learning rate hyperparame...</td><td>It determines the number of epochs needed for ...</td><td>It controls the step size taken by the optimiz...</td><td>It determines the depth of the neural network</td><td>It controls the width of the neural network</td><td>B</td><td>The learning rate hyperparameter controls the ...</td></tr>
<tr><th>1</th><td>Create an MCQ on the impact of number of epoch...</td><td>How does the number of epochs hyperparameter a...</td><td>Increasing the number of epochs always improve...</td><td>Increasing the number of epochs can lead to ov...</td><td>The number of epochs does not have any impact ...</td><td>The number of epochs determines the learning r...</td><td>B</td><td>Increasing the number of epochs can improve th...</td></tr>
<tr><th>2</th><td>Create an MCQ on the impact of architecture on...</td><td>How does the architecture of a neural network ...</td><td>The architecture determines the learning rate ...</td><td>The architecture controls the step size taken ...</td><td>The architecture determines the depth and widt...</td><td>The architecture affects the activation functi...</td><td>C</td><td>The architecture of a neural network determine...</td></tr>
<tr><th>3</th><td>Create an MCQ on the strategies for hyperparam...</td><td>Which of the following strategies is considere...</td><td>GridSearchCV</td><td>RandomizedSearchCV</td><td>Bayesian Optimization</td><td>None of the above</td><td>A</td><td>GridSearchCV is considered a 'brute force' app...</td></tr>
<tr><th>4</th><td>Create an MCQ on the drawback of GridSearchCV</td><td>What is a drawback of using GridSearchCV for h...</td><td>It is computationally expensive</td><td>It requires expert knowledge</td><td>It may result in overfitting</td><td>It is not effective for high-dimensional hyper...</td><td>A</td><td>GridSearchCV is computationally expensive as i...</td></tr>
<tr><th>5</th><td>Create an MCQ on the strategy that selects val...</td><td>Which hyperparameter tuning strategy selects v...</td><td>GridSearchCV</td><td>RandomizedSearchCV</td><td>Bayesian Optimization</td><td>None of the above</td><td>B</td><td>RandomizedSearchCV selects values at random fo...</td></tr>
<tr><th>6</th><td>Create an MCQ on the advantage of RandomizedSe...</td><td>What is an advantage of using RandomizedSearch...</td><td>It is computationally faster</td><td>It guarantees optimal performance</td><td>It requires less expertise</td><td>It is more effective for high-dimensional hype...</td><td>A</td><td>RandomizedSearchCV is computationally faster t...</td></tr>
<tr><th>7</th><td>Create an MCQ on the strategy that treats hype...</td><td>Which hyperparameter tuning strategy treats th...</td><td>GridSearchCV</td><td>RandomizedSearchCV</td><td>Bayesian Optimization</td><td>None of the above</td><td>C</td><td>Bayesian Optimization treats the search for op...</td></tr>
<tr><th>8</th><td>Create an MCQ on the challenges in deep learning</td><td>What is one of the challenges in deep learning?</td><td>Limited computational resources</td><td>Easy interpretability of results</td><td>Small amount of training data</td><td>No risk of overfitting</td><td>A</td><td>One of the challenges in deep learning is the ...</td></tr>
<tr><th>9</th><td>Create an MCQ on the advantages of deep learning</td><td>What is one of the advantages of deep learning?</td><td>Low accuracy</td><td>Manual feature engineering</td><td>Limited scalability</td><td>Continual improvement</td><td>D</td><td>One of the advantages of deep learning is its ...</td></tr>
<tr><th>10</th><td>Create an MCQ on the disadvantages of deep lea...</td><td>What is one of the disadvantages of deep learn...</td><td>Low computational requirements</td><td>Small amount of labeled data</td><td>Easy interpretability</td><td>No risk of overfitting</td><td>B</td><td>One of the disadvantages of deep learning is t...</td></tr>
<tr><th>11</th><td>Create an MCQ on the challenges in interpretin...</td><td>What is one of the challenges in interpreting ...</td><td>Easy interpretability</td><td>Clear decision-making process</td><td>Limited complexity</td><td>Black box nature</td><td>D</td><td>One of the challenges in interpreting deep lea...</td></tr>
<tr><th>12</th><td>Create an MCQ on the risk of overfitting in de...</td><td>What is the risk associated with overfitting i...</td><td>Improved performance on new data</td><td>No impact on model performance</td><td>Poor performance on new data</td><td>Reduced computational requirements</td><td>C</td><td>The risk associated with overfitting in deep l...</td></tr>
<tr><th>13</th><td>Create an MCQ on the application of machine le...</td><td>In which of the following applications are mac...</td><td>Self-driving cars</td><td>Security systems</td><td>Medical imaging</td><td>All of the above</td><td>D</td><td>Machine learning algorithms are used in image ...</td></tr>
<tr><th>14</th><td>Create an MCQ on the use of machine learning i...</td><td>Which of the following applications utilize ma...</td><td>Virtual assistants like Siri and Alexa</td><td>Call centers</td><td>Speech recognition systems</td><td>All of the above</td><td>D</td><td>Machine learning algorithms are used in speech...</td></tr>
<tr><th>15</th><td>Create an MCQ on the role of machine learning ...</td><td>Which of the following applications involve th...</td><td>Chatbots</td><td>Virtual assistants</td><td>NLP systems</td><td>All of the above</td><td>D</td><td>Machine learning algorithms are used in NLP sy...</td></tr>
<tr><th>16</th><td>Create an MCQ on the application of machine le...</td><td>Which of the following applications utilize ma...</td><td>E-commerce sites</td><td>Streaming services</td><td>Recommendation systems</td><td>All of the above</td><td>D</td><td>Machine learning algorithms are used in recomm...</td></tr>
<tr><th>17</th><td>Create an MCQ on the use of machine learning i...</td><td>In which of the following applications are mac...</td><td>Social media monitoring</td><td>Sentiment analysis systems</td><td>Spam filters</td><td>All of the above</td><td>D</td><td>Machine learning algorithms are used in sentim...</td></tr>
<tr><th>18</th><td>Create an MCQ on the regularization parameter ...</td><td>What is the role of the regularization paramet...</td><td>To control the trade-off between the margin an...</td><td>To define the similarity between data points</td><td>To determine the influence of support vectors ...</td><td>To determine the maximum depth of each tree in...</td><td>A</td><td>The regularization parameter (C) in SVMs contr...</td></tr>
<tr><th>19</th><td>Create an MCQ on the kernel function in SVMs</td><td>What is the purpose of the kernel function in ...</td><td>To control the trade-off between the margin an...</td><td>To define the similarity between data points</td><td>To determine the influence of support vectors ...</td><td>To determine the maximum depth of each tree in...</td><td>B</td><td>The kernel function in SVMs defines the simila...</td></tr>
<tr><th>20</th><td>Create an MCQ on the parameter that controls t...</td><td>Which parameter controls the influence of supp...</td><td>Regularization parameter (C)</td><td>Kernel function</td><td>Gamma</td><td>Learning rate</td><td>C</td><td>The parameter Gamma controls the influence of ...</td></tr>
<tr><th>21</th><td>Create an MCQ on the learning rate hyperparame...</td><td>What does the learning rate hyperparameter det...</td><td>The step size taken by the optimizer during ea...</td><td>The number of boosting trees to be trained</td><td>The maximum depth of each tree in the ensemble</td><td>The minimum sum of instance weight needed in a...</td><td>A</td><td>The learning rate hyperparameter in XGBoost de...</td></tr>
<tr><th>22</th><td>Create an MCQ on the max_depth hyperparameter ...</td><td>What does the max_depth hyperparameter determi...</td><td>The step size taken by the optimizer during ea...</td><td>The number of boosting trees to be trained</td><td>The maximum depth of each tree in the ensemble</td><td>The minimum sum of instance weight needed in a...</td><td>C</td><td>The max_depth hyperparameter in XGBoost determ...</td></tr>
<tr><th>23</th><td>Create an MCQ on the structure of artificial n...</td><td>Which layer of an artificial neural network re...</td><td>Output layer</td><td>Hidden layer</td><td>Input layer</td><td>Final layer</td><td>C</td><td>The input layer of an artificial neural networ...</td></tr>
<tr><th>24</th><td>Create an MCQ on the training process of artif...</td><td>What is adjusted during the training process o...</td><td>Weights</td><td>Layers</td><td>Neurons</td><td>Connections</td><td>A</td><td>During the training process of an artificial n...</td></tr>
<tr><th>25</th><td>Create an MCQ on the differences between machi...</td><td>Which of the following requires a larger volum...</td><td>Machine learning</td><td>Deep learning</td><td>Both require the same volume of dataset</td><td>Neither requires a dataset</td><td>B</td><td>Deep learning requires a larger volume of data...</td></tr>
<tr><th>26</th><td>Create an MCQ on the complexity of interpretin...</td><td>Which of the following is true regarding the i...</td><td>Machine learning results are easy to interpret</td><td>Deep learning results are easy to interpret</td><td>Both machine learning and deep learning result...</td><td>Deep learning results are difficult to interpret</td><td>D</td><td>Deep learning results are more complex and dif...</td></tr>
<tr><th>27</th><td>Create an MCQ on the computing power requireme...</td><td>Which of the following requires a high-perform...</td><td>Machine learning</td><td>Deep learning</td><td>Both require the same computing power</td><td>Neither requires computing power</td><td>B</td><td>Deep learning requires a high-performance comp...</td></tr>
<tr><th>28</th><td>Create an MCQ on the definition of deep learning</td><td>What is deep learning?</td><td>A branch of machine learning that uses artific...</td><td>A technique in machine learning that involves ...</td><td>A type of unsupervised machine learning that c...</td><td>A method in machine learning that uses decisio...</td><td>A</td><td>Deep learning is a branch of machine learning ...</td></tr>
<tr><th>29</th><td>Create an MCQ on the key characteristic of dee...</td><td>What is the key characteristic of deep learning?</td><td>The use of deep neural networks with multiple ...</td><td>The use of decision trees for modeling</td><td>The use of reinforcement learning algorithms</td><td>The use of unsupervised learning techniques</td><td>A</td><td>The key characteristic of deep learning is the...</td></tr>
<tr><th>30</th><td>Create an MCQ on the success of deep learning ...</td><td>In which fields has deep learning achieved sig...</td><td>Image recognition, natural language processing...</td><td>Clustering, dimensionality reduction, and anom...</td><td>Supervised machine learning tasks like image c...</td><td>Reinforcement learning tasks like robotics and...</td><td>A</td><td>Deep learning has achieved significant success...</td></tr>
<tr><th>31</th><td>Create an MCQ on the requirements for training...</td><td>What are the requirements for training deep ne...</td><td>A large amount of data and computational resou...</td><td>The availability of cloud computing and specia...</td><td>Manual feature engineering</td><td>Supervised labeled datasets</td><td>A</td><td>Training deep neural networks typically requir...</td></tr>
<tr><th>32</th><td>Create an MCQ on the types of machine learning...</td><td>Which types of machine learning are used in de...</td><td>Supervised, unsupervised, and reinforcement le...</td><td>Supervised and unsupervised learning</td><td>Unsupervised and reinforcement learning</td><td>Supervised and reinforcement learning</td><td>A</td><td>Deep learning can be used for supervised, unsu...</td></tr>
<tr><th>33</th><td>Create an MCQ on the types of neural networks ...</td><td>Which type of neural network is specifically d...</td><td>Feedforward Neural Networks (FNNs)</td><td>Convolutional Neural Networks (CNNs)</td><td>Recurrent Neural Networks (RNNs)</td><td>Artificial Neural Networks (ANNs)</td><td>B</td><td>Convolutional Neural Networks (CNNs) are speci...</td></tr>
<tr><th>34</th><td>Create an MCQ on the applications of deep lear...</td><td>What is one of the main applications of deep l...</td><td>Speech recognition</td><td>Sentiment analysis</td><td>Image classification</td><td>Game playing</td><td>C</td><td>One of the main applications of deep learning ...</td></tr>
<tr><th>35</th><td>Create an MCQ on the applications of deep lear...</td><td>What is one of the main applications of deep l...</td><td>Object detection and recognition</td><td>Image classification</td><td>Language translation</td><td>Game playing</td><td>C</td><td>One of the main applications of deep learning ...</td></tr>
<tr><th>36</th><td>Create an MCQ on the applications of deep lear...</td><td>What is one of the main applications of deep l...</td><td>Sentiment analysis</td><td>Image segmentation</td><td>Game playing</td><td>Speech recognition</td><td>C</td><td>One of the main applications of deep learning ...</td></tr>
<tr><th>37</th><td>Create an MCQ on the main purpose of deep lear...</td><td>What is the main purpose of deep learning mode...</td><td>To analyze the sentiment of text</td><td>To translate text from one language to another</td><td>To identify and understand visual data</td><td>To control complex systems</td><td>C</td><td>The main purpose of deep learning models in co...</td></tr>
<tr><th>38</th><td>Create an MCQ on the definition of Artificial ...</td><td>What is the definition of Artificial Intellige...</td><td>The study of training machines to mimic human ...</td><td>The study of improving machines with experienc...</td><td>The study that uses neural networks to imitate...</td><td>The study that focuses on learning, reasoning,...</td><td>A</td><td>Artificial Intelligence is the mechanism to in...</td></tr>
<tr><th>39</th><td>Create an MCQ on the definition of Machine Lea...</td><td>What is the definition of Machine Learning?</td><td>The study of training machines to mimic human ...</td><td>The study of improving machines with experienc...</td><td>The study that uses neural networks to imitate...</td><td>The study that focuses on learning from data a...</td><td>D</td><td>Machine Learning is the study/process which pr...</td></tr>
<tr><th>40</th><td>Create an MCQ on the definition of Deep Learning.</td><td>What is the definition of Deep Learning?</td><td>The study of training machines to mimic human ...</td><td>The study of improving machines with experienc...</td><td>The study that uses neural networks to imitate...</td><td>The study that focuses on using neural network...</td><td>D</td><td>Deep Learning is a sub-part of the broader fam...</td></tr>
<tr><th>41</th><td>Create an MCQ on the relationship between Arti...</td><td>What is the relationship between Artificial In...</td><td>AI is a subset of ML</td><td>ML is a subset of AI</td><td>AI and ML are independent of each other</td><td>AI and ML are the same thing</td><td>B</td><td>Artificial Intelligence is the broader family ...</td></tr>
<tr><th>42</th><td>Create an MCQ on the aim of Deep Learning.</td><td>What is the aim of Deep Learning?</td><td>To increase chances of success</td><td>To increase accuracy</td><td>To achieve high accuracy with a small amount o...</td><td>To achieve high accuracy with a large amount o...</td><td>D</td><td>The aim of Deep Learning is to achieve high ac...</td></tr>
</tbody>
</table>
\n"],"application/vnd.google.colaboratory.intrinsic+json":{"type":"dataframe","summary":"{\n \"name\": \"dataset['val']\",\n \"rows\": 43,\n \"fields\": [\n {\n \"column\": \"Instruction\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Create an MCQ on the main purpose of deep learning models in computer vision.\",\n \"Create an MCQ on the training process of artificial neural networks\",\n \"Create an MCQ on the differences between machine learning and deep learning\"\n ],\n \"num_unique_values\": 43,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Question\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"What is the main purpose of deep learning models in computer vision?\",\n \"What is adjusted during the training process of an artificial neural network?\",\n \"Which of the following requires a larger volume of dataset compared to machine learning?\"\n ],\n \"num_unique_values\": 43,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"A\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"To increase chances of success\",\n \"Chatbots\",\n \"A large amount of data and computational resources\"\n ],\n \"num_unique_values\": 36,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"B\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"To increase accuracy\",\n \"Virtual assistants\",\n \"The availability of cloud computing and specialized hardware\"\n ],\n \"num_unique_values\": 36,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"C\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Gamma\",\n \"NLP systems\",\n \"It may result in overfitting\"\n ],\n \"num_unique_values\": 37,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"D\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"AI and ML are the same thing\",\n 
\"Connections\",\n \"Speech recognition\"\n ],\n \"num_unique_values\": 33,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Correct Answer\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"C\",\n \"D\",\n \"B\"\n ],\n \"num_unique_values\": 4,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Explanation\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"The main purpose of deep learning models in computer vision is to identify and understand visual data. Deep learning models can be used to perform tasks such as object detection and recognition, image classification, and image segmentation.\",\n \"During the training process of an artificial neural network, the weights of the connections between neurons are adjusted to enhance the performance of the model.\",\n \"Deep learning requires a larger volume of dataset compared to machine learning.\"\n ],\n \"num_unique_values\": 43,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}"}},"metadata":{},"execution_count":21}]},{"cell_type":"code","source":["from huggingface_hub import 
notebook_login\n","notebook_login()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":145,"referenced_widgets":["ac3383a4853c484ca009cddb6e305853","095e9de8c9c24583baa8c7d074c5eacc","7a6eab206dcb4cd3b0cd4681a38f192f","e40bf3020d024dc0bc72a6f23b471494","8fc0cb635ac64733b957061f58ed16b6","9dcbf15879f94eddaf99a1603d1ca4c8","b0cf0967450c415f92e8540b30944ccf","e588245da98b412d849fbe2f94fb4b79","58409bb2b4c845f589fce0e2c2078a8c","17a1cd2fe40e4dde8b6b196e29abbec1","10bef54de0f64e86a3c9b1c039885ee9","40c244e062aa47a7846cace18c952cad","0d5efa7a7e7a4fe2ac2a789fe79ea94a","51e3dfdb61374128a65a208209c93060","283deaf2c9544e80a9433ae9148492d5","b9eb5c03173143aeb2793da889a2428b","8a3773ae30cf482a80edc5af98dc9cd8","0e34328104b64acb9a63e76013ccfc0d","e4fb41fce6b44fffbb1e0a6ab9d477ab","b5b3620259184090b78f008fc8a789db","bdec133061344a838b792668b98a5daa","f26867efbb384d1791ccc7e6ff2f7b2b","e35c557212ea4317ba624707f8d42dde","bfee25db776447eb9126cdbdbdfe6a8c","82764a4acf0e49e58547e9f2e52d5533","ab32080c5ec54c8abb90f0d85855adf5","5eacd5686b6d4486b772e957a80d894d","6c6af529b3dd449cb2826706c220d0cc","079da2bd7cfd4ed3b3324a2e54a8ae87","570cfe79345a43b5a9b34f05e72cc6c9","b21ff34c2fd844f2baee7dd45eda5700","8f7bf4ff652b4ae2b137b0518a4d6a00"]},"id":"U2YZULfoCura","executionInfo":{"status":"ok","timestamp":1708322802639,"user_tz":-480,"elapsed":9,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"31a4d102-5e31-4500-c9bc-34d0274857cf"},"execution_count":null,"outputs":[{"output_type":"display_data","data":{"text/plain":["VBox(children=(HTML(value='
"],"image/png":"\n"},"metadata":{}}]},{"cell_type":"markdown","source":["## 3. Filter out rows with more than 2048 tokens\n","\n","We will remove samples with more than 2048 tokens. Llama 2's default maximum context size is 4096 tokens, so this limit is half of the context window."],"metadata":{"id":"_RXe958fNLwH"}},{"cell_type":"code","source":["def filter_by_token_count(dataset_split, combined_token_counts, max_tokens=2048):\n","    # Keep only the rows whose combined token count is at most 'max_tokens'\n","    filtered_dataset = [example for example, count in zip(dataset_split, combined_token_counts) if count <= max_tokens]\n","    return filtered_dataset\n","\n","\n","# Assuming 'dataset' contains your data splits\n","fig, axs = plt.subplots(3, 5, figsize=(25, 15)) # Adjust figure size as necessary\n","\n","split_names = ['train', 'test', 'val']\n","for row, split_name in enumerate(split_names):\n","    # Tokenize and count\n","    instruction_counts, explanation_counts, question_counts, options_counts, combined_counts = tokenize_and_count(dataset[split_name])\n","\n","    # Filter dataset based on combined token count\n","    filtered_dataset = filter_by_token_count(dataset[split_name], combined_counts)\n","\n","    # Re-tokenize and count for the filtered dataset\n","    filtered_instruction_counts, filtered_explanation_counts, filtered_question_counts, filtered_options_counts, filtered_combined_counts = tokenize_and_count(filtered_dataset)\n","\n","    # Plotting the distributions for the filtered datasets, organizing by row based on the split\n","    plot_distribution(filtered_instruction_counts, f\"{split_name} (filtered): Instruction\", axs[row, 0])\n","    plot_distribution(filtered_explanation_counts, f\"{split_name} (filtered): Explanation\", axs[row, 1])\n","    plot_distribution(filtered_question_counts, f\"{split_name} (filtered): Question\", axs[row, 2])\n","    plot_distribution(filtered_options_counts, f\"{split_name} (filtered): Options\", axs[row, 3])\n","    plot_distribution(filtered_combined_counts, f\"{split_name} (filtered): Combined\", axs[row, 
4])\n","\n","# Adjust layout to prevent overlap and ensure clarity\n","plt.tight_layout(pad=3.0)\n","plt.show()\n"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":898},"id":"WYzaKhSvz9Yk","executionInfo":{"status":"ok","timestamp":1708322857920,"user_tz":-480,"elapsed":8192,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"ee8da4e6-3afb-4534-885b-9780571cde25"},"execution_count":null,"outputs":[{"output_type":"display_data","data":{"text/plain":["
"],"image/png":"\n"},"metadata":{}}]},{"cell_type":"code","source":["# Function to filter dataset and plot new distribution\n","def filter_and_plot(dataset_split_name, dataset_split_data, combined_token_counts, axs, position):\n","    # Filter out rows with more than 2048 tokens\n","    valid_indices = [i for i, count in enumerate(combined_token_counts) if count <= 2048]\n","    print(f\"Number of valid rows in {dataset_split_name}: {len(valid_indices)}\")\n","    print(f\"Removing {len(dataset_split_data) - len(valid_indices)} rows from {dataset_split_name}...\")\n","\n","    # Extract valid rows based on indices\n","    valid_dataset = [dataset_split_data[i] for i in valid_indices]\n","\n","    # Re-calculate token counts for the valid dataset if necessary\n","    # This step is assumed necessary only if the token counts need to be recalculated for the filtered dataset\n","    # Otherwise, valid_token_counts = [combined_token_counts[i] for i in valid_indices] would suffice\n","    _, _, _, _, valid_combined_counts = tokenize_and_count(valid_dataset)\n","\n","    # Plot the new distribution for valid rows\n","    plot_distribution(valid_combined_counts, f\"New distribution after filtering {dataset_split_name}\", axs[position])\n","\n","# Create a figure with subplots\n","fig, axs = plt.subplots(3, 1, figsize=(6, 9)) # Adjust figsize as necessary\n","\n","# Assuming the 'dataset' variable is a dictionary containing data splits 'train', 'test', and 'val'\n","for i, split_name in enumerate(['train', 'test', 'val']):\n","    # Tokenize and count for the specific dataset split\n","    _, _, _, _, combined_counts = tokenize_and_count(dataset[split_name])\n","\n","    # Filter datasets based on token count and plot the new distribution\n","    filter_and_plot(split_name, dataset[split_name], combined_counts, axs, 
i)\n","\n","plt.tight_layout()\n","plt.show()\n"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":1000},"id":"LTw4ITHDl92J","executionInfo":{"status":"ok","timestamp":1708322864043,"user_tz":-480,"elapsed":2649,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"9f5de7d2-6e9a-4e2e-d3ee-73df3a6ad59f"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["Number of valid rows in train: 334\n","Removing 0 rows from train...\n","Number of valid rows in test: 41\n","Removing 0 rows from test...\n","Number of valid rows in val: 43\n","Removing 0 rows from val...\n"]},{"output_type":"display_data","data":{"text/plain":["
"],"image/png":"\n"},"metadata":{}}]},{"cell_type":"code","source":["# Initialize a flag to indicate whether any entries were removed in any split\n","entries_removed = False\n","\n","# Iterate over each split in the dataset\n","for split_name in ['train', 'test', 'val']:\n","    # Get the original length of the split\n","    original_length = len(dataset[split_name])\n","    # Tokenize and count tokens in the split\n","    _, _, _, _, combined_counts = tokenize_and_count(dataset[split_name])\n","    # Determine valid indices (entries with <= 2048 tokens)\n","    valid_indices = [i for i, count in enumerate(combined_counts) if count <= 2048]\n","    # Check if any entries were removed\n","    if len(valid_indices) < original_length:\n","        entries_removed = True\n","    # Update the dataset split with filtered entries\n","    dataset[split_name] = dataset[split_name].select(valid_indices)\n","\n","# Flag to control execution of subsequent code\n","continue_execution = True\n","\n","if not entries_removed:\n","    print(\"No entries removed due to token count. Skipping saving.\")\n","    continue_execution = False\n","\n","# Proceed with further steps only if entries were removed\n","if continue_execution:\n","    # Save the filtered dataset to disk\n","    dataset.save_to_disk('new_mcq_data')\n","    print(\"Filtered dataset saved successfully.\")"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"2N8lcoTS5jkm","executionInfo":{"status":"ok","timestamp":1708322873795,"user_tz":-480,"elapsed":922,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"d1028a6f-0388-47ea-8486-e715bcbc52fd"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["No entries removed due to token count. Skipping saving.\n"]}]},{"cell_type":"markdown","source":["---\n","\n","## 4. 
Near-deduplication Using Embeddings\n","\n","* Near-deduplication with embeddings is a technique that employs vector representations to effectively identify and manage nearly identical data entries.\n","\n","* By transforming data into these vectors (embeddings), we can quantitatively measure how similar different pieces of data are. This transformation significantly improves our ability to manage large datasets, where sorting through and removing near-duplicates manually would be impractical.\n","\n","* Widely used in fields like database management, information retrieval, and machine learning, this approach is crucial for efficient data handling and analysis.\n","\n","---\n","\n","### We Will Not Perform Deduplication on Our MCQ Dataset.\n","\n","* **Intentional Repetition for Emphasis**: In educational contexts, certain concepts may be intentionally repeated to underscore their significance. Deduplication could diminish the dataset's educational effectiveness by removing these purposeful repetitions.\n","\n","* **Variations of Similar Questions**: MCQ datasets often feature questions that, while seemingly similar, include minor variations in wording, options, or context. Inadequately designed deduplication algorithms risk eliminating these nuances, thereby losing valuable elements of the dataset.\n","\n","* **Difficulty in Defining \"Duplicates\"**: Identifying duplicates within MCQs poses a significant challenge, as questions that appear identical might differ in subtle yet crucial ways. These distinctions often represent unique learning opportunities that would be lost through deduplication.\n","\n","---\n"],"metadata":{"id":"kXzTKu99w3g4"}},{"cell_type":"markdown","source":["## 5. 
Top-k sampling\n","\n","Top-k sampling, as used here, keeps only the k samples with the most tokens.\n","\n","---\n","\n","### Decision on \"Top-k Sampling\" for Our MCQ Dataset\n","\n","\n","We have decided against employing \"Top-k sampling\" to select only the top k samples with the most tokens in our MCQ dataset. This approach does not align with the core objectives of MCQ dataset development, for the following reason:\n","\n","\n","**Practical Considerations**\n","\n","* **Conciseness and Effectiveness**: The hallmark of high-quality MCQs lies in their conciseness and meaningfulness. Favoring question length over substance could detract from the dataset's quality, as longer questions do not necessarily equate to higher educational value. Succinct yet profound questions are typically the most beneficial and stimulating for learners.\n","\n","\n","Given these considerations, we conclude that \"Top-k sampling,\" which prioritizes token count, falls short of fulfilling the requirements of our MCQ dataset. The true merit of a valuable MCQ dataset resides in its diverse and balanced assortment of topics and difficulty levels, not merely in question length. 
This philosophy ensures our dataset remains versatile and effective across various educational and machine learning applications.\n","---"],"metadata":{"id":"TOCspcgXNOav"}},{"cell_type":"code","source":["# @title\n","# Push to Hugging Face Hub\n","dataset.push_to_hub(\"ssoh/mcq_dataset_2\")"],"metadata":{"id":"pj1b5S_68KB0","colab":{"base_uri":"https://localhost:8080/","height":244,"referenced_widgets":["d245dbe5cf8946e28f08287c15926a6d","45f5b19c94a544d79b3d65e13e67a0f1","ecb708463202466b96b244822fddd7c3","9cc0c38f4fb4452e8a14f1b208aaaefd","9ec8035efb334973bf54923d9b25b491","60fc22bbcb204b008cf416f44835ab84","686e9cf6109241b28c44aba78667aaa9","bf5c401f943e4040893e01cade21b632","5bb2a8fc3af2421c9b1f92039ea61ba2","5050f71288f2478c8d5bdeff47fcdca5","551a53a84dd8459b8cdd3e3e09ae687c","5af01ff36b2d4edebc842f0aea68bbdd","243e232f096047ca84cafe43fdd02abf","aee15bf2685349e684a7c1e0847fe830","324b3953949e4416b3614572969d1124","20471d4effb54abfabce62c072f90ebe","34914f01bdad45aca133fda48cb7b77d","015ca0b73d984087b070630305d794af","df0642f7d06d4a1eaf10ddf27460f3f1","9398e6c79bd947ae8e7090363bd669a4","814c3545d598458182c9d638cd7d78c0","46cbf4315b0e4a00b41157bc4d03f0fd","00efd4b16cd24e7c92d9dd2daa03b10f","9e4171a5c49a494a9ae72aa72a67862c","7569df6a04a14194866afbb4e78f6468","39ff79fe9a214782894e983ce4d45acd","91b5f1a2be93471eb880235771722cd9","57c0c6ab34974fc89e2036bd1074a098","bb364cb3c7134948b24ba1b3a0c2186d","4108281cd53941288d99b60e994770f1","1e11face32ee40f890cb97751209d6ac","45a866686d954d33ba1b3034397661ea","f5014c5a31d342e7952bcebf024250a2","8288c75935d74e85bd62e1682f8f04fe","1981153d1d2943b1936946a362048f3f","d5aec20dc1034e57980a8e9121fdd8e0","9795e18cccce455485accc5309d8d9de","f0bea944fba44d508575eb73b8684596","f9b825f130304fccadb8d660daa0b3ad","4d18d9a7763749faacffaf660cb6c100","4ba50000d3b14f74a288aa8249723fd6","152308af6482481aa88914fb1f37eb13","2e3600f228c446349e2fe7126fb95255","57849a6643f64b58802bb57ab8e18412","426bad77baaa4db49164e3cf7b3505b9","2f6008ded0274199
ba675ec6f0a7469d","7c290b50d93a4f0cb209a5e444934596","bc438acf4f2047e6b7e49829c8b078ee","9862cd6a17af4bfd80a02f9b68b80629","c949f9dd143c4ccda644e3a601a46302","b603160aed09476c937847a2c1fa6236","e3e9b4527bd84cbe9f83479d090416c0","3db20951b33c4ce3b14acd7fe1a438ce","96497d48b0bd436b964826dcbaa92a76","2fb73ee317e7482e94e7c696c6a0a2de","48cd0eb61edc469482a93d6a5cf8dabd","1ee577704a9048e39d73ad278b587215","5cd9bf0a588947a8895e187842dfca0c","cfcd77f72de043b3ad28dbaa216909d3","621a509dcda84c90b6f1e76b39b9b97a","e7ea517d5821476fa388b2d67c098675","14348abed9184177a87d6a8c0f558097","dbc3be056d4e4e7d87beb01252eecb37","0acc8312636b4361b915e9bd39a02057","f0f3a78cb34f41caba416d266c270952","f118a0a1a7b444508e6c02aa0acc83d7"]},"executionInfo":{"status":"ok","timestamp":1708322881797,"user_tz":-480,"elapsed":3008,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"266d2243-1d5a-41a4-b0e2-25a573b7f602"},"execution_count":null,"outputs":[{"output_type":"display_data","data":{"text/plain":["Uploading the dataset shards: 0%| | 0/1 [00:00 "]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_142852-r9yt9wy0"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run vermilion-springroll-11 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/r9yt9wy0"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"4e64b2267c9c4213a5914056f6811d1b","version_major":2,"version_minor":0},"text/plain":["Map: 0%| | 0/334 [00:00\n"," \n"," \n"," [66/66 10:49, Epoch 6/6]\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 3.005400 | 2.068154 |
| 20 | 1.436200 | 1.290664 |
| 30 | 0.778100 | 1.072349 |
| 40 | 0.613200 | 0.931616 |
| 50 | 0.496900 | 0.921341 |
| 60 | 0.468000 | 0.888872 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:04]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.006 MB uploaded\\r'), FloatProgress(value=1.0, max=1.0)))"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▃▂▁▁▁▁
eval/runtime: ▁▅▇▇▇█▇
eval/samples_per_second: █▄▂▂▂▁▂
eval/steps_per_second: █▄▂▂▁▁▂
eval_loss
train/epoch: ▁▁▂▂▄▄▅▅▆▆▇▇██
train/global_step: ▁▁▂▂▃▃▅▅▆▆▇▇███
train/learning_rate: █▇▅▄▂▁
train/loss: █▄▂▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 0.88887
eval/runtime: 5.8859
eval/samples_per_second: 7.306
eval/steps_per_second: 1.019
eval_loss: 0.88887
train/epoch: 6.0
train/global_step: 66
train/learning_rate: 2e-05
train/loss: 0.468
train/total_flos: 2090258212601856.0
train/train_loss: 1.06766
train/train_runtime: 658.5513
train/train_samples_per_second: 3.043
train/train_steps_per_second: 0.1

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run vermilion-springroll-11 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/r9yt9wy0
Synced 6 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_142852-r9yt9wy0/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 14:40:09,515] Trial 0 finished with value: 0.8888720870018005 and parameters: {'learning_rate': 0.00022063199006940203, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 3}. Best is trial 0 with value: 0.8888720870018005.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"3de6384b3841485d8068f7265fbda30e","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113223888807826, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_144009-p10q3kv9"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run lunar-ox-12 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/p10q3kv9"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"0a6c66cadea549f78fe5a183e841734e","version_major":2,"version_minor":0},"text/plain":["Map: 0%| | 0/334 [00:00\n"," \n"," \n"," [66/66 10:55, Epoch 6/6]\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.881900 | 1.890017 |
| 20 | 1.003600 | 1.057150 |
| 30 | 0.615700 | 0.908779 |
| 40 | 0.500400 | 0.848109 |
| 50 | 0.430000 | 0.856009 |
| 60 | 0.402800 | 0.839791 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.023 MB uploaded\\r'), FloatProgress(value=0.2557933392427504, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▂▁▁▁▁▁
eval/runtime: ▁▄▆█▆▆▅
eval/samples_per_second: █▅▃▁▃▃▃
eval/steps_per_second: █▅▃▁▄▃▄
eval_loss
train/epoch: ▁▁▂▂▄▄▅▅▆▆▇▇██
train/global_step: ▁▁▂▂▃▃▅▅▆▆▇▇███
train/learning_rate: █▇▅▄▂▁
train/loss: █▃▂▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 0.83979
eval/runtime: 5.9095
eval/samples_per_second: 7.276
eval/steps_per_second: 1.015
eval_loss: 0.83979
train/epoch: 6.0
train/global_step: 66
train/learning_rate: 4e-05
train/loss: 0.4028
train/total_flos: 2090258212601856.0
train/train_loss: 0.91779
train/train_runtime: 664.2846
train/train_samples_per_second: 3.017
train/train_steps_per_second: 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run lunar-ox-12 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/p10q3kv9
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_144009-p10q3kv9/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 14:51:33,062] Trial 1 finished with value: 0.8397907614707947 and parameters: {'learning_rate': 0.000388078354781562, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 5}. Best is trial 1 with value: 0.8397907614707947.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"9a11d1cc83a44c18a071c37ee29cb191","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113095899862755, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_145133-zcwhia3h"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run lunar-envelope-13 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/zcwhia3h"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [120/168 11:45 < 04:46, 0.17 it/s, Epoch 5/8]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
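| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.805400 | 1.864337 |
| 20 | 1.078500 | 1.049373 |
| 30 | 0.637600 | 0.906046 |
| 40 | 0.549300 | 0.883616 |
| 50 | 0.491300 | 0.882419 |
| 60 | 0.456700 | 0.849847 |
| 70 | 0.431200 | 0.859990 |
| 80 | 0.410700 | 0.856455 |
| 90 | 0.398700 | 0.832274 |
| 100 | 0.375900 | 0.863837 |
| 110 | 0.387600 | 0.843072 |
| 120 | 0.374900 | 0.846136 |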

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17044687077892842, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▂▂▁▁▁▁▁▁▁▁▁▁
eval/runtime: ▁▃▄▅▃▄▃▃▅█▅▅▄
eval/samples_per_second: █▆▅▄▆▅▆▆▄▁▄▄▅
eval/steps_per_second: █▆▅▄▅▅▆▆▄▁▄▄▄
eval_loss
train/epoch: ▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇████
train/global_step: ▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇█████
train/learning_rate: █▇▇▆▅▅▄▄▃▂▂▁
train/loss: █▃▂▂▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 0.83227
eval/runtime: 5.8969
eval/samples_per_second: 7.292
eval/steps_per_second: 1.017
eval_loss: 0.83227
train/epoch: 5.71
train/global_step: 120
train/learning_rate: 0.00011
train/loss: 0.3749
train/total_flos: 1821144316674048.0
train/train_loss: 0.69982
train/train_runtime: 710.1571
train/train_samples_per_second: 3.763
train/train_steps_per_second: 0.237

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run lunar-envelope-13 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/zcwhia3h
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_145133-zcwhia3h/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:03:40,491] Trial 2 finished with value: 0.8322736024856567 and parameters: {'learning_rate': 0.00038013816677024434, 'num_train_epochs': 8, 'per_device_train_batch_size': 16, 'warmup_steps': 4}. Best is trial 2 with value: 0.8322736024856567.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"192adca007164c1188758c608dfd03f7","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.01111326985539765, max=1.0)…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_150340-0rux0mbt"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run dazzling-orchid-14 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/0rux0mbt"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [84/84 08:09, Epoch 4/4]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
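| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 3.117100 | 2.101251 |
| 20 | 1.543500 | 1.223637 |
| 30 | 0.765800 | 1.053006 |
| 40 | 0.633500 | 0.956405 |
| 50 | 0.551300 | 0.914150 |
| 60 | 0.488700 | 0.883170 |
| 70 | 0.453200 | 0.863325 |
| 80 | 0.431700 | 0.861434 |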

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.022 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.6507556781402156, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▃▂▂▁▁▁▁▁
eval/runtime: ▁▄▃▅█▆▅▇▅
eval/samples_per_second: █▅▆▄▁▃▄▂▄
eval/steps_per_second: █▅▆▅▁▃▅▂▄
eval_loss
train/epoch: ▁▁▂▂▃▃▄▄▅▅▆▆▇▇████
train/global_step: ▁▁▂▂▃▃▄▄▅▅▆▆▇▇█████
train/learning_rate: █▇▆▅▄▃▂▁
train/loss: █▄▂▂▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 0.86143
eval/runtime: 5.9003
eval/samples_per_second: 7.288
eval/steps_per_second: 1.017
eval_loss: 0.86143
train/epoch: 4.0
train/global_step: 84
train/learning_rate: 1e-05
train/loss: 0.4317
train/total_flos: 1274022822739968.0
train/train_loss: 0.96978
train/train_runtime: 494.7488
train/train_samples_per_second: 2.7
train/train_steps_per_second: 0.17

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run dazzling-orchid-14 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/0rux0mbt
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_150340-0rux0mbt/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:12:11,748] Trial 3 finished with value: 0.8614340424537659 and parameters: {'learning_rate': 0.00023956952379873406, 'num_train_epochs': 4, 'per_device_train_batch_size': 16, 'warmup_steps': 5}. Best is trial 2 with value: 0.8322736024856567.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"f7b8197a4e4c4242a7965774efc249e1","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113144411097488, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_151211-6nazgql4"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run prosperous-dragon-15 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/6nazgql4"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [66/66 10:56, Epoch 6/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.680100 | 1.782772 |
| 20 | 0.894100 | 1.025657 |
| 30 | 0.587400 | 0.897337 |
| 40 | 0.476300 | 0.830805 |
| 50 | 0.420400 | 0.864126 |
| 60 | 0.394900 | 0.841267 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.1705031517334534, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▂▁▁▁▁▁
eval/runtime: ▁▇▆▆█▇▅
eval/samples_per_second: █▂▄▄▁▂▄
eval/steps_per_second: █▁▃▃▁▁▃
eval_loss
train/epoch: ▁▁▂▂▄▄▅▅▆▆▇▇██
train/global_step: ▁▁▂▂▃▃▅▅▆▆▇▇███
train/learning_rate: █▇▅▄▂▁
train/loss: █▃▂▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 0.83081
eval/runtime: 5.9104
eval/samples_per_second: 7.275
eval/steps_per_second: 1.015
eval_loss: 0.83081
train/epoch: 6.0
train/global_step: 66
train/learning_rate: 4e-05
train/loss: 0.3949
train/total_flos: 2090258212601856.0
train/train_loss: 0.85955
train/train_runtime: 665.5241
train/train_samples_per_second: 3.011
train/train_steps_per_second: 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run prosperous-dragon-15 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/6nazgql4
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_151211-6nazgql4/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:23:34,430] Trial 4 finished with value: 0.8308054208755493 and parameters: {'learning_rate': 0.00041915607985727055, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 3}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"d8dc9ac41d544e3fb4094e8c4b80cecc","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113028755709011, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_152334-gqh7mnqx"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run crimson-dragon-16 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/gqh7mnqx"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [44/44 07:13, Epoch 4/4]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.606400 | 1.642761 |
| 20 | 0.800100 | 1.017194 |
| 30 | 0.585700 | 0.903642 |
| 40 | 0.473400 | 0.855594 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17057383277516674, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▂▁▁▁
eval/runtime: ▁▅▅█▁
eval/samples_per_second: █▄▅▁█
eval/steps_per_second: █▅▅▁█
eval_loss
train/epoch: ▁▁▃▃▅▅▇▇██
train/global_step: ▁▁▃▃▅▅▇▇███
train/learning_rate: █▆▃▁
train/loss: █▂▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 0.85559
eval/runtime: 5.9033
eval/samples_per_second: 7.284
eval/steps_per_second: 1.016
eval_loss: 0.85559
train/epoch: 4.0
train/global_step: 44
train/learning_rate: 5e-05
train/loss: 0.4734
train/total_flos: 1395517145776128.0
train/train_loss: 1.0546
train/train_runtime: 442.5635
train/train_samples_per_second: 3.019
train/train_steps_per_second: 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run crimson-dragon-16 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/gqh7mnqx
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_152334-gqh7mnqx/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:31:13,768] Trial 5 finished with value: 0.855594277381897 and parameters: {'learning_rate': 0.0004882684074952214, 'num_train_epochs': 4, 'per_device_train_batch_size': 32, 'warmup_steps': 3}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"b17199cf7d1c41ee969d27814d9efe8b","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113176900026802, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_153113-emmid59b"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run beaming-paper-17 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/emmid59b"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [66/66 10:55, Epoch 6/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.814500 | 1.876597 |
| 20 | 1.022200 | 1.064405 |
| 30 | 0.624600 | 0.902199 |
| 40 | 0.507500 | 0.854691 |
| 50 | 0.436200 | 0.856049 |
| 60 | 0.407800 | 0.836567 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17026547676022635, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▃▁▁▁▁▁
eval/runtime: ▁▁▄▃█▅▃
eval/samples_per_second: ██▅▅▁▄▆
eval/steps_per_second: ██▄▅▁▄▅
eval_loss
train/epoch: ▁▁▂▂▄▄▅▅▆▆▇▇██
train/global_step: ▁▁▂▂▃▃▅▅▆▆▇▇███
train/learning_rate: █▇▅▄▂▁
train/loss: █▃▂▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 0.83657
eval/runtime: 5.9028
eval/samples_per_second: 7.285
eval/steps_per_second: 1.016
eval_loss: 0.83657
train/epoch: 6.0
train/global_step: 66
train/learning_rate: 4e-05
train/loss: 0.4078
train/total_flos: 2090258212601856.0
train/train_loss: 0.91464
train/train_runtime: 664.3977
train/train_samples_per_second: 3.016
train/train_steps_per_second: 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run beaming-paper-17 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/emmid59b
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_153113-emmid59b/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:42:35,259] Trial 6 finished with value: 0.8365665674209595 and parameters: {'learning_rate': 0.000371977101120841, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 4}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"3718f807c93646f8ba07bbc2d3594547","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113231277947003, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_154235-x78afq0n"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run glittering-monkey-18 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/x78afq0n"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [44/44 07:14, Epoch 4/4]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 3.195100 | 2.187073 |
| 20 | 1.574500 | 1.620554 |
| 30 | 0.901500 | 1.120180 |
| 40 | 0.690900 | 1.036435 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17055463319920083, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss: █▅▂▁▁
eval/runtime: ▁▆▄█▅
eval/samples_per_second: █▃▅▁▄
eval/steps_per_second: █▃▅▁▄
eval_loss
train/epoch: ▁▁▃▃▅▅▇▇██
train/global_step: ▁▁▃▃▅▅▇▇███
train/learning_rate: █▆▃▁
train/loss: █▃▂▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss: 1.03644
eval/runtime: 5.9099
eval/samples_per_second: 7.276
eval/steps_per_second: 1.015
eval_loss: 1.03644
train/epoch: 4.0
train/global_step: 44
train/learning_rate: 2e-05
train/loss: 0.6909
train/total_flos: 1395517145776128.0
train/train_loss: 1.50181
train/train_runtime: 442.6998
train/train_samples_per_second: 3.018
train/train_steps_per_second: 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run glittering-monkey-18 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/x78afq0n
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_154235-x78afq0n/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:50:14,768] Trial 7 finished with value: 1.0364350080490112 and parameters: {'learning_rate': 0.00021352963324526537, 'num_train_epochs': 4, 'per_device_train_batch_size': 32, 'warmup_steps': 5}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"d31e8ddbc07347adb1c8a3798ee36db8","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113219644499218, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_155014-8e8ip35f"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run beaming-fuse-19 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/8e8ip35f"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [120/126 11:45 < 00:35, 0.17 it/s, Epoch 5/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
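| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.719600 | 1.833074 |
| 20 | 1.075200 | 1.046847 |
| 30 | 0.640200 | 0.922950 |
| 40 | 0.550400 | 0.871228 |
| 50 | 0.498600 | 0.876081 |
| 60 | 0.453900 | 0.857398 |
| 70 | 0.428300 | 0.860341 |
| 80 | 0.409600 | 0.856458 |
| 90 | 0.394100 | 0.830379 |
| 100 | 0.371500 | 0.847909 |
| 110 | 0.381800 | 0.842489 |
| 120 | 0.359500 | 0.851102 |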

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:04]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.012 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.35110723430597374, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss                █▃▂▁▁▁▁▁▁▁▁▁▁
eval/runtime             ▁▂▂▃▄▆█▄█▇▅█▁
eval/samples_per_second  █▇▇▆▅▃▁▅▁▂▄▁█
eval/steps_per_second    ▇▇▇▆▅▃▁▅▁▁▅▁█
eval_loss
train/epoch              ▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇████
train/global_step        ▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇█████
train/learning_rate      █▇▇▆▅▅▄▄▃▂▂▁
train/loss               █▃▂▂▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss                       0.83038
eval/runtime                    5.8734
eval/samples_per_second         7.321
eval/steps_per_second           1.022
eval_loss                       0.83038
train/epoch                     5.71
train/global_step               120
train/learning_rate             2e-05
train/loss                      0.3595
train/total_flos                1821144316674048.0
train/train_loss                0.69023
train/train_runtime             710.9352
train/train_samples_per_second  2.819
train/train_steps_per_second    0.177

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run beaming-fuse-19 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/8e8ip35f
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_155014-8e8ip35f/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 16:02:22,467] Trial 8 finished with value: 0.8303791284561157 and parameters: {'learning_rate': 0.0003782307395143863, 'num_train_epochs': 6, 'per_device_train_batch_size': 16, 'warmup_steps': 3}. Best is trial 8 with value: 0.8303791284561157.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"d8cdb5224da6481691ab8959404e3d24","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113188377607407, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_160222-bob3m06p"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run dazzling-ox-20 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/bob3m06p"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [140/168 13:43 < 02:47, 0.17 it/s, Epoch 6/8]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step  Training Loss  Validation Loss
10    2.785900       1.793209
20    1.003300       1.026370
30    0.611100       0.900247
40    0.545900       0.883518
50    0.486400       0.883806
60    0.444200       0.881959
70    0.432800       0.877395
80    0.410800       0.867789
90    0.400000       0.852276
100   0.373100       0.867085
110   0.386200       0.850435
120   0.373800       0.855305
130   0.392000       0.853369
140   0.360400       0.869668

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.1705731731337404, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss                █▂▁▁▁▁▁▁▁▁▁▁▁▁▁
eval/runtime             ▁▄▅▆▆▆▆▇▇█▆▆▆▇▆
eval/samples_per_second  █▅▄▃▃▃▃▂▁▁▃▃▃▂▃
eval/steps_per_second    █▅▄▂▃▄▃▂▂▁▂▃▄▂▃
eval_loss
train/epoch              ▁▁▂▂▂▂▃▃▃▃▄▄▄▄▅▅▅▅▆▆▆▆▇▇▇▇████
train/global_step        ▁▁▂▂▂▂▃▃▃▃▄▄▄▄▅▅▅▅▆▆▆▆▇▇▇▇█████
train/learning_rate      █▇▇▆▆▅▅▄▄▃▃▂▂▁
train/loss               █▃▂▂▁▁▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss                       0.85044
eval/runtime                    5.8867
eval/samples_per_second         7.305
eval/steps_per_second           1.019
eval_loss                       0.85044
train/epoch                     6.67
train/global_step               140
train/learning_rate             8e-05
train/loss                      0.3604
train/total_flos                2129326975303680.0
train/train_loss                0.64328
train/train_runtime             828.8076
train/train_samples_per_second  3.224
train/train_steps_per_second    0.203

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run dazzling-ox-20 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/bob3m06p
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_160222-bob3m06p/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 16:16:28,206] Trial 9 finished with value: 0.8504351377487183 and parameters: {'learning_rate': 0.000457264058410859, 'num_train_epochs': 8, 'per_device_train_batch_size': 16, 'warmup_steps': 5}. Best is trial 8 with value: 0.8303791284561157.\n"]}],"source":["def objective(trial):\n","\n"," # Define hyperparameters outside the wandb.init to use them later in the code\n"," learning_rate = trial.suggest_float('learning_rate', 2e-4, 5e-4, log=True)\n"," num_train_epochs = trial.suggest_categorical('num_train_epochs', [4, 6, 8])\n"," per_device_train_batch_size = trial.suggest_categorical('per_device_train_batch_size', [16, 32])\n"," warmup_steps = trial.suggest_int('warmup_steps', 3, 5)\n","\n"," wandb.init(\n"," project=\"Education-Chatbot-Optimization\",\n"," entity=\"szehanz\",\n"," group=\"optuna-optimization\",\n"," job_type=\"hyperparameter_search\",\n"," reinit=True,\n"," config={\n"," \"learning_rate\": learning_rate,\n"," \"num_train_epochs\": num_train_epochs,\n"," \"per_device_train_batch_size\": per_device_train_batch_size,\n"," \"warmup_steps\": warmup_steps\n"," }\n"," )\n","\n"," # Format the current date and time\n"," current_time = datetime.now().strftime(\"%Y%m%d-%H%M%S\")\n"," output_dir = f\"train_out_dir_{current_time}\" # Append the current date and time to the directory name\n","\n"," # Create the output directory\n"," os.makedirs(output_dir, exist_ok=True) # Using exist_ok=True to avoid error if the directory already exists\n","\n","\n"," # Define TrainingArguments with the suggested hyperparameters\n"," training_args = TrainingArguments(\n"," output_dir=output_dir, # Directory for saving output 
models and checkpoints.\n"," save_strategy=\"steps\", # Save model checkpoints at regular step intervals.\n"," save_steps=10, # Save model checkpoints every 10 steps.\n"," learning_rate=learning_rate, # Initial learning rate for the optimizer.\n"," per_device_train_batch_size=per_device_train_batch_size, # Batch size per device during training.\n"," per_device_eval_batch_size=8, # Batch size per device during evaluation.\n"," num_train_epochs=num_train_epochs, # Total number of training epochs.\n"," warmup_steps=warmup_steps, # Number of warmup steps for the learning rate scheduler.\n"," evaluation_strategy='steps', # Perform evaluation at regular step intervals.\n"," eval_steps=10, # Perform evaluation every 10 steps.\n"," logging_steps=10, # Log training metrics every 10 steps.\n"," optim='paged_adamw_8bit', # Specifies the optimizer to use.\n"," lr_scheduler_type='linear', # Type of learning rate scheduler.\n"," gradient_accumulation_steps=1, # Number of steps to accumulate gradients before performing an update.\n"," load_best_model_at_end=True, # Load the best model based on evaluation metric at the end of training.\n"," report_to='wandb', # Report training and evaluation metrics to Weights & Biases.\n"," )\n","\n","\n"," # Initialize the Trainer with early stopping callback inside the objective function\n"," trainer = SFTTrainer(\n"," model=model, # Model to fine-tune; re-create it per trial so that trials do not share weights\n"," train_dataset=train_dataset,\n"," eval_dataset=val_dataset,\n"," peft_config=peft_config,\n"," dataset_text_field=\"Instruction\",\n"," tokenizer=tokenizer,\n"," args=training_args,\n"," max_seq_length=4096,\n"," callbacks=[EarlyStoppingCallback(early_stopping_patience=3)],\n"," )\n","\n"," # Train the model and evaluate within the objective function\n"," trainer.train()\n"," eval_result = trainer.evaluate()\n","\n"," # Log the primary metric to WandB\n"," wandb.log({\"eval_loss\": eval_result[\"eval_loss\"]})\n","\n"," # Finish the WandB run for this trial\n"," wandb.finish()\n","\n"," # 
Return the metric to be optimized\n"," return eval_result[\"eval_loss\"]\n","\n","\n","# Run the optimization\n","study = optuna.create_study(direction='minimize')\n","study.optimize(objective, n_trials=10)"]},{"cell_type":"code","execution_count":null,"metadata":{"id":"fmdlQTVSHT8e","outputId":"a2935a56-5cad-4dbc-c55c-53b3b5ad1368"},"outputs":[{"name":"stdout","output_type":"stream","text":["Best trial:\n"," Value: 0.8303791284561157\n"," Params: \n"," learning_rate: 0.0003782307395143863\n"," num_train_epochs: 6\n"," per_device_train_batch_size: 16\n"," warmup_steps: 3\n"]}],"source":["# Best trial results\n","print(\"Best trial:\")\n","print(f\" Value: {study.best_trial.value}\")\n","print(\" Params: \")\n","for key, value in study.best_trial.params.items():\n"," print(f\" {key}: {value}\")"]},{"cell_type":"code","execution_count":null,"metadata":{"id":"mKlA_ahVHT8e","outputId":"6365a674-b011-48bb-94ea-7aa9d657d323","colab":{"referenced_widgets":[""]}},"outputs":[{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_161628-5gyifk7s"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run floating-fish-2 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/huggingface"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/huggingface/runs/5gyifk7s"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [126/126 12:16, Epoch 6/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step  Training Loss  Validation Loss
10    2.715800       1.857712
20    1.077300       1.051454
30    0.647500       0.913019
40    0.547200       0.881412
50    0.489900       0.886365
60    0.457400       0.855178
70    0.428400       0.860198
80    0.407200       0.863780
90    0.395000       0.834071
100   0.372300       0.848378
110   0.379500       0.848452
120   0.358800       0.857301

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17130191715842674, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss                █▂▂▁▁▁▁▁▁▁▁▁
eval/runtime             ▁▄▄▆▆▆▅█▇▇▆█
eval/samples_per_second  █▅▅▃▃▃▄▁▁▂▃▁
eval/steps_per_second    █▅▅▃▃▃▄▂▂▂▃▁
train/epoch              ▁▁▂▂▂▂▃▃▃▃▄▄▅▅▅▅▆▆▆▆▇▇███
train/global_step        ▁▁▂▂▂▂▃▃▃▃▄▄▅▅▅▅▆▆▆▆▇▇███
train/learning_rate      █▇▇▆▅▅▄▄▃▂▂▁
train/loss               █▃▂▂▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss                       0.8573
eval/runtime                    5.9051
eval/samples_per_second         7.282
eval/steps_per_second           1.016
train/epoch                     6.0
train/global_step               126
train/learning_rate             2e-05
train/loss                      0.3588
train/total_flos                1913972332118016.0
train/train_loss                0.67577
train/train_runtime             746.485
train/train_samples_per_second  2.685
train/train_steps_per_second    0.169
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run floating-fish-2 at: https://wandb.ai/szehanz/huggingface/runs/5gyifk7s
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_161628-5gyifk7s/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"}],"source":["# Use best hyperparameters from the study\n","best_trial = study.best_trial\n","\n","best_learning_rate = best_trial.params['learning_rate']\n","best_num_train_epochs = best_trial.params['num_train_epochs']\n","best_per_device_train_batch_size = best_trial.params['per_device_train_batch_size']\n","best_warmup_steps = best_trial.params['warmup_steps']\n","\n","\n","# Define TrainingArguments with the best hyperparameters for retraining\n","best_training_args = TrainingArguments(\n"," output_dir=\"best_train_out_dir\",\n"," save_strategy=\"steps\",\n"," save_steps=10,\n"," learning_rate=best_learning_rate,\n"," per_device_train_batch_size=best_per_device_train_batch_size,\n"," per_device_eval_batch_size=8,\n"," num_train_epochs=best_num_train_epochs,\n"," warmup_steps=best_warmup_steps,\n"," evaluation_strategy='steps',\n"," eval_steps=10,\n"," logging_steps=10,\n"," optim='paged_adamw_8bit',\n"," lr_scheduler_type='linear',\n"," gradient_accumulation_steps=1,\n"," load_best_model_at_end=True,\n"," report_to='wandb',\n",")\n","\n","# Reinitialize the Trainer with the best hyperparameters\n","best_trainer = SFTTrainer(\n"," model=model,\n"," train_dataset=train_dataset,\n"," eval_dataset=val_dataset,\n"," peft_config=peft_config,\n"," dataset_text_field=\"Instruction\",\n"," tokenizer=tokenizer,\n"," args=best_training_args,\n"," max_seq_length=4096,\n",")\n","\n","# Retrain the model with the best hyperparameters\n","best_trainer.train()\n","\n","\n","# Save trained model\n","best_trainer.model.save_pretrained(new_model)\n","\n","# Finish the WandB run for this trial\n","wandb.finish()"]},{"cell_type":"markdown","metadata":{"id":"_g0fB7P9s0ol"},"source":["Merging the base 
model with the trained adapter."]},{"cell_type":"code","execution_count":null,"metadata":{"colab":{"referenced_widgets":["aafec7a64d034e05b1aaf17bb153136b","1191c9b140394f1aa3952c1cecda8fed","68107c402ec343ffa40e22171e9fe3e9"]},"id":"QQn30cRtAZ-P","outputId":"6508be7b-0a96-494e-bd33-d35c5c331f52"},"outputs":[{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"68107c402ec343ffa40e22171e9fe3e9","version_major":2,"version_minor":0},"text/plain":["Loading checkpoint shards: 0%| | 0/2 [00:00
Copy a token from your Hugging Face\ntokens page and paste it below.
Immediately click login after copying\nyour token or it might be stored in plain text in this notebook file.
"}},"14d0ff6e07a148c0b65224af86af6b95":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"187745df1f154e7da09a0c646a22cebb":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_638517f3c94445c7b610b84744859ef8","placeholder":"​","style":"IPY_MODEL_5206f9fc25234586b735fc9c65e6b19a","value":"generation_config.json: 
100%"}},"19b8b7692cfd46eca5abb50ac2854262":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"19d3adedf1a245f69a2498b1838f6415":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_31ffba7793b944cd80bec0fb6ac2fbbe","IPY_MODEL_1ae848be349a41239a7ba6efadc82edf","IPY_MODEL_9a33d92ad0604eff9ac56944586facbb"],"layout":"IPY_MODEL_293d425206684064b0572e990a815cee"}},"1ae848be349a41239a7ba6efadc82edf":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_mod
ule_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_8ae8a9d03adc476a8dd64f09e194907d","max":2,"min":0,"orientation":"horizontal","style":"IPY_MODEL_aeb94dbf3f8a4b43bb07c833af3b98c9","value":2}},"1c6e5dd4ca3446ec8cf8a03b85ad57dc":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_0786fc506a7640a287016991f57d3768","IPY_MODEL_3af05a188f40469dbd52fda55c7a9e22","IPY_MODEL_0af30e36f2524c1084c1237948f3b18e"],"layout":"IPY_MODEL_dfe1c2648a564676bfe1e09bda61d439"}},"1ea12323b1244c768a17a6ed5420f854":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"1f9e8ddb08814b2db7761478eb9069f5":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_7982d1d315964d138f31643db445b48a","max":614,"min":0,"orientation":"horizontal","style":"IPY_MODEL_67346bc716384552a69d1afdc844db00","value":614}},"20cc750b1bde4efdb3c3a5b5321222b4":{"model_module":"@jupyter-widgets/controls",
"model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_a199f20edde447829b17804b17af2fe5","max":188,"min":0,"orientation":"horizontal","style":"IPY_MODEL_447f608108364159ab7ed546ad02559a","value":188}},"21556be54ed34b15b909bf8e7b8fd93a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b8113970ea7245e9890221d4e4cf5e8e","placeholder":"​","style":"IPY_MODEL_d3224d16458249a3bfd29253c2d6a86f","value":"\nPro Tip: If you don't already have one, you can create a dedicated\n'notebooks' token with 'write' access, that you can then easily reuse for all\nnotebooks. 
"}},"2254fe28315f43a99c3579195c0c0008":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"22e8bae9ff6744bca3e90d46d220106f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"23176fa0130348c5bc1b0deb591709f3":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"236b3417f0e94039a1a449e7ab96738b":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutMod
el","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"25543b118c8e423aa7fd8c898d5e1256":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"2924e96aa10346efb39684e5369e2170":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_bce8d1501218410ba8b042aeb3f0fc26","placeholder":"​","style":"IPY_MODEL_405603de026d484ab283f053f4b17c6d","value":"Your token has been saved to 
/root/.cache/huggingface/token"}},"293d425206684064b0572e990a815cee":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"29f303aa6ac8464aa91124c3fe659379":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":nul
l,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"2cb9cfbde1e0483c97a2c531e0034adf":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"2dce1978d19e4de3a6a1b1cef6ed518f":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_widt
h":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"3076e4abb7fe427fa4fccb43e9f3371e":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"30d47a9da70a4cf5a7e597f13d28d526":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"31ffba7793b944cd80bec0fb6ac2fbbe":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_7ee17b5879ec4909b7b616b79d5eab36","placeholder":"​","style":"IPY_MODEL_d8b847adb46d4049a4b43bba2a0e06da","value":"Downloading shards: 
100%"}},"32a57f9df45b41da91e1d638f531505d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"3af05a188f40469dbd52fda55c7a9e22":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_82829039182948efb800703e3e21d96d","max":26788,"min":0,"orientation":"horizontal","style":"IPY_MODEL_32a57f9df45b41da91e1d638f531505d","value":26788}},"3e4a06b9b13444e3b82e0c3c26e17b8f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"CheckboxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"CheckboxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"CheckboxView","description":"Add token as git 
credential?","description_tooltip":null,"disabled":false,"indent":true,"layout":"IPY_MODEL_f91ebc43c1344e8688e2eeb2771c7b65","style":"IPY_MODEL_ea00aa1eb73949fc94083f1d31372915","value":true}},"405603de026d484ab283f053f4b17c6d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"42262413163d49fd8902c275c13da1e6":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_cf370d3015b04b6caa9cbfb8f21dd5e7","placeholder":"​","style":"IPY_MODEL_1ea12323b1244c768a17a6ed5420f854","value":" 9.98G/9.98G [01:29<00:00, 
126MB/s]"}},"441c37014e6b4c8995e7a5305f96fa38":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"447f608108364159ab7ed546ad02559a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"46e41557d0bc4cb4a03c6894fcc57d0e":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"4cee3e23bcbd4643ba542d94b82dbfd2":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"L
ayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"5206f9fc25234586b735fc9c65e6b19a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"52f2b6adb9904cbeb014387429c0d1ad":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_b61ac64b2a0c4beb8ea209782b9dd09b","IPY_MODEL_9e9c2d712ae84d3aa87fbe433ae0701b","IPY_MODEL_fe4d028a78e64a4d8b47f8bc8b8ec15d"],"layout":"IPY_MODEL_88a7aeaf3b2547ea91cf20ed2f1a3dc0"}},"537d173a4313491f8f89f799f1dc7b2a":{"model_module":"@jupyter-widgets/base"
,"model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"541ef20ab6f34337a2d6d20098f6fef5":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b0f0ac261e364edd99d7b75e747e2c47","placeholder":"​","style":"IPY_MODEL_ff9f726db3434e3184e723d5da884d0a","value":"Token is valid (permission: 
write)."}},"55e506bd03d24dd7b24129dc64da9c01":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"5e5b95c9801443cdbce9c8e629c33589":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"PasswordModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"PasswordModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"PasswordView","continuous_update":true,"description":"Token:","description_tooltip":null,"disabled":false,"layout":"IPY_MODEL_29f303aa6ac8464aa91124c3fe659379","placeholder":"​","style":"IPY_MODEL_3076e4abb7fe427fa4fccb43e9f3371e","value":""}},"62f1cf19fe204aa4a424248e807ce061":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"638517f3c94445c7b610b84744859ef8":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":nu
ll,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"6430379d01874ec3a7cf9fea59c42914":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ButtonModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ButtonModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ButtonView","button_style":"","description":"Login","disabled":false,"icon":"","layout":"IPY_MODEL_2dce1978d19e4de3a6a1b1cef6ed518f","style":"IPY_MODEL_e81c501824f94e7d839684fafbc65b31","tooltip":""}},"67346bc716384552a69d1afdc844db00":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"6edf40f558f54d8b82d949f83557d609":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_08400a144d3c497a94ae4d84e72a1067","placeholder":"​","style":"IPY_M
ODEL_976a3440d2c3423c8be835b0d6f56492","value":"Connecting..."}},"77c0d63f6cd6494daf7e94cfe0397e45":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_4cee3e23bcbd4643ba542d94b82dbfd2","placeholder":"​","style":"IPY_MODEL_25543b118c8e423aa7fd8c898d5e1256","value":" 188/188 [00:00<00:00, 12.3kB/s]"}},"77daad9def9f4254933afdeb7202e5d2":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"7982d1d315964d138f31643db445b48a":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"
overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"7c701d53772344f6a057cd43742d47f5":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b9e4e59acd4d468393f42857d5635e25","placeholder":"​","style":"IPY_MODEL_30d47a9da70a4cf5a7e597f13d28d526","value":"Loading checkpoint shards: 100%"}},"7ee17b5879ec4909b7b616b79d5eab36":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"8080eb1edf8f44f69b5f218046715f36":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":
"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_e2df35aaf7e0420b9524d533cb122521","max":2,"min":0,"orientation":"horizontal","style":"IPY_MODEL_d90c3a424bac4bb09e99339c28df2eda","value":2}},"82829039182948efb800703e3e21d96d":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"83127281047a429a846b9bb3b3209760":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,
"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"86014f569ee34895badef4acfe958051":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"86616a0d2f9d4da1a75e00d092ca9d4d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_a7a17a5883784ed9961437d0102fd9a4","IPY_MODEL_9934540648a444a09f0783843ab2d901","IPY_MODEL_4226241
3163d49fd8902c275c13da1e6"],"layout":"IPY_MODEL_83127281047a429a846b9bb3b3209760"}},"88a7aeaf3b2547ea91cf20ed2f1a3dc0":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"8ae8a9d03adc476a8dd64f09e194907d":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,
"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"963973c1e970410cb2ba97e9fb4ef511":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"976a3440d2c3423c8be835b0d6f56492":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"9934540648a444a09f0783843ab2d901":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"Prog
ressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_f702ae2d46834e9b934a40abca49a990","max":9976576152,"min":0,"orientation":"horizontal","style":"IPY_MODEL_14d0ff6e07a148c0b65224af86af6b95","value":9976576152}},"9a33d92ad0604eff9ac56944586facbb":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_537d173a4313491f8f89f799f1dc7b2a","placeholder":"​","style":"IPY_MODEL_55e506bd03d24dd7b24129dc64da9c01","value":" 2/2 [01:56<00:00, 52.47s/it]"}},"9c7550ec29c34ef2b674a3667ebd740e":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_236b3417f0e94039a1a449e7ab96738b","placeholder":"​","style":"IPY_MODEL_77daad9def9f4254933afdeb7202e5d2","value":"config.json: 
100%"}},"9e4d26b87e1243f3ae744ddbbe2c5b51":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"9e9c2d712ae84d3aa87fbe433ae0701b":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_86014f569ee34895badef4acfe958051","max":3500296424,"min":0,"orientation":"horizontal","style":"IPY_MODEL_22e8bae9ff6744bca3e90d46d220106f","value":3500296424}},"a066a28914bc46cc85ebcb7d90d27fbe":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"ove
rflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"a199f20edde447829b17804b17af2fe5":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"a39b2a49ff814181aa0df84c57e9245b":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"a7a17a5883784ed9961437d0102fd9a4":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_ae17
3a50f4374c7fa55d8e360d30d4d1","placeholder":"​","style":"IPY_MODEL_b93a21bcaf6344c49a0ab8e2a887922c","value":"model-00001-of-00002.safetensors: 100%"}},"a7ef8ff133144d4b9817800e5b4739a4":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"a9628200c0a34cccacdb0ea66e200b87":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_0cda6495ef4b4c9ab3eeff2392d89163","placeholder":"​","style":"IPY_MODEL_9e4d26b87e1243f3ae744ddbbe2c5b51","value":" 2/2 [00:58<00:00, 
26.87s/it]"}},"a96de5c1f80f43d8b0b689d5db0fa248":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"ae173a50f4374c7fa55d8e360d30d4d1":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"aeb94dbf3f8a4b43bb07c833af3b98c9":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"b0f0ac261e364edd99d7b75e747e2c47":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":
"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"b4ed4d5e266a4cd5b9011436b4e92951":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_7c701d53772344f6a057cd43742d47f5","IPY_MODEL_8080eb1edf8f44f69b5f218046715f36","IPY_MODEL_a9628200c0a34cccacdb0ea66e200b87"],"layout":"IPY_MODEL_2254fe28315f43a99c3579195c0c0008"}},"b577aeffe358446ea41e4b3c58b80641":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_441c37014e6b4c8995e7a5305f96fa38","p
laceholder":"​","style":"IPY_MODEL_a39b2a49ff814181aa0df84c57e9245b","value":" 614/614 [00:00<00:00, 15.7kB/s]"}},"b61ac64b2a0c4beb8ea209782b9dd09b":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_a066a28914bc46cc85ebcb7d90d27fbe","placeholder":"​","style":"IPY_MODEL_a96de5c1f80f43d8b0b689d5db0fa248","value":"model-00002-of-00002.safetensors: 100%"}},"b634ae14caf84bc99dcafd3a70b7a383":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"b64f26ac024c46eabfc4728586369130":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_
name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_a7ef8ff133144d4b9817800e5b4739a4","placeholder":"​","style":"IPY_MODEL_ba4cf32b2f71428282721e7818b34a5a","value":"Login successful"}},"b8113970ea7245e9890221d4e4cf5e8e":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"b93a21bcaf6344c49a0ab8e2a887922c":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"b9e4e59acd4d468393f42857d5635e25":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_mo
del_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"ba4cf32b2f71428282721e7818b34a5a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"bce8d1501218410ba8b042aeb3f0fc26":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":
null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"c3d08bc595a74c3180a7a83afc569584":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"VBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"VBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"VBoxView","box_style":"","children":["IPY_MODEL_541ef20ab6f34337a2d6d20098f6fef5","IPY_MODEL_fe46e1cf697f4b1fab764104be32da95","IPY_MODEL_2924e96aa10346efb39684e5369e2170","IPY_MODEL_b64f26ac024c46eabfc4728586369130"],"layout":"IPY_MODEL_cad60b6f14f249c187d573dd3a4428e0"}},"ca8d07b9b454471e883d43e03b475a9f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"cad60b6f14f249c187d573dd3a4428e0":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":"center","align_self":null,"border":null,"bottom":null,"display":"flex","flex":null,"flex_flow":"column","grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_col
umns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":"50%"}},"cf370d3015b04b6caa9cbfb8f21dd5e7":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"d3224d16458249a3bfd29253c2d6a86f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"d8b847adb46d4049a4b43bba2a0e06da":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-w
idgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"d90c3a424bac4bb09e99339c28df2eda":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"de756c426cf0492bb122a45b94d4bbe7":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"df0d8231953b44f1bf8e6d2cd39b2f66":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_9c7550ec29c34ef2b674a3667ebd740e","IPY_MODEL_1f9e8ddb08814b2db7761478eb9069f5","IPY_MODEL_b577aeffe358446ea41e4b3c58b80641"],"layout":"IPY_MODEL_19b8b7692cfd46eca5abb50ac2854262"}},"dfe1c2648a564676bfe1e09bda61d439":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"
LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"e2df35aaf7e0420b9524d533cb122521":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"e81c501824f94e7d839684fafbc65b31":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ButtonStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ButtonStyleModel","_view_co
unt":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","button_color":null,"font_weight":""}},"ea00aa1eb73949fc94083f1d31372915":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"f702ae2d46834e9b934a40abca49a990":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"f81ada25e7ff4f5da6b3f6c6e73590e4":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self"
:null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"f91ebc43c1344e8688e2eeb2771c7b65":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"fe46e1cf697f4b1fab764104be32da95":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_vie
w_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_f81ada25e7ff4f5da6b3f6c6e73590e4","placeholder":"​","style":"IPY_MODEL_62f1cf19fe204aa4a424248e807ce061","value":"Your token has been saved in your configured git credential helpers (store)."}},"fe4d028a78e64a4d8b47f8bc8b8ec15d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b634ae14caf84bc99dcafd3a70b7a383","placeholder":"​","style":"IPY_MODEL_46e41557d0bc4cb4a03c6894fcc57d0e","value":" 3.50G/3.50G [00:26<00:00, 176MB/s]"}},"ff9f726db3434e3184e723d5da884d0a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}}}}},"nbformat":4,"nbformat_minor":0} \ No newline at end of file +{"cells":[{"cell_type":"markdown","source":["## Creation of the MCQ-tuned model by fine-tuning the Base model with Custom MCQ dataset"],"metadata":{"id":"dcQxOEQQ6btB"}},{"cell_type":"markdown","source":["---"],"metadata":{"id":"LlkcVvTDs96X"}},{"cell_type":"markdown","source":["# MCQ Custom Dataset Creation\n","\n","## 1. Introduction\n","\n","High-quality data is fundamental for producing a good model; the higher the quality of the data, the better the resulting model. 
The following steps outline the process of creating a dataset specifically for fine-tuning our Llama2 model.\n","\n","\n","\n","![](https://i.imgur.com/IDNhAWH.png)\n","\n","\n","There are several types of datasets that can be used to fine-tune Large Language Models (LLMs):\n","\n","1. **Instruction Datasets:** These datasets contain direct instructions or prompts followed by the correct or expected outputs.\n","\n","2. **Raw Completion:** This involves providing a prompt to the model and letting it generate a response without any predefined correct answer.\n","\n","3. **Preference Datasets:** These datasets include human feedback in the form of preferences, where annotators compare pairs of model outputs to determine which is better.\n","\n","4. **Human Feedback Data:** This is specific to Reinforcement Learning from Human Feedback (RLHF) and involves direct feedback on the model's outputs from human annotators.\n","\n","5. **Demonstration Data:** Also used in RLHF, these datasets consist of examples showing ideal model outputs or actions, typically created by humans.\n","\n","6. **Reward Modeling Data:** Used to train a reward model in RLHF, this dataset predicts human feedback on model outputs based on actual feedback data.\n","\n","7. **Dialogue Data:** Particularly relevant for conversational AI, this includes annotated conversations that indicate the quality of responses or provide corrections.\n","\n","\n","---\n","\n","\n","\n","* Typically, an instruction dataset is utilized for fine-tuning the Llama 2 Model. Since we are focusing on Supervised Fine Tuning, the instruction dataset becomes our primary choice.\n","\n","Therefore, we have 2 options:\n","\n","1. Create our own Instruction Dataset.\n","2. 
Modify an existing instruction dataset, which involves filtering, modifying, and enriching it.\n","\n","We have decided to proceed with the 1st option: creating our own Instruction Dataset.\n","\n","* This will involve prompt engineering and incorporating sanity checks to ensure quality and relevance."],"metadata":{"id":"wAQMA1-DKZZ5"}},{"cell_type":"markdown","source":["## 2. Load and analyze the dataset"],"metadata":{"id":"hU_mUK-nol-t"}},{"cell_type":"code","execution_count":null,"metadata":{"id":"8P7g6eHuxxKe"},"outputs":[],"source":["# Install libraries\n","!pip install -q datasets transformers sentence_transformers faiss-gpu huggingface_hub"]},{"cell_type":"code","source":["# Import the required libraries\n","import json\n","import sys\n","import pandas as pd\n","from datasets import Dataset, DatasetDict, load_dataset\n","\n","from transformers import AutoTokenizer\n","import matplotlib.pyplot as plt\n","import seaborn as sns\n","\n","from sentence_transformers import SentenceTransformer\n","import faiss\n","from tqdm.autonotebook import tqdm\n","import numpy as np"],"metadata":{"id":"KKb-ikj4J-in"},"execution_count":null,"outputs":[]},{"cell_type":"code","source":["# Load JSON data from a file\n","with open(\"mcq_data.json\", \"r\") as f:\n"," data = json.load(f)\n","\n","# Create a Pandas DataFrame from the list of dictionaries\n","df = pd.DataFrame(data)\n","\n","# Calculate the number of rows for each dataset split\n","num_rows = len(df)\n","train_end = int(num_rows * 0.8) # 80% for training\n","test_end = train_end + int(num_rows * 0.1) # 10% for testing\n","\n","# Split the DataFrame into training, testing, and validation sets\n","df_train = df[:train_end]\n","df_test = df[train_end:test_end]\n","df_val = df[test_end:] # Ensures the remainder is used for validation\n","\n","# Create Datasets from the DataFrames\n","dataset_train = Dataset.from_pandas(df_train)\n","dataset_test = Dataset.from_pandas(df_test)\n","dataset_val = 
Dataset.from_pandas(df_val)\n","\n","# Create a DatasetDict containing the split datasets\n","dataset = DatasetDict({\n"," 'train': dataset_train,\n"," 'test': dataset_test,\n"," 'val': dataset_val\n","})\n","\n","# Print the structure of the created DatasetDict\n","print(dataset)"],"metadata":{"id":"bGi9FdmhdBDg","colab":{"base_uri":"https://localhost:8080/"},"executionInfo":{"status":"ok","timestamp":1708322802114,"user_tz":-480,"elapsed":19,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"e9369555-5be7-4b43-a6c4-3defec1485d6"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["DatasetDict({\n"," train: Dataset({\n"," features: ['Instruction', 'Question', 'A', 'B', 'C', 'D', 'Correct Answer', 'Explanation'],\n"," num_rows: 334\n"," })\n"," test: Dataset({\n"," features: ['Instruction', 'Question', 'A', 'B', 'C', 'D', 'Correct Answer', 'Explanation'],\n"," num_rows: 41\n"," })\n"," val: Dataset({\n"," features: ['Instruction', 'Question', 'A', 'B', 'C', 'D', 'Correct Answer', 'Explanation'],\n"," num_rows: 43\n"," })\n","})\n"]}]},{"cell_type":"code","source":["# Read as pandas DataFrame\n","dataset['train'].to_pandas()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":719},"id":"-MOvcr5mD8li","executionInfo":{"status":"ok","timestamp":1708322802114,"user_tz":-480,"elapsed":18,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"5b69a1a1-5307-4ec8-9b04-2c5035176d98"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" Instruction \\\n","0 Create an MCQ on the structure of artificial n... \n","1 Create an MCQ on the training process of artif... \n","2 Create an MCQ on the role of artificial neuron... \n","3 Create an MCQ on the purpose of hidden layers ... \n","4 Create an MCQ on the basics of deep learning \n",".. ... \n","329 Create an MCQ on the hyperparameter 'Kernel' i... 
\n","330 Create an MCQ on the hyperparameter 'Gamma' in... \n","331 Create an MCQ on the hyperparameter 'learning_... \n","332 Create an MCQ on the hyperparameter 'n_estimat... \n","333 Create an MCQ on the application of deep learn... \n","\n"," Question \\\n","0 What is the structure of an artificial neural ... \n","1 What is the purpose of the training process in... \n","2 What is the role of artificial neurons in neur... \n","3 What is the purpose of hidden layers in artifi... \n","4 What is deep learning? \n",".. ... \n","329 What does the hyperparameter 'Kernel' define i... \n","330 What does the hyperparameter 'Gamma' control i... \n","331 What does the hyperparameter 'learning_rate' d... \n","332 What does the hyperparameter 'n_estimators' de... \n","333 Which of the following is an application of de... \n","\n"," A \\\n","0 It consists of input layers and hidden layers ... \n","1 To adjust the weights of the connections betwe... \n","2 To receive input from external sources \n","3 To receive input from external sources \n","4 A branch of machine learning based on artifici... \n",".. ... \n","329 The step size taken by the optimizer during ea... \n","330 The step size taken by the optimizer during ea... \n","331 The step size taken by the optimizer during ea... \n","332 The step size taken by the optimizer during ea... \n","333 Analyzing sensor data in autonomous vehicles \n","\n"," B \\\n","0 It consists of input layers, hidden layers, an... \n","1 To propagate input data forward through the la... \n","2 To compute the weighted total of inputs \n","3 To compute the weighted total of inputs \n","4 A programming technique to explicitly define c... \n",".. ... \n","329 The trade-off between the margin and the numbe... \n","330 The trade-off between the margin and the numbe... \n","331 The trade-off between the margin and the numbe... \n","332 The trade-off between the margin and the numbe... 
\n","333 Recognizing objects and scenes in images \n","\n"," C \\\n","0 It consists of input layers, hidden layers, ou... \n","1 To calculate the error between the output and ... \n","2 To transfer information to the next layer \n","3 To transfer information to the next layer \n","4 A method to process large datasets using deep ... \n",".. ... \n","329 The similarity between data points \n","330 The similarity between data points \n","331 The similarity between data points \n","332 The number of boosting trees to be trained \n","333 Transcribing spoken words into text \n","\n"," D Correct Answer \\\n","0 It consists of input layers, hidden layers, ou... C \n","1 To achieve the desired level of performance A \n","2 All of the above D \n","3 To process and transform the input data D \n","4 A type of data structure inspired by the human... A \n",".. ... ... \n","329 The maximum depth of each tree in the ensemble C \n","330 The influence of support vectors on the decisi... D \n","331 The maximum depth of each tree in the ensemble A \n","332 The maximum depth of each tree in the ensemble C \n","333 Making personalized recommendations based on u... B \n","\n"," Explanation \n","0 An artificial neural network consists of input... \n","1 The purpose of the training process in artific... \n","2 The role of artificial neurons in neural netwo... \n","3 The purpose of hidden layers in artificial neu... \n","4 Deep learning is a branch of machine learning ... \n",".. ... \n","329 The hyperparameter 'Kernel' in Support Vector ... \n","330 The hyperparameter 'Gamma' in Support Vector M... \n","331 The hyperparameter 'learning_rate' in XGBoost ... \n","332 The hyperparameter 'n_estimators' in XGBoost d... \n","333 Deep learning algorithms are used in image and... \n","\n","[334 rows x 8 columns]"],"text/html":["\n","
\n","
\n","\n","\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| | Instruction | Question | A | B | C | D | Correct Answer | Explanation |
|---|---|---|---|---|---|---|---|---|
| 0 | Create an MCQ on the structure of artificial n... | What is the structure of an artificial neural ... | It consists of input layers and hidden layers ... | It consists of input layers, hidden layers, an... | It consists of input layers, hidden layers, ou... | It consists of input layers, hidden layers, ou... | C | An artificial neural network consists of input... |
| 1 | Create an MCQ on the training process of artif... | What is the purpose of the training process in... | To adjust the weights of the connections betwe... | To propagate input data forward through the la... | To calculate the error between the output and ... | To achieve the desired level of performance | A | The purpose of the training process in artific... |
| 2 | Create an MCQ on the role of artificial neuron... | What is the role of artificial neurons in neur... | To receive input from external sources | To compute the weighted total of inputs | To transfer information to the next layer | All of the above | D | The role of artificial neurons in neural netwo... |
| 3 | Create an MCQ on the purpose of hidden layers ... | What is the purpose of hidden layers in artifi... | To receive input from external sources | To compute the weighted total of inputs | To transfer information to the next layer | To process and transform the input data | D | The purpose of hidden layers in artificial neu... |
| 4 | Create an MCQ on the basics of deep learning | What is deep learning? | A branch of machine learning based on artifici... | A programming technique to explicitly define c... | A method to process large datasets using deep ... | A type of data structure inspired by the human... | A | Deep learning is a branch of machine learning ... |
| ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 329 | Create an MCQ on the hyperparameter 'Kernel' i... | What does the hyperparameter 'Kernel' define i... | The step size taken by the optimizer during ea... | The trade-off between the margin and the numbe... | The similarity between data points | The maximum depth of each tree in the ensemble | C | The hyperparameter 'Kernel' in Support Vector ... |
| 330 | Create an MCQ on the hyperparameter 'Gamma' in... | What does the hyperparameter 'Gamma' control i... | The step size taken by the optimizer during ea... | The trade-off between the margin and the numbe... | The similarity between data points | The influence of support vectors on the decisi... | D | The hyperparameter 'Gamma' in Support Vector M... |
| 331 | Create an MCQ on the hyperparameter 'learning_... | What does the hyperparameter 'learning_rate' d... | The step size taken by the optimizer during ea... | The trade-off between the margin and the numbe... | The similarity between data points | The maximum depth of each tree in the ensemble | A | The hyperparameter 'learning_rate' in XGBoost ... |
| 332 | Create an MCQ on the hyperparameter 'n_estimat... | What does the hyperparameter 'n_estimators' de... | The step size taken by the optimizer during ea... | The trade-off between the margin and the numbe... | The number of boosting trees to be trained | The maximum depth of each tree in the ensemble | C | The hyperparameter 'n_estimators' in XGBoost d... |
| 333 | Create an MCQ on the application of deep learn... | Which of the following is an application of de... | Analyzing sensor data in autonomous vehicles | Recognizing objects and scenes in images | Transcribing spoken words into text | Making personalized recommendations based on u... | B | Deep learning algorithms are used in image and... |
\n","

334 rows × 8 columns

\n"],"application/vnd.google.colaboratory.intrinsic+json":{"type":"dataframe","summary":"{\n \"name\": \"dataset['train']\",\n \"rows\": 334,\n \"fields\": [\n {\n \"column\": \"Instruction\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"Create an MCQ on the parameter gamma in Support Vector Machines (SVMs)\",\n \"Create an MCQ on the disadvantages of Artificial Neural Networks (ANNs)\",\n \"Create an MCQ on the role of machine learning in recommendation systems\"\n ],\n \"num_unique_values\": 165,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Question\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"What does the hyperparameter 'Kernel' determine in Support Vector Machines (SVMs)?\",\n \"Which of the following are types of deep learning architectures?\",\n \"Which of the following is NOT an application of deep learning in reinforcement learning?\"\n ],\n \"num_unique_values\": 221,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"A\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"Requires large amounts of labeled data\",\n \"AI is the broader family consisting of ML and DL as its components\",\n \"Increased computational cost\"\n ],\n \"num_unique_values\": 162,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"B\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Data clustering, dimensionality reduction, and anomaly detection\",\n \"Analyzing medical images to assist doctors in making diagnoses\",\n \"The reliance on manual feature engineering\"\n ],\n \"num_unique_values\": 193,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"C\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"It may result in overfitting\",\n \"Reduced overfitting and underfitting\",\n \"To automatically learn features from visual data\"\n ],\n \"num_unique_values\": 
211,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"D\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Overfitting\",\n \"Evaluating all possible combinations of hyperparameter values\",\n \"A branch of machine learning that uses linear regression\"\n ],\n \"num_unique_values\": 211,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Correct Answer\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"A\",\n \"B\",\n \"C\"\n ],\n \"num_unique_values\": 4,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Explanation\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"A key difference between machine learning and deep learning is the type of algorithms used. Machine learning applies statistical algorithms, while deep learning utilizes artificial neural network architecture to learn patterns and relationships.\",\n \"Hyperparameter tuning helps reduce overfitting and underfitting, leading to improved model performance and generalizability.\",\n \"Artificial Intelligence consists of the components: Artificial Intelligence, Machine Learning, and Deep Learning.\"\n ],\n \"num_unique_values\": 302,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}"}},"metadata":{},"execution_count":19}]},{"cell_type":"code","source":["# Read as pandas DataFrame\n","dataset['test'].to_pandas()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":1000},"id":"7WkFWJSQhKUV","executionInfo":{"status":"ok","timestamp":1708322802115,"user_tz":-480,"elapsed":17,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"8fd77b53-8311-42f7-a84d-00e396647a72"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" Instruction \\\n","0 Create an MCQ on the use of deep learning in g... \n","1 Create an MCQ on the use of deep learning in a... 
\n","2 Create an MCQ on the application of deep learn... \n","3 Create an MCQ on the use of deep learning in r... \n","4 Create an MCQ on the structure of artificial n... \n","5 Create an MCQ on the purpose of adjusting weig... \n","6 Create an MCQ on the role of artificial neuron... \n","7 Create an MCQ on the complexities of neural ne... \n","8 Create an MCQ on the difference between machin... \n","9 Create an MCQ on the definition of deep learning \n","10 Create an MCQ on the key characteristic of dee... \n","11 Create an MCQ on the applications of deep lear... \n","12 Create an MCQ on the training requirements of ... \n","13 Create an MCQ on the types of machine learning... \n","14 Create an MCQ on the types of neural networks ... \n","15 Create an MCQ on the applications of deep lear... \n","16 Create an MCQ on the applications of deep lear... \n","17 Create an MCQ on the applications of deep lear... \n","18 Create an MCQ on the main types of neural netw... \n","19 Create an MCQ on the definition of Artificial ... \n","20 Create an MCQ on the difference between Machin... \n","21 Create an MCQ on the components of Artificial ... \n","22 Create an MCQ on the aim of Machine Learning \n","23 Create an MCQ on the aim of Deep Learning \n","24 Create an MCQ on the difference between AI, Ma... \n","25 Create an MCQ on the application of AI in spee... \n","26 Create an MCQ on the application of AI in pers... \n","27 Create an MCQ on the application of AI in pred... \n","28 Create an MCQ on the application of AI in medi... \n","29 Create an MCQ on the difference between AI, ML... \n","30 Create an MCQ on the responsibilities of an AI... \n","31 Create an MCQ on the skills required for a Mac... \n","32 Create an MCQ on the tasks of a Deep Learning ... \n","33 Create an MCQ on the difference between ML and DL \n","34 Create an MCQ on the advantages of Artificial ... \n","35 Create an MCQ on the disadvantages of Artifici... 
\n","36 Create an MCQ on the advantages of Biological ... \n","37 Create an MCQ on the disadvantages of Biologic... \n","38 Create an MCQ on the differences between Artif... \n","39 Create an MCQ on hyperparameter tuning in mach... \n","40 Create an MCQ on the types of hyperparameters ... \n","\n"," Question \\\n","0 What is the role of deep learning in generativ... \n","1 How is deep learning used in autonomous vehicles? \n","2 What is the role of deep learning in speech re... \n","3 What is the application of deep learning in re... \n","4 What is the structure of an artificial neural ... \n","5 What is the purpose of adjusting weights in ar... \n","6 What is the role of artificial neurons in neur... \n","7 What determines the complexities of neural net... \n","8 What is a key difference between machine learn... \n","9 What is the definition of deep learning? \n","10 What is the key characteristic of deep learning? \n","11 Which of the following are applications of dee... \n","12 What are the training requirements for deep ne... \n","13 Which types of machine learning tasks can be p... \n","14 Which type of neural network is specifically d... \n","15 Which application of deep learning in computer... \n","16 Which application of deep learning in NLP invo... \n","17 Which application of deep learning in reinforc... \n","18 Which of the following are the main types of n... \n","19 Which of the following best defines Artificial... \n","20 What is the main difference between Machine Le... \n","21 Which of the following components are part of ... \n","22 What is the aim of Machine Learning? \n","23 What is the aim of Deep Learning? \n","24 Which of the following best describes the diff... \n","25 Which of the following is an example of AI app... \n","26 Which of the following is an example of AI app... \n","27 Which of the following is an example of AI app... \n","28 Which of the following is an example of AI app... 
\n","29 Which of the following statements accurately d... \n","30 Which of the following is a key responsibility... \n","31 Which of the following skills is essential for... \n","32 Which of the following is a key task of a Deep... \n","33 What distinguishes Deep Learning (DL) from Mac... \n","34 Which of the following is an advantage of Arti... \n","35 Which of the following is a disadvantage of Ar... \n","36 Which of the following is an advantage of Biol... \n","37 Which of the following is a disadvantage of Bi... \n","38 Which of the following is a difference between... \n","39 What is the purpose of hyperparameter tuning i... \n","40 Which of the following is a type of hyperparam... \n","\n"," A \\\n","0 Analyzing sensor data in autonomous vehicles \n","1 Analyzing sensor data in autonomous vehicles \n","2 Analyzing sensor data in autonomous vehicles \n","3 Analyzing sensor data in autonomous vehicles \n","4 It consists of input layers, hidden layers, an... \n","5 To increase the speed of training models \n","6 To receive input from external sources \n","7 The number of layers in the network \n","8 The type of algorithms used \n","9 A branch of machine learning that uses artific... \n","10 The use of shallow neural networks with a sing... \n","11 Image recognition, natural language processing... \n","12 Small datasets and limited computational resou... \n","13 Supervised machine learning only \n","14 Feedforward Neural Networks (FNNs) \n","15 Object detection and recognition \n","16 Automatic Text Generation \n","17 Game playing \n","18 Feedforward Neural Networks (FNNs) \n","19 The study of training machines to mimic human ... \n","20 Machine Learning uses statistical methods, whi... \n","21 Machine Learning and Deep Learning \n","22 To increase chances of success \n","23 To increase chances of success \n","24 AI is a subset of Machine Learning, which is a... \n","25 Analyzing users' browsing and viewing history ... 
\n","26 Analyzing users' browsing and viewing history ... \n","27 Analyzing users' browsing and viewing history ... \n","28 Analyzing users' browsing and viewing history ... \n","29 AI, ML, and DL are interchangeable terms that ... \n","30 Design and development of AI algorithms \n","31 Strong background in computer science, mathema... \n","32 Design and development of DL algorithms \n","33 DL is a more advanced form of ML that can perf... \n","34 Ability to learn irrespective of the type of data \n","35 Ability to learn irrespective of the type of data \n","36 Ability to learn irrespective of the type of data \n","37 Ability to learn irrespective of the type of data \n","38 Both ANNs and BNNs have complex and diverse ne... \n","39 To adjust the weights and biases of the model \n","40 Weights \n","\n"," B \\\n","0 Creating new content based on existing data \n","1 Recognizing objects and scenes in images \n","2 Recognizing objects and scenes in images \n","3 Recognizing objects and scenes in images \n","4 It consists of input layers and output layers ... \n","5 To prevent overfitting by validating the model... \n","6 To compute the weighted total of inputs \n","7 The number of units in each layer \n","8 The amount of data required \n","9 A type of programming that explicitly defines ... \n","10 The requirement for manual feature engineering \n","11 Data clustering, dimensionality reduction, and... \n","12 Large amounts of data and computational resources \n","13 Unsupervised machine learning only \n","14 Convolutional Neural Networks (CNNs) \n","15 Image classification \n","16 Language translation \n","17 Robotics \n","18 Convolutional Neural Networks (CNNs) \n","19 The study of statistical methods enabling mach... \n","20 Machine Learning focuses on learning from expe... \n","21 Machine Learning and Decision Trees \n","22 To increase accuracy \n","23 To increase accuracy \n","24 Machine Learning is a subset of AI, which is a... 
\n","25 Analyzing medical images to assist doctors in ... \n","26 Analyzing medical images to assist doctors in ... \n","27 Analyzing medical images to assist doctors in ... \n","28 Analyzing medical images to assist doctors in ... \n","29 AI focuses on creating intelligent machines, M... \n","30 Analysis and interpretation of data \n","31 Experience in developing AI algorithms and sol... \n","32 Analysis and interpretation of data \n","33 DL focuses on developing algorithms that enabl... \n","34 Simple architecture that makes it easy to expl... \n","35 Simple architecture that makes it easy to expl... \n","36 Simple architecture that makes it easy to expl... \n","37 Simple architecture that makes it easy to expl... \n","38 ANNs have fixed connections between neurons, w... \n","39 To select the optimal values for the model's h... \n","40 Biases \n","\n"," C \\\n","0 Transcribing spoken words into text \n","1 Transcribing spoken words into text \n","2 Transcribing spoken words into text \n","3 Making personalized recommendations based on u... \n","4 It consists of input layers, hidden layers, ou... \n","5 To enhance the model's performance on the trai... \n","6 To transfer information to the next layer \n","7 The type of activation function used \n","8 The complexity of the models \n","9 A technique that requires manual feature engin... \n","10 The use of deep neural networks with multiple ... \n","11 Supervised machine learning and unsupervised m... \n","12 Manual feature engineering and domain expertise \n","13 Reinforcement machine learning only \n","14 Recurrent Neural Networks (RNNs) \n","15 Image segmentation \n","16 Sentiment analysis \n","17 Control systems \n","18 Recurrent Neural Networks (RNNs) \n","19 The study that uses neural networks to imitate... 
\n","20 Machine Learning is a subset of Deep Learning \n","21 Artificial Intelligence and Machine Learning \n","22 To improve system efficiency \n","23 To improve system efficiency \n","24 Deep Learning is a subset of AI, which is a su... \n","25 Recognizing and classifying images and speech \n","26 Recognizing and classifying images and speech \n","27 Analyzing sensor data to predict equipment fai... \n","28 Recognizing and classifying images and speech \n","29 AI is a subset of ML that uses neural networks... \n","30 Training and evaluation of ML models \n","31 Familiarity with programming languages such as... \n","32 Training and evaluation of ML models \n","33 DL is a subset of ML that uses neural networks... \n","34 Dependence on hardware for functioning \n","35 Dependence on hardware for functioning \n","36 No controlling mechanism \n","37 No controlling mechanism \n","38 Both ANNs and BNNs have simple and predetermin... \n","39 To preprocess the input data before training t... \n","40 Learning rate \n","\n"," D Correct Answer \\\n","0 Making personalized recommendations based on u... B \n","1 Making personalized recommendations based on u... A \n","2 Making personalized recommendations based on u... C \n","3 Transcribing spoken words into text C \n","4 It consists of input layers and artificial neu... A \n","5 To reduce the computational cost of training C \n","6 All of the above D \n","7 The size of the dataset B \n","8 The performance on complex tasks B \n","9 A method of machine learning that only works w... A \n","10 The reliance on labeled datasets for training C \n","11 Data visualization and exploratory data analysis A \n","12 Pre-trained models and transfer learning B \n","13 Supervised, unsupervised, and reinforcement ma... D \n","14 None of the above B \n","15 None of the above A \n","16 Speech recognition C \n","17 None of the above B \n","18 All of the above D \n","19 The study of incorporating human intelligence ... 
D \n","20 Machine Learning requires human intervention, ... A \n","21 Artificial Intelligence and Deep Learning C \n","22 To analyze data and provide output B \n","23 To analyze data and provide output A \n","24 AI, Machine Learning, and Deep Learning are co... B \n","25 Analyzing sensor data to make decisions about ... C \n","26 Analyzing sensor data to make decisions about ... A \n","27 Recognizing and classifying images and speech C \n","28 Analyzing sensor data to make decisions about ... B \n","29 AI focuses on developing algorithms that enabl... B \n","30 Deployment and maintenance of DL models A \n","31 All of the above D \n","32 Deployment and maintenance of AI models A \n","33 ML is a more advanced form of DL that can perf... C \n","34 High speed of processing A \n","35 The simplest architecture makes it difficult t... D \n","36 Ability to process highly complex parallel inputs D \n","37 Speed of processing is slow D \n","38 ANNs and BNNs have the same processing speed B \n","39 To evaluate the performance of the model on a ... B \n","40 Activation function C \n","\n"," Explanation \n","0 Deep learning algorithms are used in generativ... \n","1 Deep learning algorithms are used in autonomou... \n","2 Deep learning algorithms are used in speech re... \n","3 Deep learning algorithms are used in recommend... \n","4 An artificial neural network consists of input... \n","5 The purpose of adjusting weights in artificial... \n","6 The role of artificial neurons in neural netwo... \n","7 The complexities of neural networks are determ... \n","8 A key difference between machine learning and ... \n","9 Deep learning is a branch of machine learning ... \n","10 The key characteristic of deep learning is the... \n","11 Deep learning has achieved significant success... \n","12 Training deep neural networks typically requir... \n","13 Deep learning can be used for supervised, unsu... \n","14 Convolutional Neural Networks (CNNs) are speci... 
\n","15 Object detection and recognition is the applic... \n","16 Sentiment analysis is the application of deep ... \n","17 Robotics is the application of deep learning i... \n","18 The main types of neural networks used in deep... \n","19 Artificial Intelligence is the mechanism to in... \n","20 The main difference between Machine Learning a... \n","21 Artificial Intelligence is the broader family ... \n","22 The aim of Machine Learning is to increase acc... \n","23 The aim of Deep Learning is to increase chance... \n","24 AI is the broader concept that encompasses the... \n","25 Speech recognition is an example of AI applica... \n","26 Personalized recommendations, as an AI applica... \n","27 AI-powered predictive maintenance systems anal... \n","28 AI-powered medical diagnosis systems analyze m... \n","29 AI, ML, and DL are related but distinct concep... \n","30 One of the key responsibilities of an AI Engin... \n","31 A Machine Learning Engineer should have a stro... \n","32 One of the key tasks of a Deep Learning Engine... \n","33 Deep Learning (DL) is a subset of Machine Lear... \n","34 One of the advantages of Artificial Neural Net... \n","35 One of the disadvantages of Artificial Neural ... \n","36 One of the advantages of Biological Neural Net... \n","37 One of the disadvantages of Biological Neural ... \n","38 One of the differences between Artificial Neur... \n","39 Hyperparameter tuning is the process of select... \n","40 In neural networks, the learning rate is a hyp... "],"text/html":["\n","
|  | Instruction | Question | A | B | C | D | Correct Answer | Explanation |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | Create an MCQ on the use of deep learning in g... | What is the role of deep learning in generativ... | Analyzing sensor data in autonomous vehicles | Creating new content based on existing data | Transcribing spoken words into text | Making personalized recommendations based on u... | B | Deep learning algorithms are used in generativ... |
| 1 | Create an MCQ on the use of deep learning in a... | How is deep learning used in autonomous vehicles? | Analyzing sensor data in autonomous vehicles | Recognizing objects and scenes in images | Transcribing spoken words into text | Making personalized recommendations based on u... | A | Deep learning algorithms are used in autonomou... |
| 2 | Create an MCQ on the application of deep learn... | What is the role of deep learning in speech re... | Analyzing sensor data in autonomous vehicles | Recognizing objects and scenes in images | Transcribing spoken words into text | Making personalized recommendations based on u... | C | Deep learning algorithms are used in speech re... |
| 3 | Create an MCQ on the use of deep learning in r... | What is the application of deep learning in re... | Analyzing sensor data in autonomous vehicles | Recognizing objects and scenes in images | Making personalized recommendations based on u... | Transcribing spoken words into text | C | Deep learning algorithms are used in recommend... |
| 4 | Create an MCQ on the structure of artificial n... | What is the structure of an artificial neural ... | It consists of input layers, hidden layers, an... | It consists of input layers and output layers ... | It consists of input layers, hidden layers, ou... | It consists of input layers and artificial neu... | A | An artificial neural network consists of input... |
| 5 | Create an MCQ on the purpose of adjusting weig... | What is the purpose of adjusting weights in ar... | To increase the speed of training models | To prevent overfitting by validating the model... | To enhance the model's performance on the trai... | To reduce the computational cost of training | C | The purpose of adjusting weights in artificial... |
| 6 | Create an MCQ on the role of artificial neuron... | What is the role of artificial neurons in neur... | To receive input from external sources | To compute the weighted total of inputs | To transfer information to the next layer | All of the above | D | The role of artificial neurons in neural netwo... |
| 7 | Create an MCQ on the complexities of neural ne... | What determines the complexities of neural net... | The number of layers in the network | The number of units in each layer | The type of activation function used | The size of the dataset | B | The complexities of neural networks are determ... |
| 8 | Create an MCQ on the difference between machin... | What is a key difference between machine learn... | The type of algorithms used | The amount of data required | The complexity of the models | The performance on complex tasks | B | A key difference between machine learning and ... |
| 9 | Create an MCQ on the definition of deep learning | What is the definition of deep learning? | A branch of machine learning that uses artific... | A type of programming that explicitly defines ... | A technique that requires manual feature engin... | A method of machine learning that only works w... | A | Deep learning is a branch of machine learning ... |
| 10 | Create an MCQ on the key characteristic of dee... | What is the key characteristic of deep learning? | The use of shallow neural networks with a sing... | The requirement for manual feature engineering | The use of deep neural networks with multiple ... | The reliance on labeled datasets for training | C | The key characteristic of deep learning is the... |
| 11 | Create an MCQ on the applications of deep lear... | Which of the following are applications of dee... | Image recognition, natural language processing... | Data clustering, dimensionality reduction, and... | Supervised machine learning and unsupervised m... | Data visualization and exploratory data analysis | A | Deep learning has achieved significant success... |
| 12 | Create an MCQ on the training requirements of ... | What are the training requirements for deep ne... | Small datasets and limited computational resou... | Large amounts of data and computational resources | Manual feature engineering and domain expertise | Pre-trained models and transfer learning | B | Training deep neural networks typically requir... |
| 13 | Create an MCQ on the types of machine learning... | Which types of machine learning tasks can be p... | Supervised machine learning only | Unsupervised machine learning only | Reinforcement machine learning only | Supervised, unsupervised, and reinforcement ma... | D | Deep learning can be used for supervised, unsu... |
| 14 | Create an MCQ on the types of neural networks ... | Which type of neural network is specifically d... | Feedforward Neural Networks (FNNs) | Convolutional Neural Networks (CNNs) | Recurrent Neural Networks (RNNs) | None of the above | B | Convolutional Neural Networks (CNNs) are speci... |
| 15 | Create an MCQ on the applications of deep lear... | Which application of deep learning in computer... | Object detection and recognition | Image classification | Image segmentation | None of the above | A | Object detection and recognition is the applic... |
| 16 | Create an MCQ on the applications of deep lear... | Which application of deep learning in NLP invo... | Automatic Text Generation | Language translation | Sentiment analysis | Speech recognition | C | Sentiment analysis is the application of deep ... |
| 17 | Create an MCQ on the applications of deep lear... | Which application of deep learning in reinforc... | Game playing | Robotics | Control systems | None of the above | B | Robotics is the application of deep learning i... |
| 18 | Create an MCQ on the main types of neural netw... | Which of the following are the main types of n... | Feedforward Neural Networks (FNNs) | Convolutional Neural Networks (CNNs) | Recurrent Neural Networks (RNNs) | All of the above | D | The main types of neural networks used in deep... |
| 19 | Create an MCQ on the definition of Artificial ... | Which of the following best defines Artificial... | The study of training machines to mimic human ... | The study of statistical methods enabling mach... | The study that uses neural networks to imitate... | The study of incorporating human intelligence ... | D | Artificial Intelligence is the mechanism to in... |
| 20 | Create an MCQ on the difference between Machin... | What is the main difference between Machine Le... | Machine Learning uses statistical methods, whi... | Machine Learning focuses on learning from expe... | Machine Learning is a subset of Deep Learning | Machine Learning requires human intervention, ... | A | The main difference between Machine Learning a... |
| 21 | Create an MCQ on the components of Artificial ... | Which of the following components are part of ... | Machine Learning and Deep Learning | Machine Learning and Decision Trees | Artificial Intelligence and Machine Learning | Artificial Intelligence and Deep Learning | C | Artificial Intelligence is the broader family ... |
| 22 | Create an MCQ on the aim of Machine Learning | What is the aim of Machine Learning? | To increase chances of success | To increase accuracy | To improve system efficiency | To analyze data and provide output | B | The aim of Machine Learning is to increase acc... |
| 23 | Create an MCQ on the aim of Deep Learning | What is the aim of Deep Learning? | To increase chances of success | To increase accuracy | To improve system efficiency | To analyze data and provide output | A | The aim of Deep Learning is to increase chance... |
| 24 | Create an MCQ on the difference between AI, Ma... | Which of the following best describes the diff... | AI is a subset of Machine Learning, which is a... | Machine Learning is a subset of AI, which is a... | Deep Learning is a subset of AI, which is a su... | AI, Machine Learning, and Deep Learning are co... | B | AI is the broader concept that encompasses the... |
| 25 | Create an MCQ on the application of AI in spee... | Which of the following is an example of AI app... | Analyzing users' browsing and viewing history ... | Analyzing medical images to assist doctors in ... | Recognizing and classifying images and speech | Analyzing sensor data to make decisions about ... | C | Speech recognition is an example of AI applica... |
| 26 | Create an MCQ on the application of AI in pers... | Which of the following is an example of AI app... | Analyzing users' browsing and viewing history ... | Analyzing medical images to assist doctors in ... | Recognizing and classifying images and speech | Analyzing sensor data to make decisions about ... | A | Personalized recommendations, as an AI applica... |
| 27 | Create an MCQ on the application of AI in pred... | Which of the following is an example of AI app... | Analyzing users' browsing and viewing history ... | Analyzing medical images to assist doctors in ... | Analyzing sensor data to predict equipment fai... | Recognizing and classifying images and speech | C | AI-powered predictive maintenance systems anal... |
| 28 | Create an MCQ on the application of AI in medi... | Which of the following is an example of AI app... | Analyzing users' browsing and viewing history ... | Analyzing medical images to assist doctors in ... | Recognizing and classifying images and speech | Analyzing sensor data to make decisions about ... | B | AI-powered medical diagnosis systems analyze m... |
| 29 | Create an MCQ on the difference between AI, ML... | Which of the following statements accurately d... | AI, ML, and DL are interchangeable terms that ... | AI focuses on creating intelligent machines, M... | AI is a subset of ML that uses neural networks... | AI focuses on developing algorithms that enabl... | B | AI, ML, and DL are related but distinct concep... |
| 30 | Create an MCQ on the responsibilities of an AI... | Which of the following is a key responsibility... | Design and development of AI algorithms | Analysis and interpretation of data | Training and evaluation of ML models | Deployment and maintenance of DL models | A | One of the key responsibilities of an AI Engin... |
| 31 | Create an MCQ on the skills required for a Mac... | Which of the following skills is essential for... | Strong background in computer science, mathema... | Experience in developing AI algorithms and sol... | Familiarity with programming languages such as... | All of the above | D | A Machine Learning Engineer should have a stro... |
| 32 | Create an MCQ on the tasks of a Deep Learning ... | Which of the following is a key task of a Deep... | Design and development of DL algorithms | Analysis and interpretation of data | Training and evaluation of ML models | Deployment and maintenance of AI models | A | One of the key tasks of a Deep Learning Engine... |
| 33 | Create an MCQ on the difference between ML and DL | What distinguishes Deep Learning (DL) from Mac... | DL is a more advanced form of ML that can perf... | DL focuses on developing algorithms that enabl... | DL is a subset of ML that uses neural networks... | ML is a more advanced form of DL that can perf... | C | Deep Learning (DL) is a subset of Machine Lear... |
| 34 | Create an MCQ on the advantages of Artificial ... | Which of the following is an advantage of Arti... | Ability to learn irrespective of the type of data | Simple architecture that makes it easy to expl... | Dependence on hardware for functioning | High speed of processing | A | One of the advantages of Artificial Neural Net... |
| 35 | Create an MCQ on the disadvantages of Artifici... | Which of the following is a disadvantage of Ar... | Ability to learn irrespective of the type of data | Simple architecture that makes it easy to expl... | Dependence on hardware for functioning | The simplest architecture makes it difficult t... | D | One of the disadvantages of Artificial Neural ... |
36Create an MCQ on the advantages of Biological ...Which of the following is an advantage of Biol...Ability to learn irrespective of the type of dataSimple architecture that makes it easy to expl...No controlling mechanismAbility to process highly complex parallel inputsDOne of the advantages of Biological Neural Net...
37Create an MCQ on the disadvantages of Biologic...Which of the following is a disadvantage of Bi...Ability to learn irrespective of the type of dataSimple architecture that makes it easy to expl...No controlling mechanismSpeed of processing is slowDOne of the disadvantages of Biological Neural ...
38Create an MCQ on the differences between Artif...Which of the following is a difference between...Both ANNs and BNNs have complex and diverse ne...ANNs have fixed connections between neurons, w...Both ANNs and BNNs have simple and predetermin...ANNs and BNNs have the same processing speedBOne of the differences between Artificial Neur...
39Create an MCQ on hyperparameter tuning in mach...What is the purpose of hyperparameter tuning i...To adjust the weights and biases of the modelTo select the optimal values for the model's h...To preprocess the input data before training t...To evaluate the performance of the model on a ...BHyperparameter tuning is the process of select...
40Create an MCQ on the types of hyperparameters ...Which of the following is a type of hyperparam...WeightsBiasesLearning rateActivation functionCIn neural networks, the learning rate is a hyp...
\n"],"application/vnd.google.colaboratory.intrinsic+json":{"type":"dataframe","summary":"{\n \"name\": \"dataset['test']\",\n \"rows\": 41,\n \"fields\": [\n {\n \"column\": \"Instruction\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Create an MCQ on the difference between AI, Machine Learning, and Deep Learning\",\n \"Create an MCQ on the types of machine learning tasks that can be performed using deep learning\",\n \"Create an MCQ on the difference between machine learning and deep learning\"\n ],\n \"num_unique_values\": 41,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Question\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Which of the following best describes the difference between AI, Machine Learning, and Deep Learning?\",\n \"Which types of machine learning tasks can be performed using deep learning?\",\n \"What is a key difference between machine learning and deep learning?\"\n ],\n \"num_unique_values\": 41,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"A\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Both ANNs and BNNs have complex and diverse neurons\",\n \"The study of training machines to mimic human behavior\",\n \"Strong background in computer science, mathematics, and statistics\"\n ],\n \"num_unique_values\": 30,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"B\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"ANNs have fixed connections between neurons, while BNNs have flexible connections\",\n \"Robotics\",\n \"Analysis and interpretation of data\"\n ],\n \"num_unique_values\": 30,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"C\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Both ANNs and BNNs have simple and predetermined neural pathways\",\n \"Control systems\",\n \"Training and evaluation of ML 
models\"\n ],\n \"num_unique_values\": 32,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"D\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"ANNs and BNNs have the same processing speed\",\n \"Machine Learning requires human intervention, while Deep Learning does not\",\n \"ML is a more advanced form of DL that can perform complex tasks.\"\n ],\n \"num_unique_values\": 32,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Correct Answer\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"A\",\n \"D\",\n \"B\"\n ],\n \"num_unique_values\": 4,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Explanation\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"AI is the broader concept that encompasses the development of computer systems that can perform tasks requiring human intelligence. Machine Learning is a subset of AI, focused on algorithms that can learn from data and make predictions or decisions. Deep Learning is a subset of Machine Learning, specifically using neural networks with multiple layers to learn and represent complex patterns.\",\n \"Deep learning can be used for supervised, unsupervised, as well as reinforcement machine learning tasks. It provides a versatile approach to process and learn from data in various learning scenarios.\",\n \"A key difference between machine learning and deep learning is the amount of data required. 
Machine learning can work with a smaller amount of data, while deep learning requires a larger volume of data to train the complex neural network architectures effectively.\"\n ],\n \"num_unique_values\": 41,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}"}},"metadata":{},"execution_count":20}]},{"cell_type":"code","source":["# Read as pandas DataFrame\n","dataset['val'].to_pandas()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":1000},"id":"03oFFWL4hM4S","executionInfo":{"status":"ok","timestamp":1708322802115,"user_tz":-480,"elapsed":16,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"284e74e0-5907-4982-cd82-31dfc59696e3"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" Instruction \\\n","0 Create an MCQ on the impact of learning rate o... \n","1 Create an MCQ on the impact of number of epoch... \n","2 Create an MCQ on the impact of architecture on... \n","3 Create an MCQ on the strategies for hyperparam... \n","4 Create an MCQ on the drawback of GridSearchCV \n","5 Create an MCQ on the strategy that selects val... \n","6 Create an MCQ on the advantage of RandomizedSe... \n","7 Create an MCQ on the strategy that treats hype... \n","8 Create an MCQ on the challenges in deep learning \n","9 Create an MCQ on the advantages of deep learning \n","10 Create an MCQ on the disadvantages of deep lea... \n","11 Create an MCQ on the challenges in interpretin... \n","12 Create an MCQ on the risk of overfitting in de... \n","13 Create an MCQ on the application of machine le... \n","14 Create an MCQ on the use of machine learning i... \n","15 Create an MCQ on the role of machine learning ... \n","16 Create an MCQ on the application of machine le... \n","17 Create an MCQ on the use of machine learning i... \n","18 Create an MCQ on the regularization parameter ... 
\n","19 Create an MCQ on the kernel function in SVMs \n","20 Create an MCQ on the parameter that controls t... \n","21 Create an MCQ on the learning rate hyperparame... \n","22 Create an MCQ on the max_depth hyperparameter ... \n","23 Create an MCQ on the structure of artificial n... \n","24 Create an MCQ on the training process of artif... \n","25 Create an MCQ on the differences between machi... \n","26 Create an MCQ on the complexity of interpretin... \n","27 Create an MCQ on the computing power requireme... \n","28 Create an MCQ on the definition of deep learning \n","29 Create an MCQ on the key characteristic of dee... \n","30 Create an MCQ on the success of deep learning ... \n","31 Create an MCQ on the requirements for training... \n","32 Create an MCQ on the types of machine learning... \n","33 Create an MCQ on the types of neural networks ... \n","34 Create an MCQ on the applications of deep lear... \n","35 Create an MCQ on the applications of deep lear... \n","36 Create an MCQ on the applications of deep lear... \n","37 Create an MCQ on the main purpose of deep lear... \n","38 Create an MCQ on the definition of Artificial ... \n","39 Create an MCQ on the definition of Machine Lea... \n","40 Create an MCQ on the definition of Deep Learning. \n","41 Create an MCQ on the relationship between Arti... \n","42 Create an MCQ on the aim of Deep Learning. \n","\n"," Question \\\n","0 What impact does the learning rate hyperparame... \n","1 How does the number of epochs hyperparameter a... \n","2 How does the architecture of a neural network ... \n","3 Which of the following strategies is considere... \n","4 What is a drawback of using GridSearchCV for h... \n","5 Which hyperparameter tuning strategy selects v... \n","6 What is an advantage of using RandomizedSearch... \n","7 Which hyperparameter tuning strategy treats th... \n","8 What is one of the challenges in deep learning? \n","9 What is one of the advantages of deep learning? 
\n","10 What is one of the disadvantages of deep learn... \n","11 What is one of the challenges in interpreting ... \n","12 What is the risk associated with overfitting i... \n","13 In which of the following applications are mac... \n","14 Which of the following applications utilize ma... \n","15 Which of the following applications involve th... \n","16 Which of the following applications utilize ma... \n","17 In which of the following applications are mac... \n","18 What is the role of the regularization paramet... \n","19 What is the purpose of the kernel function in ... \n","20 Which parameter controls the influence of supp... \n","21 What does the learning rate hyperparameter det... \n","22 What does the max_depth hyperparameter determi... \n","23 Which layer of an artificial neural network re... \n","24 What is adjusted during the training process o... \n","25 Which of the following requires a larger volum... \n","26 Which of the following is true regarding the i... \n","27 Which of the following requires a high-perform... \n","28 What is deep learning? \n","29 What is the key characteristic of deep learning? \n","30 In which fields has deep learning achieved sig... \n","31 What are the requirements for training deep ne... \n","32 Which types of machine learning are used in de... \n","33 Which type of neural network is specifically d... \n","34 What is one of the main applications of deep l... \n","35 What is one of the main applications of deep l... \n","36 What is one of the main applications of deep l... \n","37 What is the main purpose of deep learning mode... \n","38 What is the definition of Artificial Intellige... \n","39 What is the definition of Machine Learning? \n","40 What is the definition of Deep Learning? \n","41 What is the relationship between Artificial In... \n","42 What is the aim of Deep Learning? \n","\n"," A \\\n","0 It determines the number of epochs needed for ... \n","1 Increasing the number of epochs always improve... 
\n","2 The architecture determines the learning rate ... \n","3 GridSearchCV \n","4 It is computationally expensive \n","5 GridSearchCV \n","6 It is computationally faster \n","7 GridSearchCV \n","8 Limited computational resources \n","9 Low accuracy \n","10 Low computational requirements \n","11 Easy interpretability \n","12 Improved performance on new data \n","13 Self-driving cars \n","14 Virtual assistants like Siri and Alexa \n","15 Chatbots \n","16 E-commerce sites \n","17 Social media monitoring \n","18 To control the trade-off between the margin an... \n","19 To control the trade-off between the margin an... \n","20 Regularization parameter (C) \n","21 The step size taken by the optimizer during ea... \n","22 The step size taken by the optimizer during ea... \n","23 Output layer \n","24 Weights \n","25 Machine learning \n","26 Machine learning results are easy to interpret \n","27 Machine learning \n","28 A branch of machine learning that uses artific... \n","29 The use of deep neural networks with multiple ... \n","30 Image recognition, natural language processing... \n","31 A large amount of data and computational resou... \n","32 Supervised, unsupervised, and reinforcement le... \n","33 Feedforward Neural Networks (FNNs) \n","34 Speech recognition \n","35 Object detection and recognition \n","36 Sentiment analysis \n","37 To analyze the sentiment of text \n","38 The study of training machines to mimic human ... \n","39 The study of training machines to mimic human ... \n","40 The study of training machines to mimic human ... \n","41 AI is a subset of ML \n","42 To increase chances of success \n","\n"," B \\\n","0 It controls the step size taken by the optimiz... \n","1 Increasing the number of epochs can lead to ov... \n","2 The architecture controls the step size taken ... 
\n","3 RandomizedSearchCV \n","4 It requires expert knowledge \n","5 RandomizedSearchCV \n","6 It guarantees optimal performance \n","7 RandomizedSearchCV \n","8 Easy interpretability of results \n","9 Manual feature engineering \n","10 Small amount of labeled data \n","11 Clear decision-making process \n","12 No impact on model performance \n","13 Security systems \n","14 Call centers \n","15 Virtual assistants \n","16 Streaming services \n","17 Sentiment analysis systems \n","18 To define the similarity between data points \n","19 To define the similarity between data points \n","20 Kernel function \n","21 The number of boosting trees to be trained \n","22 The number of boosting trees to be trained \n","23 Hidden layer \n","24 Layers \n","25 Deep learning \n","26 Deep learning results are easy to interpret \n","27 Deep learning \n","28 A technique in machine learning that involves ... \n","29 The use of decision trees for modeling \n","30 Clustering, dimensionality reduction, and anom... \n","31 The availability of cloud computing and specia... \n","32 Supervised and unsupervised learning \n","33 Convolutional Neural Networks (CNNs) \n","34 Sentiment analysis \n","35 Image classification \n","36 Image segmentation \n","37 To translate text from one language to another \n","38 The study of improving machines with experienc... \n","39 The study of improving machines with experienc... \n","40 The study of improving machines with experienc... \n","41 ML is a subset of AI \n","42 To increase accuracy \n","\n"," C \\\n","0 It determines the depth of the neural network \n","1 The number of epochs does not have any impact ... \n","2 The architecture determines the depth and widt... 
\n","3 Bayesian Optimization \n","4 It may result in overfitting \n","5 Bayesian Optimization \n","6 It requires less expertise \n","7 Bayesian Optimization \n","8 Small amount of training data \n","9 Limited scalability \n","10 Easy interpretability \n","11 Limited complexity \n","12 Poor performance on new data \n","13 Medical imaging \n","14 Speech recognition systems \n","15 NLP systems \n","16 Recommendation systems \n","17 Spam filters \n","18 To determine the influence of support vectors ... \n","19 To determine the influence of support vectors ... \n","20 Gamma \n","21 The maximum depth of each tree in the ensemble \n","22 The maximum depth of each tree in the ensemble \n","23 Input layer \n","24 Neurons \n","25 Both require the same volume of dataset \n","26 Both machine learning and deep learning result... \n","27 Both require the same computing power \n","28 A type of unsupervised machine learning that c... \n","29 The use of reinforcement learning algorithms \n","30 Supervised machine learning tasks like image c... \n","31 Manual feature engineering \n","32 Unsupervised and reinforcement learning \n","33 Recurrent Neural Networks (RNNs) \n","34 Image classification \n","35 Language translation \n","36 Game playing \n","37 To identify and understand visual data \n","38 The study that uses neural networks to imitate... \n","39 The study that uses neural networks to imitate... \n","40 The study that uses neural networks to imitate... \n","41 AI and ML are independent of each other \n","42 To achieve high accuracy with a small amount o... \n","\n"," D Correct Answer \\\n","0 It controls the width of the neural network B \n","1 The number of epochs determines the learning r... B \n","2 The architecture affects the activation functi... C \n","3 None of the above A \n","4 It is not effective for high-dimensional hyper... A \n","5 None of the above B \n","6 It is more effective for high-dimensional hype... 
A \n","7 None of the above C \n","8 No risk of overfitting A \n","9 Continual improvement D \n","10 No risk of overfitting B \n","11 Black box nature D \n","12 Reduced computational requirements C \n","13 All of the above D \n","14 All of the above D \n","15 All of the above D \n","16 All of the above D \n","17 All of the above D \n","18 To determine the maximum depth of each tree in... A \n","19 To determine the maximum depth of each tree in... B \n","20 Learning rate C \n","21 The minimum sum of instance weight needed in a... A \n","22 The minimum sum of instance weight needed in a... C \n","23 Final layer C \n","24 Connections A \n","25 Neither requires a dataset B \n","26 Deep learning results are difficult to interpret D \n","27 Neither requires computing power B \n","28 A method in machine learning that uses decisio... A \n","29 The use of unsupervised learning techniques A \n","30 Reinforcement learning tasks like robotics and... A \n","31 Supervised labeled datasets A \n","32 Supervised and reinforcement learning A \n","33 Artificial Neural Networks (ANNs) B \n","34 Game playing C \n","35 Game playing C \n","36 Speech recognition C \n","37 To control complex systems C \n","38 The study that focuses on learning, reasoning,... A \n","39 The study that focuses on learning from data a... D \n","40 The study that focuses on using neural network... D \n","41 AI and ML are the same thing B \n","42 To achieve high accuracy with a large amount o... D \n","\n"," Explanation \n","0 The learning rate hyperparameter controls the ... \n","1 Increasing the number of epochs can improve th... \n","2 The architecture of a neural network determine... \n","3 GridSearchCV is considered a 'brute force' app... \n","4 GridSearchCV is computationally expensive as i... \n","5 RandomizedSearchCV selects values at random fo... \n","6 RandomizedSearchCV is computationally faster t... \n","7 Bayesian Optimization treats the search for op... 
\n","8 One of the challenges in deep learning is the ... \n","9 One of the advantages of deep learning is its ... \n","10 One of the disadvantages of deep learning is t... \n","11 One of the challenges in interpreting deep lea... \n","12 The risk associated with overfitting in deep l... \n","13 Machine learning algorithms are used in image ... \n","14 Machine learning algorithms are used in speech... \n","15 Machine learning algorithms are used in NLP sy... \n","16 Machine learning algorithms are used in recomm... \n","17 Machine learning algorithms are used in sentim... \n","18 The regularization parameter (C) in SVMs contr... \n","19 The kernel function in SVMs defines the simila... \n","20 The parameter Gamma controls the influence of ... \n","21 The learning rate hyperparameter in XGBoost de... \n","22 The max_depth hyperparameter in XGBoost determ... \n","23 The input layer of an artificial neural networ... \n","24 During the training process of an artificial n... \n","25 Deep learning requires a larger volume of data... \n","26 Deep learning results are more complex and dif... \n","27 Deep learning requires a high-performance comp... \n","28 Deep learning is a branch of machine learning ... \n","29 The key characteristic of deep learning is the... \n","30 Deep learning has achieved significant success... \n","31 Training deep neural networks typically requir... \n","32 Deep learning can be used for supervised, unsu... \n","33 Convolutional Neural Networks (CNNs) are speci... \n","34 One of the main applications of deep learning ... \n","35 One of the main applications of deep learning ... \n","36 One of the main applications of deep learning ... \n","37 The main purpose of deep learning models in co... \n","38 Artificial Intelligence is the mechanism to in... \n","39 Machine Learning is the study/process which pr... \n","40 Deep Learning is a sub-part of the broader fam... \n","41 Artificial Intelligence is the broader family ... 
\n","42 The aim of Deep Learning is to achieve high ac... "],"text/html":["\n","
|  | Instruction | Question | A | B | C | D | Correct Answer | Explanation |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | Create an MCQ on the impact of learning rate o... | What impact does the learning rate hyperparame... | It determines the number of epochs needed for ... | It controls the step size taken by the optimiz... | It determines the depth of the neural network | It controls the width of the neural network | B | The learning rate hyperparameter controls the ... |
| 1 | Create an MCQ on the impact of number of epoch... | How does the number of epochs hyperparameter a... | Increasing the number of epochs always improve... | Increasing the number of epochs can lead to ov... | The number of epochs does not have any impact ... | The number of epochs determines the learning r... | B | Increasing the number of epochs can improve th... |
| 2 | Create an MCQ on the impact of architecture on... | How does the architecture of a neural network ... | The architecture determines the learning rate ... | The architecture controls the step size taken ... | The architecture determines the depth and widt... | The architecture affects the activation functi... | C | The architecture of a neural network determine... |
| 3 | Create an MCQ on the strategies for hyperparam... | Which of the following strategies is considere... | GridSearchCV | RandomizedSearchCV | Bayesian Optimization | None of the above | A | GridSearchCV is considered a 'brute force' app... |
| 4 | Create an MCQ on the drawback of GridSearchCV | What is a drawback of using GridSearchCV for h... | It is computationally expensive | It requires expert knowledge | It may result in overfitting | It is not effective for high-dimensional hyper... | A | GridSearchCV is computationally expensive as i... |
| 5 | Create an MCQ on the strategy that selects val... | Which hyperparameter tuning strategy selects v... | GridSearchCV | RandomizedSearchCV | Bayesian Optimization | None of the above | B | RandomizedSearchCV selects values at random fo... |
| 6 | Create an MCQ on the advantage of RandomizedSe... | What is an advantage of using RandomizedSearch... | It is computationally faster | It guarantees optimal performance | It requires less expertise | It is more effective for high-dimensional hype... | A | RandomizedSearchCV is computationally faster t... |
| 7 | Create an MCQ on the strategy that treats hype... | Which hyperparameter tuning strategy treats th... | GridSearchCV | RandomizedSearchCV | Bayesian Optimization | None of the above | C | Bayesian Optimization treats the search for op... |
| 8 | Create an MCQ on the challenges in deep learning | What is one of the challenges in deep learning? | Limited computational resources | Easy interpretability of results | Small amount of training data | No risk of overfitting | A | One of the challenges in deep learning is the ... |
| 9 | Create an MCQ on the advantages of deep learning | What is one of the advantages of deep learning? | Low accuracy | Manual feature engineering | Limited scalability | Continual improvement | D | One of the advantages of deep learning is its ... |
| 10 | Create an MCQ on the disadvantages of deep lea... | What is one of the disadvantages of deep learn... | Low computational requirements | Small amount of labeled data | Easy interpretability | No risk of overfitting | B | One of the disadvantages of deep learning is t... |
| 11 | Create an MCQ on the challenges in interpretin... | What is one of the challenges in interpreting ... | Easy interpretability | Clear decision-making process | Limited complexity | Black box nature | D | One of the challenges in interpreting deep lea... |
| 12 | Create an MCQ on the risk of overfitting in de... | What is the risk associated with overfitting i... | Improved performance on new data | No impact on model performance | Poor performance on new data | Reduced computational requirements | C | The risk associated with overfitting in deep l... |
| 13 | Create an MCQ on the application of machine le... | In which of the following applications are mac... | Self-driving cars | Security systems | Medical imaging | All of the above | D | Machine learning algorithms are used in image ... |
| 14 | Create an MCQ on the use of machine learning i... | Which of the following applications utilize ma... | Virtual assistants like Siri and Alexa | Call centers | Speech recognition systems | All of the above | D | Machine learning algorithms are used in speech... |
| 15 | Create an MCQ on the role of machine learning ... | Which of the following applications involve th... | Chatbots | Virtual assistants | NLP systems | All of the above | D | Machine learning algorithms are used in NLP sy... |
| 16 | Create an MCQ on the application of machine le... | Which of the following applications utilize ma... | E-commerce sites | Streaming services | Recommendation systems | All of the above | D | Machine learning algorithms are used in recomm... |
| 17 | Create an MCQ on the use of machine learning i... | In which of the following applications are mac... | Social media monitoring | Sentiment analysis systems | Spam filters | All of the above | D | Machine learning algorithms are used in sentim... |
| 18 | Create an MCQ on the regularization parameter ... | What is the role of the regularization paramet... | To control the trade-off between the margin an... | To define the similarity between data points | To determine the influence of support vectors ... | To determine the maximum depth of each tree in... | A | The regularization parameter (C) in SVMs contr... |
| 19 | Create an MCQ on the kernel function in SVMs | What is the purpose of the kernel function in ... | To control the trade-off between the margin an... | To define the similarity between data points | To determine the influence of support vectors ... | To determine the maximum depth of each tree in... | B | The kernel function in SVMs defines the simila... |
| 20 | Create an MCQ on the parameter that controls t... | Which parameter controls the influence of supp... | Regularization parameter (C) | Kernel function | Gamma | Learning rate | C | The parameter Gamma controls the influence of ... |
| 21 | Create an MCQ on the learning rate hyperparame... | What does the learning rate hyperparameter det... | The step size taken by the optimizer during ea... | The number of boosting trees to be trained | The maximum depth of each tree in the ensemble | The minimum sum of instance weight needed in a... | A | The learning rate hyperparameter in XGBoost de... |
| 22 | Create an MCQ on the max_depth hyperparameter ... | What does the max_depth hyperparameter determi... | The step size taken by the optimizer during ea... | The number of boosting trees to be trained | The maximum depth of each tree in the ensemble | The minimum sum of instance weight needed in a... | C | The max_depth hyperparameter in XGBoost determ... |
| 23 | Create an MCQ on the structure of artificial n... | Which layer of an artificial neural network re... | Output layer | Hidden layer | Input layer | Final layer | C | The input layer of an artificial neural networ... |
| 24 | Create an MCQ on the training process of artif... | What is adjusted during the training process o... | Weights | Layers | Neurons | Connections | A | During the training process of an artificial n... |
| 25 | Create an MCQ on the differences between machi... | Which of the following requires a larger volum... | Machine learning | Deep learning | Both require the same volume of dataset | Neither requires a dataset | B | Deep learning requires a larger volume of data... |
| 26 | Create an MCQ on the complexity of interpretin... | Which of the following is true regarding the i... | Machine learning results are easy to interpret | Deep learning results are easy to interpret | Both machine learning and deep learning result... | Deep learning results are difficult to interpret | D | Deep learning results are more complex and dif... |
| 27 | Create an MCQ on the computing power requireme... | Which of the following requires a high-perform... | Machine learning | Deep learning | Both require the same computing power | Neither requires computing power | B | Deep learning requires a high-performance comp... |
| 28 | Create an MCQ on the definition of deep learning | What is deep learning? | A branch of machine learning that uses artific... | A technique in machine learning that involves ... | A type of unsupervised machine learning that c... | A method in machine learning that uses decisio... | A | Deep learning is a branch of machine learning ... |
| 29 | Create an MCQ on the key characteristic of dee... | What is the key characteristic of deep learning? | The use of deep neural networks with multiple ... | The use of decision trees for modeling | The use of reinforcement learning algorithms | The use of unsupervised learning techniques | A | The key characteristic of deep learning is the... |
| 30 | Create an MCQ on the success of deep learning ... | In which fields has deep learning achieved sig... | Image recognition, natural language processing... | Clustering, dimensionality reduction, and anom... | Supervised machine learning tasks like image c... | Reinforcement learning tasks like robotics and... | A | Deep learning has achieved significant success... |
31Create an MCQ on the requirements for training...What are the requirements for training deep ne...A large amount of data and computational resou...The availability of cloud computing and specia...Manual feature engineeringSupervised labeled datasetsATraining deep neural networks typically requir...
32Create an MCQ on the types of machine learning...Which types of machine learning are used in de...Supervised, unsupervised, and reinforcement le...Supervised and unsupervised learningUnsupervised and reinforcement learningSupervised and reinforcement learningADeep learning can be used for supervised, unsu...
33Create an MCQ on the types of neural networks ...Which type of neural network is specifically d...Feedforward Neural Networks (FNNs)Convolutional Neural Networks (CNNs)Recurrent Neural Networks (RNNs)Artificial Neural Networks (ANNs)BConvolutional Neural Networks (CNNs) are speci...
34Create an MCQ on the applications of deep lear...What is one of the main applications of deep l...Speech recognitionSentiment analysisImage classificationGame playingCOne of the main applications of deep learning ...
35Create an MCQ on the applications of deep lear...What is one of the main applications of deep l...Object detection and recognitionImage classificationLanguage translationGame playingCOne of the main applications of deep learning ...
36Create an MCQ on the applications of deep lear...What is one of the main applications of deep l...Sentiment analysisImage segmentationGame playingSpeech recognitionCOne of the main applications of deep learning ...
37Create an MCQ on the main purpose of deep lear...What is the main purpose of deep learning mode...To analyze the sentiment of textTo translate text from one language to anotherTo identify and understand visual dataTo control complex systemsCThe main purpose of deep learning models in co...
38Create an MCQ on the definition of Artificial ...What is the definition of Artificial Intellige...The study of training machines to mimic human ...The study of improving machines with experienc...The study that uses neural networks to imitate...The study that focuses on learning, reasoning,...AArtificial Intelligence is the mechanism to in...
39Create an MCQ on the definition of Machine Lea...What is the definition of Machine Learning?The study of training machines to mimic human ...The study of improving machines with experienc...The study that uses neural networks to imitate...The study that focuses on learning from data a...DMachine Learning is the study/process which pr...
40Create an MCQ on the definition of Deep Learning.What is the definition of Deep Learning?The study of training machines to mimic human ...The study of improving machines with experienc...The study that uses neural networks to imitate...The study that focuses on using neural network...DDeep Learning is a sub-part of the broader fam...
41Create an MCQ on the relationship between Arti...What is the relationship between Artificial In...AI is a subset of MLML is a subset of AIAI and ML are independent of each otherAI and ML are the same thingBArtificial Intelligence is the broader family ...
42Create an MCQ on the aim of Deep Learning.What is the aim of Deep Learning?To increase chances of successTo increase accuracyTo achieve high accuracy with a small amount o...To achieve high accuracy with a large amount o...DThe aim of Deep Learning is to achieve high ac...
\n"],"application/vnd.google.colaboratory.intrinsic+json":{"type":"dataframe","summary":"{\n \"name\": \"dataset['val']\",\n \"rows\": 43,\n \"fields\": [\n {\n \"column\": \"Instruction\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Create an MCQ on the main purpose of deep learning models in computer vision.\",\n \"Create an MCQ on the training process of artificial neural networks\",\n \"Create an MCQ on the differences between machine learning and deep learning\"\n ],\n \"num_unique_values\": 43,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Question\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"What is the main purpose of deep learning models in computer vision?\",\n \"What is adjusted during the training process of an artificial neural network?\",\n \"Which of the following requires a larger volume of dataset compared to machine learning?\"\n ],\n \"num_unique_values\": 43,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"A\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"To increase chances of success\",\n \"Chatbots\",\n \"A large amount of data and computational resources\"\n ],\n \"num_unique_values\": 36,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"B\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"To increase accuracy\",\n \"Virtual assistants\",\n \"The availability of cloud computing and specialized hardware\"\n ],\n \"num_unique_values\": 36,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"C\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"Gamma\",\n \"NLP systems\",\n \"It may result in overfitting\"\n ],\n \"num_unique_values\": 37,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"D\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"AI and ML are the same thing\",\n 
\"Connections\",\n \"Speech recognition\"\n ],\n \"num_unique_values\": 33,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Correct Answer\",\n \"properties\": {\n \"dtype\": \"category\",\n \"samples\": [\n \"C\",\n \"D\",\n \"B\"\n ],\n \"num_unique_values\": 4,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Explanation\",\n \"properties\": {\n \"dtype\": \"string\",\n \"samples\": [\n \"The main purpose of deep learning models in computer vision is to identify and understand visual data. Deep learning models can be used to perform tasks such as object detection and recognition, image classification, and image segmentation.\",\n \"During the training process of an artificial neural network, the weights of the connections between neurons are adjusted to enhance the performance of the model.\",\n \"Deep learning requires a larger volume of dataset compared to machine learning.\"\n ],\n \"num_unique_values\": 43,\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}"}},"metadata":{},"execution_count":21}]},{"cell_type":"code","source":["from huggingface_hub import 
notebook_login\n","notebook_login()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":145,"referenced_widgets":["ac3383a4853c484ca009cddb6e305853","095e9de8c9c24583baa8c7d074c5eacc","7a6eab206dcb4cd3b0cd4681a38f192f","e40bf3020d024dc0bc72a6f23b471494","8fc0cb635ac64733b957061f58ed16b6","9dcbf15879f94eddaf99a1603d1ca4c8","b0cf0967450c415f92e8540b30944ccf","e588245da98b412d849fbe2f94fb4b79","58409bb2b4c845f589fce0e2c2078a8c","17a1cd2fe40e4dde8b6b196e29abbec1","10bef54de0f64e86a3c9b1c039885ee9","40c244e062aa47a7846cace18c952cad","0d5efa7a7e7a4fe2ac2a789fe79ea94a","51e3dfdb61374128a65a208209c93060","283deaf2c9544e80a9433ae9148492d5","b9eb5c03173143aeb2793da889a2428b","8a3773ae30cf482a80edc5af98dc9cd8","0e34328104b64acb9a63e76013ccfc0d","e4fb41fce6b44fffbb1e0a6ab9d477ab","b5b3620259184090b78f008fc8a789db","bdec133061344a838b792668b98a5daa","f26867efbb384d1791ccc7e6ff2f7b2b","e35c557212ea4317ba624707f8d42dde","bfee25db776447eb9126cdbdbdfe6a8c","82764a4acf0e49e58547e9f2e52d5533","ab32080c5ec54c8abb90f0d85855adf5","5eacd5686b6d4486b772e957a80d894d","6c6af529b3dd449cb2826706c220d0cc","079da2bd7cfd4ed3b3324a2e54a8ae87","570cfe79345a43b5a9b34f05e72cc6c9","b21ff34c2fd844f2baee7dd45eda5700","8f7bf4ff652b4ae2b137b0518a4d6a00"]},"id":"U2YZULfoCura","executionInfo":{"status":"ok","timestamp":1708322802639,"user_tz":-480,"elapsed":9,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"31a4d102-5e31-4500-c9bc-34d0274857cf"},"execution_count":null,"outputs":[{"output_type":"display_data","data":{"text/plain":["VBox(children=(HTML(value='
"],"image/png":"\n"},"metadata":{}}]},{"cell_type":"markdown","source":["## 3. Filter out rows with more than 2048 tokens\n","\n","We will remove samples with more than 2048 tokens (max context size of Llama 2 by default = 4096)."],"metadata":{"id":"_RXe958fNLwH"}},{"cell_type":"code","source":["def filter_by_token_count(dataset_split, combined_token_counts, max_tokens=2048):\n"," # Filter out rows with more than 'max_tokens' tokens\n"," filtered_dataset = [example for example, count in zip(dataset_split, combined_token_counts) if count <= max_tokens]\n"," return filtered_dataset\n","\n","\n","# Assuming 'dataset' contains your data splits\n","fig, axs = plt.subplots(3, 5, figsize=(25, 15)) # Adjust figure size as necessary\n","\n","split_names = ['train', 'test', 'val']\n","for row, split_name in enumerate(split_names):\n"," # Tokenize and count\n"," instruction_counts, explanation_counts, question_counts, options_counts, combined_counts = tokenize_and_count(dataset[split_name])\n","\n"," # Filter dataset based on combined token count\n"," filtered_dataset = filter_by_token_count(dataset[split_name], combined_counts)\n","\n"," # Re-tokenize and count for the filtered dataset\n"," filtered_instruction_counts, filtered_explanation_counts, filtered_question_counts, filtered_options_counts, filtered_combined_counts = tokenize_and_count(filtered_dataset)\n","\n"," # Plotting the distributions for the filtered datasets, organizing by row based on the split\n"," plot_distribution(filtered_instruction_counts, f\"{split_name} (filtered): Instruction\", axs[row, 0])\n"," plot_distribution(filtered_explanation_counts, f\"{split_name} (filtered): Explanation\", axs[row, 1])\n"," plot_distribution(filtered_question_counts, f\"{split_name} (filtered): Question\", axs[row, 2])\n"," plot_distribution(filtered_options_counts, f\"{split_name} (filtered): Options\", axs[row, 3])\n"," plot_distribution(filtered_combined_counts, f\"{split_name} (filtered): Combined\", axs[row, 
4])\n","\n","# Adjust layout to prevent overlap and ensure clarity\n","plt.tight_layout(pad=3.0)\n","plt.show()\n"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":898},"id":"WYzaKhSvz9Yk","executionInfo":{"status":"ok","timestamp":1708322857920,"user_tz":-480,"elapsed":8192,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"ee8da4e6-3afb-4534-885b-9780571cde25"},"execution_count":null,"outputs":[{"output_type":"display_data","data":{"text/plain":["
"],"image/png":"\n"},"metadata":{}}]},{"cell_type":"code","source":["# Function to filter dataset and plot new distribution\n","def filter_and_plot(dataset_split_name, dataset_split_data, combined_token_counts, axs, position):\n"," # Filter out rows with more than 2048 tokens\n"," valid_indices = [i for i, count in enumerate(combined_token_counts) if count <= 2048]\n"," print(f\"Number of valid rows in {dataset_split_name}: {len(valid_indices)}\")\n"," print(f\"Removing {len(dataset_split_data) - len(valid_indices)} rows from {dataset_split_name}...\")\n","\n"," # Extract valid rows based on indices\n"," valid_dataset = [dataset_split_data[i] for i in valid_indices]\n","\n"," # Re-calculate token counts for the valid dataset if necessary\n"," # This step is assumed necessary only if the token counts need to be recalculated for the filtered dataset\n"," # Otherwise, valid_token_counts = [combined_token_counts[i] for i in valid_indices] would suffice\n"," _, _, _, _, valid_combined_counts = tokenize_and_count(valid_dataset)\n","\n"," # Plot the new distribution for valid rows\n"," plot_distribution(valid_combined_counts, f\"New distribution after filtering {dataset_split_name}\", axs[position])\n","\n","# Create a figure with subplots\n","fig, axs = plt.subplots(3, 1, figsize=(6, 9)) # Adjust figsize as necessary\n","\n","# Assuming the 'dataset' variable is a dictionary containing data splits 'train', 'test', and 'val'\n","for i, split_name in enumerate(['train', 'test', 'val']):\n"," # Tokenize and count for the specific dataset split\n"," _, _, _, _, combined_counts = tokenize_and_count(dataset[split_name])\n","\n"," # Filter datasets based on token count and plot the new distribution\n"," filter_and_plot(split_name, dataset[split_name], combined_counts, axs, 
i)\n","\n","plt.tight_layout()\n","plt.show()\n"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":1000},"id":"LTw4ITHDl92J","executionInfo":{"status":"ok","timestamp":1708322864043,"user_tz":-480,"elapsed":2649,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"9f5de7d2-6e9a-4e2e-d3ee-73df3a6ad59f"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["Number of valid rows in train: 334\n","Removing 0 rows from train...\n","Number of valid rows in test: 41\n","Removing 0 rows from test...\n","Number of valid rows in val: 43\n","Removing 0 rows from val...\n"]},{"output_type":"display_data","data":{"text/plain":["
"],"image/png":"\n"},"metadata":{}}]},{"cell_type":"code","source":["# Initialize a flag to indicate whether any entries were removed in any split\n","entries_removed = False\n","\n","# Iterate over each split in the dataset\n","for split_name in ['train', 'test', 'val']:\n"," # Get the original length of the split\n"," original_length = len(dataset[split_name])\n"," # Tokenize and count tokens in the split\n"," _, _, _, _, combined_counts = tokenize_and_count(dataset[split_name])\n"," # Determine valid indices (entries with <= 2048 tokens)\n"," valid_indices = [i for i, count in enumerate(combined_counts) if count <= 2048]\n"," # Check if any entries were removed\n"," if len(valid_indices) < original_length:\n"," entries_removed = True\n"," # Update the dataset split with filtered entries\n"," dataset[split_name] = dataset[split_name].select(valid_indices)\n","\n","# Flag to control execution of subsequent code\n","continue_execution = True\n","\n","if not entries_removed:\n"," print(\"No entries removed due to token count. Skipping saving.\")\n"," continue_execution = False\n","\n","# Proceed with further steps only if entries were removed\n","if continue_execution:\n"," # Save the filtered dataset to disk\n"," dataset.save_to_disk('new_mcq_data')\n"," print(\"Filtered dataset saved successfully.\")"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"2N8lcoTS5jkm","executionInfo":{"status":"ok","timestamp":1708322873795,"user_tz":-480,"elapsed":922,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"d1028a6f-0388-47ea-8486-e715bcbc52fd"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["No entries removed due to token count. Skipping saving.\n"]}]},{"cell_type":"markdown","source":["---\n","\n","## 4. 
Near-deduplication Using Embeddings\n","\n","* Near-deduplication with embeddings is a technique that employs vector representations to effectively identify and manage nearly identical data entries.\n","\n","* By transforming data into these vectors (embeddings), we can quantitatively measure how similar different pieces of data are. This transformation significantly improves our ability to manage large datasets, where sorting through and removing near-duplicates manually would be impractical.\n","\n","* Widely used in fields like database management, information retrieval, and machine learning, this approach is crucial for efficient data handling and analysis.\n","\n","---\n","\n","### We Will Not Perform Deduplication on Our MCQ Dataset.\n","\n","* **Intentional Repetition for Emphasis**: In educational contexts, certain concepts may be intentionally repeated to underscore their significance. Deduplication could diminish the dataset's educational effectiveness by removing these purposeful repetitions.\n","\n","* **Variations of Similar Questions**: MCQ datasets often feature questions that, while seemingly similar, include minor variations in wording, options, or context. Inadequately designed deduplication algorithms risk eliminating these nuances, thereby losing valuable elements of the dataset.\n","\n","* **Difficulty in Defining \"Duplicates\"**: Identifying duplicates within MCQs poses a significant challenge, as questions that appear identical might differ in subtle yet crucial ways. These distinctions often represent unique learning opportunities that would be lost through deduplication.\n","\n","---\n"],"metadata":{"id":"kXzTKu99w3g4"}},{"cell_type":"markdown","source":["## 5. 
Top-k sampling\n","\n","Top-k sampling keeps only the k samples with the most tokens.\n","\n","---\n","\n","### Decision on \"Top-k Sampling\" for Our MCQ Dataset\n","\n","\n","We have decided against employing \"Top-k sampling\" to select only the top k samples with the most tokens in our MCQ dataset. This approach does not align with the core objectives of MCQ dataset development, for the following reason:\n","\n","\n","**Practical Considerations**\n","\n","* **Conciseness and Effectiveness**: The hallmark of high-quality MCQs lies in their conciseness and meaningfulness. Favoring question length over substance could detract from the dataset's quality, as longer questions do not necessarily equate to higher educational value. Succinct yet profound questions are typically the most beneficial and stimulating for learners.\n","\n","\n","* Given these considerations, we conclude that \"Top-k sampling,\" which prioritizes token count, falls short of fulfilling the requirements of our MCQ dataset. The true merit of a valuable MCQ dataset resides in its diverse and balanced assortment of topics and difficulty levels, not merely in question length. 
This philosophy ensures our dataset remains versatile and effective across various educational and machine learning applications.\n","---"],"metadata":{"id":"TOCspcgXNOav"}},{"cell_type":"code","source":["# @title\n","# Push to Hugging Face Hub\n","dataset.push_to_hub(\"ssoh/mcq_dataset_2\")"],"metadata":{"id":"pj1b5S_68KB0","colab":{"base_uri":"https://localhost:8080/","height":244,"referenced_widgets":["d245dbe5cf8946e28f08287c15926a6d","45f5b19c94a544d79b3d65e13e67a0f1","ecb708463202466b96b244822fddd7c3","9cc0c38f4fb4452e8a14f1b208aaaefd","9ec8035efb334973bf54923d9b25b491","60fc22bbcb204b008cf416f44835ab84","686e9cf6109241b28c44aba78667aaa9","bf5c401f943e4040893e01cade21b632","5bb2a8fc3af2421c9b1f92039ea61ba2","5050f71288f2478c8d5bdeff47fcdca5","551a53a84dd8459b8cdd3e3e09ae687c","5af01ff36b2d4edebc842f0aea68bbdd","243e232f096047ca84cafe43fdd02abf","aee15bf2685349e684a7c1e0847fe830","324b3953949e4416b3614572969d1124","20471d4effb54abfabce62c072f90ebe","34914f01bdad45aca133fda48cb7b77d","015ca0b73d984087b070630305d794af","df0642f7d06d4a1eaf10ddf27460f3f1","9398e6c79bd947ae8e7090363bd669a4","814c3545d598458182c9d638cd7d78c0","46cbf4315b0e4a00b41157bc4d03f0fd","00efd4b16cd24e7c92d9dd2daa03b10f","9e4171a5c49a494a9ae72aa72a67862c","7569df6a04a14194866afbb4e78f6468","39ff79fe9a214782894e983ce4d45acd","91b5f1a2be93471eb880235771722cd9","57c0c6ab34974fc89e2036bd1074a098","bb364cb3c7134948b24ba1b3a0c2186d","4108281cd53941288d99b60e994770f1","1e11face32ee40f890cb97751209d6ac","45a866686d954d33ba1b3034397661ea","f5014c5a31d342e7952bcebf024250a2","8288c75935d74e85bd62e1682f8f04fe","1981153d1d2943b1936946a362048f3f","d5aec20dc1034e57980a8e9121fdd8e0","9795e18cccce455485accc5309d8d9de","f0bea944fba44d508575eb73b8684596","f9b825f130304fccadb8d660daa0b3ad","4d18d9a7763749faacffaf660cb6c100","4ba50000d3b14f74a288aa8249723fd6","152308af6482481aa88914fb1f37eb13","2e3600f228c446349e2fe7126fb95255","57849a6643f64b58802bb57ab8e18412","426bad77baaa4db49164e3cf7b3505b9","2f6008ded0274199
ba675ec6f0a7469d","7c290b50d93a4f0cb209a5e444934596","bc438acf4f2047e6b7e49829c8b078ee","9862cd6a17af4bfd80a02f9b68b80629","c949f9dd143c4ccda644e3a601a46302","b603160aed09476c937847a2c1fa6236","e3e9b4527bd84cbe9f83479d090416c0","3db20951b33c4ce3b14acd7fe1a438ce","96497d48b0bd436b964826dcbaa92a76","2fb73ee317e7482e94e7c696c6a0a2de","48cd0eb61edc469482a93d6a5cf8dabd","1ee577704a9048e39d73ad278b587215","5cd9bf0a588947a8895e187842dfca0c","cfcd77f72de043b3ad28dbaa216909d3","621a509dcda84c90b6f1e76b39b9b97a","e7ea517d5821476fa388b2d67c098675","14348abed9184177a87d6a8c0f558097","dbc3be056d4e4e7d87beb01252eecb37","0acc8312636b4361b915e9bd39a02057","f0f3a78cb34f41caba416d266c270952","f118a0a1a7b444508e6c02aa0acc83d7"]},"executionInfo":{"status":"ok","timestamp":1708322881797,"user_tz":-480,"elapsed":3008,"user":{"displayName":"szehanz","userId":"16137883221268059572"}},"outputId":"266d2243-1d5a-41a4-b0e2-25a573b7f602"},"execution_count":null,"outputs":[{"output_type":"display_data","data":{"text/plain":["Uploading the dataset shards: 0%| | 0/1 [00:00 "]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_142852-r9yt9wy0"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run vermilion-springroll-11 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/r9yt9wy0"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"4e64b2267c9c4213a5914056f6811d1b","version_major":2,"version_minor":0},"text/plain":["Map: 0%| | 0/334 [00:00\n"," \n"," \n"," [66/66 10:49, Epoch 6/6]\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 3.005400 | 2.068154 |
| 20 | 1.436200 | 1.290664 |
| 30 | 0.778100 | 1.072349 |
| 40 | 0.613200 | 0.931616 |
| 50 | 0.496900 | 0.921341 |
| 60 | 0.468000 | 0.888872 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:04]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.006 MB uploaded\\r'), FloatProgress(value=1.0, max=1.0)))"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:

| Metric | History |
|---|---|
| eval/loss | █▃▂▁▁▁▁ |
| eval/runtime | ▁▅▇▇▇█▇ |
| eval/samples_per_second | █▄▂▂▂▁▂ |
| eval/steps_per_second | █▄▂▂▁▁▂ |
| eval_loss | |
| train/epoch | ▁▁▂▂▄▄▅▅▆▆▇▇██ |
| train/global_step | ▁▁▂▂▃▃▅▅▆▆▇▇███ |
| train/learning_rate | █▇▅▄▂▁ |
| train/loss | █▄▂▁▁▁ |
| train/total_flos | |
| train/train_loss | |
| train/train_runtime | |
| train/train_samples_per_second | |
| train/train_steps_per_second | |

Run summary:

| Metric | Value |
|---|---|
| eval/loss | 0.88887 |
| eval/runtime | 5.8859 |
| eval/samples_per_second | 7.306 |
| eval/steps_per_second | 1.019 |
| eval_loss | 0.88887 |
| train/epoch | 6.0 |
| train/global_step | 66 |
| train/learning_rate | 2e-05 |
| train/loss | 0.468 |
| train/total_flos | 2090258212601856.0 |
| train/train_loss | 1.06766 |
| train/train_runtime | 658.5513 |
| train/train_samples_per_second | 3.043 |
| train/train_steps_per_second | 0.1 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run vermilion-springroll-11 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/r9yt9wy0
Synced 6 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_142852-r9yt9wy0/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 14:40:09,515] Trial 0 finished with value: 0.8888720870018005 and parameters: {'learning_rate': 0.00022063199006940203, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 3}. Best is trial 0 with value: 0.8888720870018005.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"3de6384b3841485d8068f7265fbda30e","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113223888807826, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_144009-p10q3kv9"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run lunar-ox-12 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/p10q3kv9"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"0a6c66cadea549f78fe5a183e841734e","version_major":2,"version_minor":0},"text/plain":["Map: 0%| | 0/334 [00:00\n"," \n"," \n"," [66/66 10:55, Epoch 6/6]\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.881900 | 1.890017 |
| 20 | 1.003600 | 1.057150 |
| 30 | 0.615700 | 0.908779 |
| 40 | 0.500400 | 0.848109 |
| 50 | 0.430000 | 0.856009 |
| 60 | 0.402800 | 0.839791 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.023 MB uploaded\\r'), FloatProgress(value=0.2557933392427504, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:

| Metric | History |
|---|---|
| eval/loss | █▂▁▁▁▁▁ |
| eval/runtime | ▁▄▆█▆▆▅ |
| eval/samples_per_second | █▅▃▁▃▃▃ |
| eval/steps_per_second | █▅▃▁▄▃▄ |
| eval_loss | |
| train/epoch | ▁▁▂▂▄▄▅▅▆▆▇▇██ |
| train/global_step | ▁▁▂▂▃▃▅▅▆▆▇▇███ |
| train/learning_rate | █▇▅▄▂▁ |
| train/loss | █▃▂▁▁▁ |
| train/total_flos | |
| train/train_loss | |
| train/train_runtime | |
| train/train_samples_per_second | |
| train/train_steps_per_second | |

Run summary:

| Metric | Value |
|---|---|
| eval/loss | 0.83979 |
| eval/runtime | 5.9095 |
| eval/samples_per_second | 7.276 |
| eval/steps_per_second | 1.015 |
| eval_loss | 0.83979 |
| train/epoch | 6.0 |
| train/global_step | 66 |
| train/learning_rate | 4e-05 |
| train/loss | 0.4028 |
| train/total_flos | 2090258212601856.0 |
| train/train_loss | 0.91779 |
| train/train_runtime | 664.2846 |
| train/train_samples_per_second | 3.017 |
| train/train_steps_per_second | 0.099 |

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run lunar-ox-12 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/p10q3kv9
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_144009-p10q3kv9/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 14:51:33,062] Trial 1 finished with value: 0.8397907614707947 and parameters: {'learning_rate': 0.000388078354781562, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 5}. Best is trial 1 with value: 0.8397907614707947.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"9a11d1cc83a44c18a071c37ee29cb191","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113095899862755, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_145133-zcwhia3h"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run lunar-envelope-13 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/zcwhia3h"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [120/168 11:45 < 04:46, 0.17 it/s, Epoch 5/8]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
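| Step | Training Loss | Validation Loss |
|---|---|---|
| 10 | 2.805400 | 1.864337 |
| 20 | 1.078500 | 1.049373 |
| 30 | 0.637600 | 0.906046 |
| 40 | 0.549300 | 0.883616 |
| 50 | 0.491300 | 0.882419 |
| 60 | 0.456700 | 0.849847 |
| 70 | 0.431200 | 0.859990 |
| 80 | 0.410700 | 0.856455 |
| 90 | 0.398700 | 0.832274 |
| 100 | 0.375900 | 0.863837 |
| 110 | 0.387600 | 0.843072 |
| 120 | 0.374900 | 0.846136 |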

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17044687077892842, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▂▂▁▁▁▁▁▁▁▁▁▁
eval/runtime▁▃▄▅▃▄▃▃▅█▅▅▄
eval/samples_per_second█▆▅▄▆▅▆▆▄▁▄▄▅
eval/steps_per_second█▆▅▄▅▅▆▆▄▁▄▄▄
eval_loss
train/epoch▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇████
train/global_step▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇█████
train/learning_rate█▇▇▆▅▅▄▄▃▂▂▁
train/loss█▃▂▂▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.83227
eval/runtime | 5.8969
eval/samples_per_second | 7.292
eval/steps_per_second | 1.017
eval_loss | 0.83227
train/epoch | 5.71
train/global_step | 120
train/learning_rate | 0.00011
train/loss | 0.3749
train/total_flos | 1821144316674048.0
train/train_loss | 0.69982
train/train_runtime | 710.1571
train/train_samples_per_second | 3.763
train/train_steps_per_second | 0.237

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run lunar-envelope-13 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/zcwhia3h
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_145133-zcwhia3h/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:03:40,491] Trial 2 finished with value: 0.8322736024856567 and parameters: {'learning_rate': 0.00038013816677024434, 'num_train_epochs': 8, 'per_device_train_batch_size': 16, 'warmup_steps': 4}. Best is trial 2 with value: 0.8322736024856567.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"192adca007164c1188758c608dfd03f7","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.01111326985539765, max=1.0)…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_150340-0rux0mbt"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run dazzling-orchid-14 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/0rux0mbt"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [84/84 08:09, Epoch 4/4]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 3.117100 | 2.101251
20 | 1.543500 | 1.223637
30 | 0.765800 | 1.053006
40 | 0.633500 | 0.956405
50 | 0.551300 | 0.914150
60 | 0.488700 | 0.883170
70 | 0.453200 | 0.863325
80 | 0.431700 | 0.861434

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.022 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.6507556781402156, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▃▂▂▁▁▁▁▁
eval/runtime▁▄▃▅█▆▅▇▅
eval/samples_per_second█▅▆▄▁▃▄▂▄
eval/steps_per_second█▅▆▅▁▃▅▂▄
eval_loss
train/epoch▁▁▂▂▃▃▄▄▅▅▆▆▇▇████
train/global_step▁▁▂▂▃▃▄▄▅▅▆▆▇▇█████
train/learning_rate█▇▆▅▄▃▂▁
train/loss█▄▂▂▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.86143
eval/runtime | 5.9003
eval/samples_per_second | 7.288
eval/steps_per_second | 1.017
eval_loss | 0.86143
train/epoch | 4.0
train/global_step | 84
train/learning_rate | 1e-05
train/loss | 0.4317
train/total_flos | 1274022822739968.0
train/train_loss | 0.96978
train/train_runtime | 494.7488
train/train_samples_per_second | 2.7
train/train_steps_per_second | 0.17

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run dazzling-orchid-14 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/0rux0mbt
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_150340-0rux0mbt/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:12:11,748] Trial 3 finished with value: 0.8614340424537659 and parameters: {'learning_rate': 0.00023956952379873406, 'num_train_epochs': 4, 'per_device_train_batch_size': 16, 'warmup_steps': 5}. Best is trial 2 with value: 0.8322736024856567.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"f7b8197a4e4c4242a7965774efc249e1","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113144411097488, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_151211-6nazgql4"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run prosperous-dragon-15 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/6nazgql4"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [66/66 10:56, Epoch 6/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 2.680100 | 1.782772
20 | 0.894100 | 1.025657
30 | 0.587400 | 0.897337
40 | 0.476300 | 0.830805
50 | 0.420400 | 0.864126
60 | 0.394900 | 0.841267

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.1705031517334534, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▂▁▁▁▁▁
eval/runtime▁▇▆▆█▇▅
eval/samples_per_second█▂▄▄▁▂▄
eval/steps_per_second█▁▃▃▁▁▃
eval_loss
train/epoch▁▁▂▂▄▄▅▅▆▆▇▇██
train/global_step▁▁▂▂▃▃▅▅▆▆▇▇███
train/learning_rate█▇▅▄▂▁
train/loss█▃▂▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.83081
eval/runtime | 5.9104
eval/samples_per_second | 7.275
eval/steps_per_second | 1.015
eval_loss | 0.83081
train/epoch | 6.0
train/global_step | 66
train/learning_rate | 4e-05
train/loss | 0.3949
train/total_flos | 2090258212601856.0
train/train_loss | 0.85955
train/train_runtime | 665.5241
train/train_samples_per_second | 3.011
train/train_steps_per_second | 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run prosperous-dragon-15 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/6nazgql4
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_151211-6nazgql4/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:23:34,430] Trial 4 finished with value: 0.8308054208755493 and parameters: {'learning_rate': 0.00041915607985727055, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 3}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"d8dc9ac41d544e3fb4094e8c4b80cecc","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113028755709011, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_152334-gqh7mnqx"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run crimson-dragon-16 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/gqh7mnqx"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [44/44 07:13, Epoch 4/4]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 2.606400 | 1.642761
20 | 0.800100 | 1.017194
30 | 0.585700 | 0.903642
40 | 0.473400 | 0.855594

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17057383277516674, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▂▁▁▁
eval/runtime▁▅▅█▁
eval/samples_per_second█▄▅▁█
eval/steps_per_second█▅▅▁█
eval_loss
train/epoch▁▁▃▃▅▅▇▇██
train/global_step▁▁▃▃▅▅▇▇███
train/learning_rate█▆▃▁
train/loss█▂▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.85559
eval/runtime | 5.9033
eval/samples_per_second | 7.284
eval/steps_per_second | 1.016
eval_loss | 0.85559
train/epoch | 4.0
train/global_step | 44
train/learning_rate | 5e-05
train/loss | 0.4734
train/total_flos | 1395517145776128.0
train/train_loss | 1.0546
train/train_runtime | 442.5635
train/train_samples_per_second | 3.019
train/train_steps_per_second | 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run crimson-dragon-16 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/gqh7mnqx
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_152334-gqh7mnqx/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:31:13,768] Trial 5 finished with value: 0.855594277381897 and parameters: {'learning_rate': 0.0004882684074952214, 'num_train_epochs': 4, 'per_device_train_batch_size': 32, 'warmup_steps': 3}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"b17199cf7d1c41ee969d27814d9efe8b","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113176900026802, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_153113-emmid59b"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run beaming-paper-17 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/emmid59b"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [66/66 10:55, Epoch 6/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 2.814500 | 1.876597
20 | 1.022200 | 1.064405
30 | 0.624600 | 0.902199
40 | 0.507500 | 0.854691
50 | 0.436200 | 0.856049
60 | 0.407800 | 0.836567

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17026547676022635, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▃▁▁▁▁▁
eval/runtime▁▁▄▃█▅▃
eval/samples_per_second██▅▅▁▄▆
eval/steps_per_second██▄▅▁▄▅
eval_loss
train/epoch▁▁▂▂▄▄▅▅▆▆▇▇██
train/global_step▁▁▂▂▃▃▅▅▆▆▇▇███
train/learning_rate█▇▅▄▂▁
train/loss█▃▂▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.83657
eval/runtime | 5.9028
eval/samples_per_second | 7.285
eval/steps_per_second | 1.016
eval_loss | 0.83657
train/epoch | 6.0
train/global_step | 66
train/learning_rate | 4e-05
train/loss | 0.4078
train/total_flos | 2090258212601856.0
train/train_loss | 0.91464
train/train_runtime | 664.3977
train/train_samples_per_second | 3.016
train/train_steps_per_second | 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run beaming-paper-17 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/emmid59b
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_153113-emmid59b/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:42:35,259] Trial 6 finished with value: 0.8365665674209595 and parameters: {'learning_rate': 0.000371977101120841, 'num_train_epochs': 6, 'per_device_train_batch_size': 32, 'warmup_steps': 4}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"3718f807c93646f8ba07bbc2d3594547","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113231277947003, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_154235-x78afq0n"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run glittering-monkey-18 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/x78afq0n"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [44/44 07:14, Epoch 4/4]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 3.195100 | 2.187073
20 | 1.574500 | 1.620554
30 | 0.901500 | 1.120180
40 | 0.690900 | 1.036435

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17055463319920083, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▅▂▁▁
eval/runtime▁▆▄█▅
eval/samples_per_second█▃▅▁▄
eval/steps_per_second█▃▅▁▄
eval_loss
train/epoch▁▁▃▃▅▅▇▇██
train/global_step▁▁▃▃▅▅▇▇███
train/learning_rate█▆▃▁
train/loss█▃▂▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 1.03644
eval/runtime | 5.9099
eval/samples_per_second | 7.276
eval/steps_per_second | 1.015
eval_loss | 1.03644
train/epoch | 4.0
train/global_step | 44
train/learning_rate | 2e-05
train/loss | 0.6909
train/total_flos | 1395517145776128.0
train/train_loss | 1.50181
train/train_runtime | 442.6998
train/train_samples_per_second | 3.018
train/train_steps_per_second | 0.099

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run glittering-monkey-18 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/x78afq0n
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_154235-x78afq0n/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 15:50:14,768] Trial 7 finished with value: 1.0364350080490112 and parameters: {'learning_rate': 0.00021352963324526537, 'num_train_epochs': 4, 'per_device_train_batch_size': 32, 'warmup_steps': 5}. Best is trial 4 with value: 0.8308054208755493.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"d31e8ddbc07347adb1c8a3798ee36db8","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113219644499218, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_155014-8e8ip35f"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run beaming-fuse-19 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/8e8ip35f"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [120/126 11:45 < 00:35, 0.17 it/s, Epoch 5/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 2.719600 | 1.833074
20 | 1.075200 | 1.046847
30 | 0.640200 | 0.922950
40 | 0.550400 | 0.871228
50 | 0.498600 | 0.876081
60 | 0.453900 | 0.857398
70 | 0.428300 | 0.860341
80 | 0.409600 | 0.856458
90 | 0.394100 | 0.830379
100 | 0.371500 | 0.847909
110 | 0.381800 | 0.842489
120 | 0.359500 | 0.851102

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:04]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.012 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.35110723430597374, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▃▂▁▁▁▁▁▁▁▁▁▁
eval/runtime▁▂▂▃▄▆█▄█▇▅█▁
eval/samples_per_second█▇▇▆▅▃▁▅▁▂▄▁█
eval/steps_per_second▇▇▇▆▅▃▁▅▁▁▅▁█
eval_loss
train/epoch▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇████
train/global_step▁▁▂▂▂▂▃▃▄▄▄▄▅▅▅▅▆▆▇▇▇▇█████
train/learning_rate█▇▇▆▅▅▄▄▃▂▂▁
train/loss█▃▂▂▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.83038
eval/runtime | 5.8734
eval/samples_per_second | 7.321
eval/steps_per_second | 1.022
eval_loss | 0.83038
train/epoch | 5.71
train/global_step | 120
train/learning_rate | 2e-05
train/loss | 0.3595
train/total_flos | 1821144316674048.0
train/train_loss | 0.69023
train/train_runtime | 710.9352
train/train_samples_per_second | 2.819
train/train_steps_per_second | 0.177

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run beaming-fuse-19 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/8e8ip35f
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_155014-8e8ip35f/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 16:02:22,467] Trial 8 finished with value: 0.8303791284561157 and parameters: {'learning_rate': 0.0003782307395143863, 'num_train_epochs': 6, 'per_device_train_batch_size': 16, 'warmup_steps': 3}. Best is trial 8 with value: 0.8303791284561157.\n"]},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"d8cdb5224da6481691ab8959404e3d24","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='Waiting for wandb.init()...\\r'), FloatProgress(value=0.011113188377607407, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_160222-bob3m06p"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run dazzling-ox-20 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/Education-Chatbot-Optimization"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/bob3m06p"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [140/168 13:43 < 02:47, 0.17 it/s, Epoch 6/8]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 2.785900 | 1.793209
20 | 1.003300 | 1.026370
30 | 0.611100 | 0.900247
40 | 0.545900 | 0.883518
50 | 0.486400 | 0.883806
60 | 0.444200 | 0.881959
70 | 0.432800 | 0.877395
80 | 0.410800 | 0.867789
90 | 0.400000 | 0.852276
100 | 0.373100 | 0.867085
110 | 0.386200 | 0.850435
120 | 0.373800 | 0.855305
130 | 0.392000 | 0.853369
140 | 0.360400 | 0.869668

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

\n"," \n"," \n"," [6/6 00:05]\n","
\n"," "],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.1705731731337404, max=1.0…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss█▂▁▁▁▁▁▁▁▁▁▁▁▁▁
eval/runtime▁▄▅▆▆▆▆▇▇█▆▆▆▇▆
eval/samples_per_second█▅▄▃▃▃▃▂▁▁▃▃▃▂▃
eval/steps_per_second█▅▄▂▃▄▃▂▂▁▂▃▄▂▃
eval_loss
train/epoch▁▁▂▂▂▂▃▃▃▃▄▄▄▄▅▅▅▅▆▆▆▆▇▇▇▇████
train/global_step▁▁▂▂▂▂▃▃▃▃▄▄▄▄▅▅▅▅▆▆▆▆▇▇▇▇█████
train/learning_rate█▇▇▆▆▅▅▄▄▃▃▂▂▁
train/loss█▃▂▂▁▁▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.85044
eval/runtime | 5.8867
eval/samples_per_second | 7.305
eval/steps_per_second | 1.019
eval_loss | 0.85044
train/epoch | 6.67
train/global_step | 140
train/learning_rate | 8e-05
train/loss | 0.3604
train/total_flos | 2129326975303680.0
train/train_loss | 0.64328
train/train_runtime | 828.8076
train/train_samples_per_second | 3.224
train/train_steps_per_second | 0.203

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run dazzling-ox-20 at: https://wandb.ai/szehanz/Education-Chatbot-Optimization/runs/bob3m06p
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_160222-bob3m06p/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"name":"stderr","output_type":"stream","text":["[I 2024-02-19 16:16:28,206] Trial 9 finished with value: 0.8504351377487183 and parameters: {'learning_rate': 0.000457264058410859, 'num_train_epochs': 8, 'per_device_train_batch_size': 16, 'warmup_steps': 5}. Best is trial 8 with value: 0.8303791284561157.\n"]}],"source":["def objective(trial):\n","\n"," # Define hyperparameters outside the wandb.init to use them later in the code\n"," learning_rate = trial.suggest_float('learning_rate', 2e-4, 5e-4, log=True)\n"," num_train_epochs = trial.suggest_categorical('num_train_epochs', [4, 6, 8])\n"," per_device_train_batch_size = trial.suggest_categorical('per_device_train_batch_size', [16, 32])\n"," warmup_steps = trial.suggest_int('warmup_steps', 3, 5)\n","\n"," wandb.init(\n"," project=\"Education-Chatbot-Optimization\",\n"," entity=\"szehanz\",\n"," group=\"optuna-optimization\",\n"," job_type=\"hyperparameter_search\",\n"," reinit=True,\n"," config={\n"," \"learning_rate\": learning_rate,\n"," \"num_train_epochs\": num_train_epochs,\n"," \"per_device_train_batch_size\": per_device_train_batch_size,\n"," \"warmup_steps\": warmup_steps\n"," }\n"," )\n","\n"," # Format the current date and time\n"," current_time = datetime.now().strftime(\"%Y%m%d-%H%M%S\")\n"," output_dir = f\"train_out_dir_{current_time}\" # Append the current date and time to the directory name\n","\n"," # Create the output directory\n"," os.makedirs(output_dir, exist_ok=True) # Using exist_ok=True to avoid error if the directory already exists\n","\n","\n"," # Define TrainingArguments with the suggested hyperparameters\n"," training_args = TrainingArguments(\n"," output_dir=output_dir, # Directory for saving output 
models and checkpoints.\n"," save_strategy=\"steps\", # Save model checkpoints at regular step intervals.\n"," save_steps=10, # Save model checkpoints every 10 steps.\n"," learning_rate=learning_rate, # Initial learning rate for the optimizer.\n"," per_device_train_batch_size=per_device_train_batch_size, # Batch size per device during training.\n"," per_device_eval_batch_size=8, # Batch size per device during evaluation.\n"," num_train_epochs=num_train_epochs, # Total number of training epochs.\n"," warmup_steps=warmup_steps, # Number of warmup steps for the learning rate scheduler.\n"," evaluation_strategy='steps', # Perform evaluation at regular step intervals.\n"," eval_steps=10, # Perform evaluation every 10 steps.\n"," logging_steps=10, # Log training metrics every 10 steps.\n"," optim='paged_adamw_8bit', # Use the paged 8-bit AdamW optimizer.\n"," lr_scheduler_type='linear', # Type of learning rate scheduler.\n"," gradient_accumulation_steps=1, # Number of steps to accumulate gradients before performing an update.\n"," load_best_model_at_end=True, # Load the best model based on evaluation metric at the end of training.\n"," report_to='wandb', # Report training and evaluation metrics to Weights & Biases.\n"," )\n","\n","\n"," # Initialize the Trainer with early stopping callback inside the objective function\n"," trainer = SFTTrainer(\n"," model=model, # Ensure a function or a mechanism to initialize your model\n"," train_dataset=train_dataset,\n"," eval_dataset=val_dataset,\n"," peft_config=peft_config,\n"," dataset_text_field=\"Instruction\",\n"," tokenizer=tokenizer,\n"," args=training_args,\n"," max_seq_length=4096,\n"," callbacks=[EarlyStoppingCallback(early_stopping_patience=3)],\n"," )\n","\n"," # Train the model and evaluate within the objective function\n"," trainer.train()\n"," eval_result = trainer.evaluate()\n","\n"," # Log the primary metric to WandB\n"," wandb.log({\"eval_loss\": eval_result[\"eval_loss\"]})\n","\n"," # Finish the WandB run for this trial\n"," wandb.finish()\n","\n"," #
Return the metric to be optimized\n"," return eval_result[\"eval_loss\"]\n","\n","\n","# Run the optimization\n","study = optuna.create_study(direction='minimize')\n","study.optimize(objective, n_trials=10)"]},{"cell_type":"code","execution_count":null,"metadata":{"id":"fmdlQTVSHT8e","outputId":"a2935a56-5cad-4dbc-c55c-53b3b5ad1368"},"outputs":[{"name":"stdout","output_type":"stream","text":["Best trial:\n"," Value: 0.8303791284561157\n"," Params: \n"," learning_rate: 0.0003782307395143863\n"," num_train_epochs: 6\n"," per_device_train_batch_size: 16\n"," warmup_steps: 3\n"]}],"source":["# Best trial results\n","print(\"Best trial:\")\n","print(f\" Value: {study.best_trial.value}\")\n","print(\" Params: \")\n","for key, value in study.best_trial.params.items():\n"," print(f\" {key}: {value}\")"]},{"cell_type":"code","execution_count":null,"metadata":{"id":"mKlA_ahVHT8e","outputId":"6365a674-b011-48bb-94ea-7aa9d657d323","colab":{"referenced_widgets":[""]}},"outputs":[{"data":{"text/html":["Tracking run with wandb version 0.16.3"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Run data is saved locally in /home/iot/ITI110/poc-playground/Final project/wandb/run-20240219_161628-5gyifk7s"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Syncing run floating-fish-2 to Weights & Biases (docs)
"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View project at https://wandb.ai/szehanz/huggingface"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run at https://wandb.ai/szehanz/huggingface/runs/5gyifk7s"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","
\n"," \n"," \n"," [126/126 12:16, Epoch 6/6]\n","
\n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n"," \n","
Step | Training Loss | Validation Loss
10 | 2.715800 | 1.857712
20 | 1.077300 | 1.051454
30 | 0.647500 | 0.913019
40 | 0.547200 | 0.881412
50 | 0.489900 | 0.886365
60 | 0.457400 | 0.855178
70 | 0.428400 | 0.860198
80 | 0.407200 | 0.863780
90 | 0.395000 | 0.834071
100 | 0.372300 | 0.848378
110 | 0.379500 | 0.848452
120 | 0.358800 | 0.857301

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"","version_major":2,"version_minor":0},"text/plain":["VBox(children=(Label(value='0.006 MB of 0.034 MB uploaded\\r'), FloatProgress(value=0.17130191715842674, max=1.…"]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["\n","

Run history:


eval/loss | █▂▂▁▁▁▁▁▁▁▁▁
eval/runtime | ▁▄▄▆▆▆▅█▇▇▆█
eval/samples_per_second | █▅▅▃▃▃▄▁▁▂▃▁
eval/steps_per_second | █▅▅▃▃▃▄▂▂▂▃▁
train/epoch | ▁▁▂▂▂▂▃▃▃▃▄▄▅▅▅▅▆▆▆▆▇▇███
train/global_step | ▁▁▂▂▂▂▃▃▃▃▄▄▅▅▅▅▆▆▆▆▇▇███
train/learning_rate | █▇▇▆▅▅▄▄▃▂▂▁
train/loss | █▃▂▂▁▁▁▁▁▁▁▁
train/total_flos
train/train_loss
train/train_runtime
train/train_samples_per_second
train/train_steps_per_second

Run summary:


eval/loss | 0.8573
eval/runtime | 5.9051
eval/samples_per_second | 7.282
eval/steps_per_second | 1.016
train/epoch | 6.0
train/global_step | 126
train/learning_rate | 2e-05
train/loss | 0.3588
train/total_flos | 1913972332118016.0
train/train_loss | 0.67577
train/train_runtime | 746.485
train/train_samples_per_second | 2.685
train/train_steps_per_second | 0.169

"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":[" View run floating-fish-2 at: https://wandb.ai/szehanz/huggingface/runs/5gyifk7s
Synced 5 W&B file(s), 0 media file(s), 0 artifact file(s) and 0 other file(s)"],"text/plain":[""]},"metadata":{},"output_type":"display_data"},{"data":{"text/html":["Find logs at: ./wandb/run-20240219_161628-5gyifk7s/logs"],"text/plain":[""]},"metadata":{},"output_type":"display_data"}],"source":["# Use best hyperparameters from the study\n","best_trial = study.best_trial\n","\n","best_learning_rate = best_trial.params['learning_rate']\n","best_num_train_epochs = best_trial.params['num_train_epochs']\n","best_per_device_train_batch_size = best_trial.params['per_device_train_batch_size']\n","best_warmup_steps = best_trial.params['warmup_steps']\n","\n","\n","# Define TrainingArguments with the best hyperparameters for retraining\n","best_training_args = TrainingArguments(\n"," output_dir=\"best_train_out_dir\",\n"," save_strategy=\"steps\",\n"," save_steps=10,\n"," learning_rate=best_learning_rate,\n"," per_device_train_batch_size=best_per_device_train_batch_size,\n"," per_device_eval_batch_size=8,\n"," num_train_epochs=best_num_train_epochs,\n"," warmup_steps=best_warmup_steps,\n"," evaluation_strategy='steps',\n"," eval_steps=10,\n"," logging_steps=10,\n"," optim='paged_adamw_8bit',\n"," lr_scheduler_type='linear',\n"," gradient_accumulation_steps=1,\n"," load_best_model_at_end=True,\n"," report_to='wandb',\n",")\n","\n","# Reinitialize the Trainer with the best hyperparameters\n","best_trainer = SFTTrainer(\n"," model=model,\n"," train_dataset=train_dataset,\n"," eval_dataset=val_dataset,\n"," peft_config=peft_config,\n"," dataset_text_field=\"Instruction\",\n"," tokenizer=tokenizer,\n"," args=best_training_args,\n"," max_seq_length=4096,\n",")\n","\n","# Retrain the model with the best hyperparameters\n","best_trainer.train()\n","\n","\n","# Save the trained adapter weights\n","best_trainer.model.save_pretrained(new_model)\n","\n","# Finish the WandB run for the final training run\n","wandb.finish()"]},{"cell_type":"markdown","metadata":{"id":"_g0fB7P9s0ol"},"source":["Merging the base 
model with the trained adapter."]},{"cell_type":"code","execution_count":null,"metadata":{"colab":{"referenced_widgets":["aafec7a64d034e05b1aaf17bb153136b","1191c9b140394f1aa3952c1cecda8fed","68107c402ec343ffa40e22171e9fe3e9"]},"id":"QQn30cRtAZ-P","outputId":"6508be7b-0a96-494e-bd33-d35c5c331f52"},"outputs":[{"data":{"application/vnd.jupyter.widget-view+json":{"model_id":"68107c402ec343ffa40e22171e9fe3e9","version_major":2,"version_minor":0},"text/plain":["Loading checkpoint shards: 0%| | 0/2 [00:00
Copy a token from your Hugging Face\ntokens page and paste it below.
Immediately click login after copying\nyour token or it might be stored in plain text in this notebook file.
"}},"14d0ff6e07a148c0b65224af86af6b95":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"187745df1f154e7da09a0c646a22cebb":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_638517f3c94445c7b610b84744859ef8","placeholder":"​","style":"IPY_MODEL_5206f9fc25234586b735fc9c65e6b19a","value":"generation_config.json: 
100%"}},"19b8b7692cfd46eca5abb50ac2854262":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"19d3adedf1a245f69a2498b1838f6415":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_31ffba7793b944cd80bec0fb6ac2fbbe","IPY_MODEL_1ae848be349a41239a7ba6efadc82edf","IPY_MODEL_9a33d92ad0604eff9ac56944586facbb"],"layout":"IPY_MODEL_293d425206684064b0572e990a815cee"}},"1ae848be349a41239a7ba6efadc82edf":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_mod
ule_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_8ae8a9d03adc476a8dd64f09e194907d","max":2,"min":0,"orientation":"horizontal","style":"IPY_MODEL_aeb94dbf3f8a4b43bb07c833af3b98c9","value":2}},"1c6e5dd4ca3446ec8cf8a03b85ad57dc":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_0786fc506a7640a287016991f57d3768","IPY_MODEL_3af05a188f40469dbd52fda55c7a9e22","IPY_MODEL_0af30e36f2524c1084c1237948f3b18e"],"layout":"IPY_MODEL_dfe1c2648a564676bfe1e09bda61d439"}},"1ea12323b1244c768a17a6ed5420f854":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"1f9e8ddb08814b2db7761478eb9069f5":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_7982d1d315964d138f31643db445b48a","max":614,"min":0,"orientation":"horizontal","style":"IPY_MODEL_67346bc716384552a69d1afdc844db00","value":614}},"20cc750b1bde4efdb3c3a5b5321222b4":{"model_module":"@jupyter-widgets/controls",
"model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_a199f20edde447829b17804b17af2fe5","max":188,"min":0,"orientation":"horizontal","style":"IPY_MODEL_447f608108364159ab7ed546ad02559a","value":188}},"21556be54ed34b15b909bf8e7b8fd93a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b8113970ea7245e9890221d4e4cf5e8e","placeholder":"​","style":"IPY_MODEL_d3224d16458249a3bfd29253c2d6a86f","value":"\nPro Tip: If you don't already have one, you can create a dedicated\n'notebooks' token with 'write' access, that you can then easily reuse for all\nnotebooks. 
"}},"2254fe28315f43a99c3579195c0c0008":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"22e8bae9ff6744bca3e90d46d220106f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"23176fa0130348c5bc1b0deb591709f3":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"236b3417f0e94039a1a449e7ab96738b":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutMod
el","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"25543b118c8e423aa7fd8c898d5e1256":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"2924e96aa10346efb39684e5369e2170":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_bce8d1501218410ba8b042aeb3f0fc26","placeholder":"​","style":"IPY_MODEL_405603de026d484ab283f053f4b17c6d","value":"Your token has been saved to 
/root/.cache/huggingface/token"}},"293d425206684064b0572e990a815cee":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"29f303aa6ac8464aa91124c3fe659379":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":nul
l,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"2cb9cfbde1e0483c97a2c531e0034adf":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"2dce1978d19e4de3a6a1b1cef6ed518f":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_widt
h":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"3076e4abb7fe427fa4fccb43e9f3371e":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"30d47a9da70a4cf5a7e597f13d28d526":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"31ffba7793b944cd80bec0fb6ac2fbbe":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_7ee17b5879ec4909b7b616b79d5eab36","placeholder":"​","style":"IPY_MODEL_d8b847adb46d4049a4b43bba2a0e06da","value":"Downloading shards: 
100%"}},"32a57f9df45b41da91e1d638f531505d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"3af05a188f40469dbd52fda55c7a9e22":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_82829039182948efb800703e3e21d96d","max":26788,"min":0,"orientation":"horizontal","style":"IPY_MODEL_32a57f9df45b41da91e1d638f531505d","value":26788}},"3e4a06b9b13444e3b82e0c3c26e17b8f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"CheckboxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"CheckboxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"CheckboxView","description":"Add token as git 
credential?","description_tooltip":null,"disabled":false,"indent":true,"layout":"IPY_MODEL_f91ebc43c1344e8688e2eeb2771c7b65","style":"IPY_MODEL_ea00aa1eb73949fc94083f1d31372915","value":true}},"405603de026d484ab283f053f4b17c6d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"42262413163d49fd8902c275c13da1e6":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_cf370d3015b04b6caa9cbfb8f21dd5e7","placeholder":"​","style":"IPY_MODEL_1ea12323b1244c768a17a6ed5420f854","value":" 9.98G/9.98G [01:29<00:00, 
126MB/s]"}},"441c37014e6b4c8995e7a5305f96fa38":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"447f608108364159ab7ed546ad02559a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"46e41557d0bc4cb4a03c6894fcc57d0e":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"4cee3e23bcbd4643ba542d94b82dbfd2":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"L
ayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"5206f9fc25234586b735fc9c65e6b19a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"52f2b6adb9904cbeb014387429c0d1ad":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_b61ac64b2a0c4beb8ea209782b9dd09b","IPY_MODEL_9e9c2d712ae84d3aa87fbe433ae0701b","IPY_MODEL_fe4d028a78e64a4d8b47f8bc8b8ec15d"],"layout":"IPY_MODEL_88a7aeaf3b2547ea91cf20ed2f1a3dc0"}},"537d173a4313491f8f89f799f1dc7b2a":{"model_module":"@jupyter-widgets/base"
,"model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"541ef20ab6f34337a2d6d20098f6fef5":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b0f0ac261e364edd99d7b75e747e2c47","placeholder":"​","style":"IPY_MODEL_ff9f726db3434e3184e723d5da884d0a","value":"Token is valid (permission: 
write)."}},"55e506bd03d24dd7b24129dc64da9c01":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"5e5b95c9801443cdbce9c8e629c33589":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"PasswordModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"PasswordModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"PasswordView","continuous_update":true,"description":"Token:","description_tooltip":null,"disabled":false,"layout":"IPY_MODEL_29f303aa6ac8464aa91124c3fe659379","placeholder":"​","style":"IPY_MODEL_3076e4abb7fe427fa4fccb43e9f3371e","value":""}},"62f1cf19fe204aa4a424248e807ce061":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"638517f3c94445c7b610b84744859ef8":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":nu
ll,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"6430379d01874ec3a7cf9fea59c42914":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ButtonModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ButtonModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ButtonView","button_style":"","description":"Login","disabled":false,"icon":"","layout":"IPY_MODEL_2dce1978d19e4de3a6a1b1cef6ed518f","style":"IPY_MODEL_e81c501824f94e7d839684fafbc65b31","tooltip":""}},"67346bc716384552a69d1afdc844db00":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"6edf40f558f54d8b82d949f83557d609":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_08400a144d3c497a94ae4d84e72a1067","placeholder":"​","style":"IPY_M
ODEL_976a3440d2c3423c8be835b0d6f56492","value":"Connecting..."}},"77c0d63f6cd6494daf7e94cfe0397e45":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_4cee3e23bcbd4643ba542d94b82dbfd2","placeholder":"​","style":"IPY_MODEL_25543b118c8e423aa7fd8c898d5e1256","value":" 188/188 [00:00<00:00, 12.3kB/s]"}},"77daad9def9f4254933afdeb7202e5d2":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"7982d1d315964d138f31643db445b48a":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"
overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"7c701d53772344f6a057cd43742d47f5":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b9e4e59acd4d468393f42857d5635e25","placeholder":"​","style":"IPY_MODEL_30d47a9da70a4cf5a7e597f13d28d526","value":"Loading checkpoint shards: 100%"}},"7ee17b5879ec4909b7b616b79d5eab36":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"8080eb1edf8f44f69b5f218046715f36":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":
"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_e2df35aaf7e0420b9524d533cb122521","max":2,"min":0,"orientation":"horizontal","style":"IPY_MODEL_d90c3a424bac4bb09e99339c28df2eda","value":2}},"82829039182948efb800703e3e21d96d":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"83127281047a429a846b9bb3b3209760":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,
"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"86014f569ee34895badef4acfe958051":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"86616a0d2f9d4da1a75e00d092ca9d4d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_a7a17a5883784ed9961437d0102fd9a4","IPY_MODEL_9934540648a444a09f0783843ab2d901","IPY_MODEL_4226241
3163d49fd8902c275c13da1e6"],"layout":"IPY_MODEL_83127281047a429a846b9bb3b3209760"}},"88a7aeaf3b2547ea91cf20ed2f1a3dc0":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"8ae8a9d03adc476a8dd64f09e194907d":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,
"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"963973c1e970410cb2ba97e9fb4ef511":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"976a3440d2c3423c8be835b0d6f56492":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"9934540648a444a09f0783843ab2d901":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"Prog
ressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_f702ae2d46834e9b934a40abca49a990","max":9976576152,"min":0,"orientation":"horizontal","style":"IPY_MODEL_14d0ff6e07a148c0b65224af86af6b95","value":9976576152}},"9a33d92ad0604eff9ac56944586facbb":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_537d173a4313491f8f89f799f1dc7b2a","placeholder":"​","style":"IPY_MODEL_55e506bd03d24dd7b24129dc64da9c01","value":" 2/2 [01:56<00:00, 52.47s/it]"}},"9c7550ec29c34ef2b674a3667ebd740e":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_236b3417f0e94039a1a449e7ab96738b","placeholder":"​","style":"IPY_MODEL_77daad9def9f4254933afdeb7202e5d2","value":"config.json: 
100%"}},"9e4d26b87e1243f3ae744ddbbe2c5b51":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"9e9c2d712ae84d3aa87fbe433ae0701b":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"FloatProgressModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"FloatProgressModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"ProgressView","bar_style":"success","description":"","description_tooltip":null,"layout":"IPY_MODEL_86014f569ee34895badef4acfe958051","max":3500296424,"min":0,"orientation":"horizontal","style":"IPY_MODEL_22e8bae9ff6744bca3e90d46d220106f","value":3500296424}},"a066a28914bc46cc85ebcb7d90d27fbe":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"ove
rflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"a199f20edde447829b17804b17af2fe5":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"a39b2a49ff814181aa0df84c57e9245b":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"a7a17a5883784ed9961437d0102fd9a4":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_ae17
3a50f4374c7fa55d8e360d30d4d1","placeholder":"​","style":"IPY_MODEL_b93a21bcaf6344c49a0ab8e2a887922c","value":"model-00001-of-00002.safetensors: 100%"}},"a7ef8ff133144d4b9817800e5b4739a4":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"a9628200c0a34cccacdb0ea66e200b87":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_0cda6495ef4b4c9ab3eeff2392d89163","placeholder":"​","style":"IPY_MODEL_9e4d26b87e1243f3ae744ddbbe2c5b51","value":" 2/2 [00:58<00:00, 
26.87s/it]"}},"a96de5c1f80f43d8b0b689d5db0fa248":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"ae173a50f4374c7fa55d8e360d30d4d1":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"aeb94dbf3f8a4b43bb07c833af3b98c9":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"b0f0ac261e364edd99d7b75e747e2c47":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":
"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"b4ed4d5e266a4cd5b9011436b4e92951":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_7c701d53772344f6a057cd43742d47f5","IPY_MODEL_8080eb1edf8f44f69b5f218046715f36","IPY_MODEL_a9628200c0a34cccacdb0ea66e200b87"],"layout":"IPY_MODEL_2254fe28315f43a99c3579195c0c0008"}},"b577aeffe358446ea41e4b3c58b80641":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_441c37014e6b4c8995e7a5305f96fa38","p
laceholder":"​","style":"IPY_MODEL_a39b2a49ff814181aa0df84c57e9245b","value":" 614/614 [00:00<00:00, 15.7kB/s]"}},"b61ac64b2a0c4beb8ea209782b9dd09b":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_a066a28914bc46cc85ebcb7d90d27fbe","placeholder":"​","style":"IPY_MODEL_a96de5c1f80f43d8b0b689d5db0fa248","value":"model-00002-of-00002.safetensors: 100%"}},"b634ae14caf84bc99dcafd3a70b7a383":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"b64f26ac024c46eabfc4728586369130":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_
name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_a7ef8ff133144d4b9817800e5b4739a4","placeholder":"​","style":"IPY_MODEL_ba4cf32b2f71428282721e7818b34a5a","value":"Login successful"}},"b8113970ea7245e9890221d4e4cf5e8e":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"b93a21bcaf6344c49a0ab8e2a887922c":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"b9e4e59acd4d468393f42857d5635e25":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_mo
del_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"ba4cf32b2f71428282721e7818b34a5a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"bce8d1501218410ba8b042aeb3f0fc26":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":
null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"c3d08bc595a74c3180a7a83afc569584":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"VBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"VBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"VBoxView","box_style":"","children":["IPY_MODEL_541ef20ab6f34337a2d6d20098f6fef5","IPY_MODEL_fe46e1cf697f4b1fab764104be32da95","IPY_MODEL_2924e96aa10346efb39684e5369e2170","IPY_MODEL_b64f26ac024c46eabfc4728586369130"],"layout":"IPY_MODEL_cad60b6f14f249c187d573dd3a4428e0"}},"ca8d07b9b454471e883d43e03b475a9f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"cad60b6f14f249c187d573dd3a4428e0":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":"center","align_self":null,"border":null,"bottom":null,"display":"flex","flex":null,"flex_flow":"column","grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_col
umns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":"50%"}},"cf370d3015b04b6caa9cbfb8f21dd5e7":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"d3224d16458249a3bfd29253c2d6a86f":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"d8b847adb46d4049a4b43bba2a0e06da":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-w
idgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"d90c3a424bac4bb09e99339c28df2eda":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ProgressStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ProgressStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","bar_color":null,"description_width":""}},"de756c426cf0492bb122a45b94d4bbe7":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"df0d8231953b44f1bf8e6d2cd39b2f66":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HBoxModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HBoxView","box_style":"","children":["IPY_MODEL_9c7550ec29c34ef2b674a3667ebd740e","IPY_MODEL_1f9e8ddb08814b2db7761478eb9069f5","IPY_MODEL_b577aeffe358446ea41e4b3c58b80641"],"layout":"IPY_MODEL_19b8b7692cfd46eca5abb50ac2854262"}},"dfe1c2648a564676bfe1e09bda61d439":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"
LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"e2df35aaf7e0420b9524d533cb122521":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"e81c501824f94e7d839684fafbc65b31":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"ButtonStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"ButtonStyleModel","_view_co
unt":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","button_color":null,"font_weight":""}},"ea00aa1eb73949fc94083f1d31372915":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}},"f702ae2d46834e9b934a40abca49a990":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"f81ada25e7ff4f5da6b3f6c6e73590e4":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self"
:null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"f91ebc43c1344e8688e2eeb2771c7b65":{"model_module":"@jupyter-widgets/base","model_module_version":"1.2.0","model_name":"LayoutModel","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"fe46e1cf697f4b1fab764104be32da95":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"LabelModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"LabelModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_vie
w_module_version":"1.5.0","_view_name":"LabelView","description":"","description_tooltip":null,"layout":"IPY_MODEL_f81ada25e7ff4f5da6b3f6c6e73590e4","placeholder":"​","style":"IPY_MODEL_62f1cf19fe204aa4a424248e807ce061","value":"Your token has been saved in your configured git credential helpers (store)."}},"fe4d028a78e64a4d8b47f8bc8b8ec15d":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"HTMLModel","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"HTMLModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"HTMLView","description":"","description_tooltip":null,"layout":"IPY_MODEL_b634ae14caf84bc99dcafd3a70b7a383","placeholder":"​","style":"IPY_MODEL_46e41557d0bc4cb4a03c6894fcc57d0e","value":" 3.50G/3.50G [00:26<00:00, 176MB/s]"}},"ff9f726db3434e3184e723d5da884d0a":{"model_module":"@jupyter-widgets/controls","model_module_version":"1.5.0","model_name":"DescriptionStyleModel","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"DescriptionStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":""}}}}},"nbformat":4,"nbformat_minor":0} \ No newline at end of file