Update data/gaia_validation_20.jsonl
Browse files
data/gaia_validation_20.jsonl
CHANGED
|
@@ -7,6 +7,7 @@
|
|
| 7 |
{"task_id": "cf106601-ab4f-4af9-b045-5295fe67b37d", "Question": "What country had the least number of athletes at the 1928 Summer Olympics? If there's a tie for a number of athletes, return the first in alphabetical order. Give the IOC country code as your answer.", "Level": 1, "file_name": "", "Final answer": "CUB"}
|
| 8 |
{"task_id": "a0c07678-e491-4bbc-8f0b-07405144218f", "Question": "Who are the pitchers with the number before and after Taish\u014d Tamai's number as of July 2023? Give them to me in the form Pitcher Before, Pitcher After, use their last names only, in Roman characters.", "Level": 1, "file_name": "", "Final answer": "Yoshida, Uehara"}
|
| 9 |
{"task_id": "5a0c1adf-205e-4841-a666-7c3ef95def9d", "Question": "What is the first name of the only Malko Competition recipient from the 20th Century (after 1977) whose nationality on record is a country that no longer exists?", "Level": 1, "file_name": "", "Final answer": "Claus"}
|
|
|
|
| 10 |
{"task_id": "a1e91b78-d3d8-4675-bb8d-62741b4b68a6", "Question": "In the video https://www.youtube.com/watch?v=L1vXCYZAYYM, what is the highest number of bird species to be on camera simultaneously?", "Level": 1, "file_name": "", "Final answer": "3"}
|
| 11 |
{"task_id": "cca530fc-4052-43b2-b130-b30968d8aa44", "Question": "Review the chess position provided in the image. It is black's turn. Provide the correct next move for black which guarantees a win. Please provide your response in algebraic notation.", "Level": 1, "file_name": "cca530fc-4052-43b2-b130-b30968d8aa44.png", "Final answer": "Rd5"}
|
| 12 |
{"task_id": "6f37996b-2ac7-44b0-8e68-6d28256631b4", "Question": "Given this table defining * on the set S = {a, b, c, d, e}\n\n|*|a|b|c|d|e|\n|---|---|---|---|---|---|\n|a|a|b|c|b|d|\n|b|b|c|a|e|c|\n|c|c|a|b|b|a|\n|d|b|e|b|e|d|\n|e|d|b|a|d|c|\n\nprovide the subset of S involved in any possible counter-examples that prove * is not commutative. Provide your answer as a comma separated list of the elements in the set in alphabetical order.", "Level": 1, "file_name": "", "Final answer": "b, e"}
|
|
@@ -16,5 +17,4 @@
|
|
| 16 |
{"task_id": "f918266a-b3e0-4914-865d-4faa564f1aef", "Question": "What is the final numeric output from the attached Python code?", "Level": 1, "file_name": "f918266a-b3e0-4914-865d-4faa564f1aef.py", "Final answer": "0"}
|
| 17 |
{"task_id": "1f975693-876d-457b-a649-393859e79bf3", "Question": "Hi, I was out sick from my classes on Friday, so I'm trying to figure out what I need to study for my Calculus mid-term next week. My friend from class sent me an audio recording of Professor Willowbrook giving out the recommended reading for the test, but my headphones are broken :(\n\nCould you please listen to the recording for me and tell me the page numbers I'm supposed to go over? I've attached a file called Homework.mp3 that has the recording. Please provide just the page numbers as a comma-delimited list. And please provide the list in ascending order.", "Level": 1, "file_name": "1f975693-876d-457b-a649-393859e79bf3.mp3", "Final answer": "132, 133, 134, 197, 245"}
|
| 18 |
{"task_id": "840bfca7-4f7b-481a-8794-c560c340185d", "Question": "On June 6, 2023, an article by Carolyn Collins Petersen was published in Universe Today. This article mentions a team that produced a paper about their observations, linked at the bottom of the article. Find this paper. Under what NASA award number was the work performed by R. G. Arendt supported by?", "Level": 1, "file_name": "", "Final answer": "80GSFC21M0002"}
|
| 19 |
-
{"task_id": "bda648d7-d618-4883-88f4-3466eabd860e", "Question": "Where were the Vietnamese specimens described by Kuznetzov in Nedoshivina's 2010 paper eventually deposited? Just give me the city name without abbreviations.", "Level": 1, "file_name": "", "Final answer": "Saint Petersburg"}
|
| 20 |
{"task_id": "7bd855d8-463d-4ed5-93ca-5fe35145f733", "Question": "The attached Excel file contains the sales of menu items for a local fast-food chain. What were the total sales that the chain made from food (not including drinks)? Express your answer in USD with two decimal places.", "Level": 1, "file_name": "7bd855d8-463d-4ed5-93ca-5fe35145f733.xlsx", "Final answer": "89706.00"}
|
|
|
|
| 7 |
{"task_id": "cf106601-ab4f-4af9-b045-5295fe67b37d", "Question": "What country had the least number of athletes at the 1928 Summer Olympics? If there's a tie for a number of athletes, return the first in alphabetical order. Give the IOC country code as your answer.", "Level": 1, "file_name": "", "Final answer": "CUB"}
|
| 8 |
{"task_id": "a0c07678-e491-4bbc-8f0b-07405144218f", "Question": "Who are the pitchers with the number before and after Taish\u014d Tamai's number as of July 2023? Give them to me in the form Pitcher Before, Pitcher After, use their last names only, in Roman characters.", "Level": 1, "file_name": "", "Final answer": "Yoshida, Uehara"}
|
| 9 |
{"task_id": "5a0c1adf-205e-4841-a666-7c3ef95def9d", "Question": "What is the first name of the only Malko Competition recipient from the 20th Century (after 1977) whose nationality on record is a country that no longer exists?", "Level": 1, "file_name": "", "Final answer": "Claus"}
|
| 10 |
+
{"task_id": "bda648d7-d618-4883-88f4-3466eabd860e", "Question": "Where were the Vietnamese specimens described by Kuznetzov in Nedoshivina's 2010 paper eventually deposited? Just give me the city name without abbreviations.", "Level": 1, "file_name": "", "Final answer": "Saint Petersburg"}
|
| 11 |
{"task_id": "a1e91b78-d3d8-4675-bb8d-62741b4b68a6", "Question": "In the video https://www.youtube.com/watch?v=L1vXCYZAYYM, what is the highest number of bird species to be on camera simultaneously?", "Level": 1, "file_name": "", "Final answer": "3"}
|
| 12 |
{"task_id": "cca530fc-4052-43b2-b130-b30968d8aa44", "Question": "Review the chess position provided in the image. It is black's turn. Provide the correct next move for black which guarantees a win. Please provide your response in algebraic notation.", "Level": 1, "file_name": "cca530fc-4052-43b2-b130-b30968d8aa44.png", "Final answer": "Rd5"}
|
| 13 |
{"task_id": "6f37996b-2ac7-44b0-8e68-6d28256631b4", "Question": "Given this table defining * on the set S = {a, b, c, d, e}\n\n|*|a|b|c|d|e|\n|---|---|---|---|---|---|\n|a|a|b|c|b|d|\n|b|b|c|a|e|c|\n|c|c|a|b|b|a|\n|d|b|e|b|e|d|\n|e|d|b|a|d|c|\n\nprovide the subset of S involved in any possible counter-examples that prove * is not commutative. Provide your answer as a comma separated list of the elements in the set in alphabetical order.", "Level": 1, "file_name": "", "Final answer": "b, e"}
|
|
|
|
| 17 |
{"task_id": "f918266a-b3e0-4914-865d-4faa564f1aef", "Question": "What is the final numeric output from the attached Python code?", "Level": 1, "file_name": "f918266a-b3e0-4914-865d-4faa564f1aef.py", "Final answer": "0"}
|
| 18 |
{"task_id": "1f975693-876d-457b-a649-393859e79bf3", "Question": "Hi, I was out sick from my classes on Friday, so I'm trying to figure out what I need to study for my Calculus mid-term next week. My friend from class sent me an audio recording of Professor Willowbrook giving out the recommended reading for the test, but my headphones are broken :(\n\nCould you please listen to the recording for me and tell me the page numbers I'm supposed to go over? I've attached a file called Homework.mp3 that has the recording. Please provide just the page numbers as a comma-delimited list. And please provide the list in ascending order.", "Level": 1, "file_name": "1f975693-876d-457b-a649-393859e79bf3.mp3", "Final answer": "132, 133, 134, 197, 245"}
|
| 19 |
{"task_id": "840bfca7-4f7b-481a-8794-c560c340185d", "Question": "On June 6, 2023, an article by Carolyn Collins Petersen was published in Universe Today. This article mentions a team that produced a paper about their observations, linked at the bottom of the article. Find this paper. Under what NASA award number was the work performed by R. G. Arendt supported by?", "Level": 1, "file_name": "", "Final answer": "80GSFC21M0002"}
|
|
|
|
| 20 |
{"task_id": "7bd855d8-463d-4ed5-93ca-5fe35145f733", "Question": "The attached Excel file contains the sales of menu items for a local fast-food chain. What were the total sales that the chain made from food (not including drinks)? Express your answer in USD with two decimal places.", "Level": 1, "file_name": "7bd855d8-463d-4ed5-93ca-5fe35145f733.xlsx", "Final answer": "89706.00"}
|