bigcodebench-leaderboard

Running

App Files Files Community

Terry Zhuo commited on Jun 11, 2024

Commit

e122c3e

1 Parent(s): 3a9e36a

fix typos

Browse files

Files changed (2) hide show

README.md +8 -0
app.py +1 -12

README.md CHANGED Viewed

@@ -4,7 +4,15 @@ emoji: 🥇
 colorFrom: green
 colorTo: indigo
 sdk: gradio
 app_file: app.py
 pinned: false
 license: apache-2.0
 ---

 colorFrom: green
 colorTo: indigo
 sdk: gradio
+sdk_version: 4.36.1
+app_file: app.py
+disable_embedding: true
 app_file: app.py
 pinned: false
 license: apache-2.0
+tags:
+  - leaderboard
+  - eval:code
+  - test:public
+  - judge:auto
 ---

app.py CHANGED Viewed

@@ -124,17 +124,13 @@ def search_table(df, leaderboard_table, query):
 df = make_clickable_names(df)
-#            <div style='background-color: #F5F1CB; text-align: center; padding: 10px;'>
-#                <p><b>Warning</b>: This leaderboard is not regularily updated with the latest instruction-tuned code models, check the <b>Submit Results</b> section for submitting new evaluation results.
-#            You can also check other code leaderboards like <a href="https://evalplus.github.io/leaderboard.html">EvalPlus</a> & <a href="https://huggingface.co/spaces/mike-ravkine/can-ai-code-results">Can-AI-Code</a> .</p>
-#            </div>
 demo = gr.Blocks(css=custom_css)
 with demo:
     with gr.Row():
         gr.Markdown(
             """<div style="text-align: center;"><h1> 🌸<span style='color: #A74E95;'>Big</span><span style='color: #C867B5;'>Code</span><span style='color: #DD71C8;'>Bench</span> Leaderboard🌸</h1></div>\
             <br>\
-            <p>Inspired from the <a href="https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard">🤗 Open LLM Leaderboard</a> and <a href="https://huggingface.co/spaces/bigcode/bigcode-models-leaderboard">🤗 Big Code Models Leaderboard 🏋️</a>, we compare performance of LLMs on <a href="https://huggingface.co/datasets/bigcode/bigcodebench">BigCodeBench</a> benchmark.</p>
 """,
             elem_classes="markdown-text",
         )
@@ -223,13 +219,6 @@ with demo:
                         [hidden_leaderboard_df, shown_columns],
                         leaderboard_df,
                     )
-                    # <li>
-                    # <i>Complete</i> vs <i>Instruct</i>:
-                    #     <br />
-                    #     <i><strong><u>Complete</u></strong></i>: Code Completion based on the (verbose) structured docstring. This variant tests if the models are good at coding.
-                    #     <br />
-                    #     <i><strong><u>Instruct</u></i> (🔥Vibe Check🔥)</strong>: Code Generation based on the (less verbose) NL-oriented instructions. This variant tests if the models are really capable enough to understand human intents to code.
-                    # </li>
                     gr.Markdown(
                         """
                     **Notes:**

 df = make_clickable_names(df)
 demo = gr.Blocks(css=custom_css)
 with demo:
     with gr.Row():
         gr.Markdown(
             """<div style="text-align: center;"><h1> 🌸<span style='color: #A74E95;'>Big</span><span style='color: #C867B5;'>Code</span><span style='color: #DD71C8;'>Bench</span> Leaderboard🌸</h1></div>\
             <br>\
+            <p>Inspired from the <a href="https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard">🤗 Open LLM Leaderboard</a> and <a href="https://huggingface.co/spaces/bigcode/bigcode-models-leaderboard">⭐ Big Code Models Leaderboard</a>, we compare performance of LLMs on <a href="https://huggingface.co/datasets/bigcode/bigcodebench">BigCodeBench</a> benchmark.</p>
 """,
             elem_classes="markdown-text",
         )
                         [hidden_leaderboard_df, shown_columns],
                         leaderboard_df,
                     )
                     gr.Markdown(
                         """
                     **Notes:**