Asankhaya Sharma's picture

Asankhaya Sharma PRO

codelion

·

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and PTS. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

reacted to their post with ❤️ 1 day ago

Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora

reacted to their post with 👀 1 day ago

Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora

reacted to their post with 🚀 1 day ago

Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora

View all activity

Organizations

codelion 's datasets 28

codelion/gemma-3-270m-icm-dpo

Viewer • Updated 2 days ago • 1.11k • 71 • 1

codelion/Qwen2.5-Coder-0.5B-Instruct-security-preference

Viewer • Updated Aug 2 • 245 • 120

codelion/Qwen2.5-Coder-0.5B-Instruct-progressive-2M-context

Viewer • Updated Jul 20 • 400 • 23

codelion/Llama-3.2-1B-Instruct-magpie-tool-calling

Viewer • Updated Jul 18 • 1.2k • 41 • 1

codelion/Qwen3-0.6B-icm-dpo-pairs

Viewer • Updated Jul 18 • 122 • 26

codelion/Qwen3-0.6B-icm

Viewer • Updated Jul 18 • 500 • 19 • 1

codelion/gemma-3-1b-it-magpie-reasoning

Viewer • Updated Jul 18 • 131 • 85 • 1

codelion/Qwen3-0.6B-magpie

Viewer • Updated Jul 12 • 735 • 39 • 1

codelion/Qwen3-0.6B-pts-thought-anchors

Viewer • Updated Jul 10 • 148 • 37 • 2

codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-thought-anchors

Viewer • Updated Jul 10 • 110 • 26 • 1

codelion/fineweb-edu-1B

Viewer • Updated Jul 7 • 970k • 132

codelion/dclm-baseline-1B

Viewer • Updated Jul 7 • 774k • 29

codelion/fineweb-edu-100M

Viewer • Updated Jul 6 • 115k • 151

codelion/dclm-baseline-100M

Viewer • Updated Jul 6 • 77.2k • 22

codelion/fineweb-edu-10M

Viewer • Updated Jul 6 • 9.46k • 28

codelion/dclm-baseline-10M

Viewer • Updated Jul 6 • 7.95k • 20

codelion/Qwen3-0.6B-pts-dpo-pairs

Viewer • Updated May 19 • 681 • 36 • 2

codelion/Qwen3-0.6B-pts-steering-vectors

Viewer • Updated May 19 • 1.38k • 20 • 4

codelion/Qwen3-0.6B-pts

Viewer • Updated May 19 • 1.38k • 27 • 2

codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-steering-vectors

Preview • Updated May 13 • 6 • 1

codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts

Preview • Updated May 13 • 17 • 1

codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-dpo-pairs

Preview • Updated May 13 • 6 • 1

codelion/math500-cot-experiment

Viewer • Updated Apr 30 • 1.5k • 27 • 5

codelion/optillmbench

Viewer • Updated Apr 15 • 500 • 87 • 5

codelion/distilled-QwQ-32B-fineweb-edu

Preview • Updated Apr 13 • 1 • 1

codelion/optillm-router-dataset

Viewer • Updated Apr 12 • 2.81k • 26 • 6

codelion/Sky-T1_data_17k

Viewer • Updated Jan 11 • 16.4k • 6 • 1

codelion/worker-safety-qa-eval

Viewer • Updated Jun 20, 2024 • 34 • 71 • 3