Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
98.0
TFLOPS
69
111
224
Asankhaya Sharma
PRO
codelion
Follow
rocco987's profile picture
grady6329's profile picture
tripp6547's profile picture
323 followers
·
16 following
http://asankhaya.github.io/
asankhaya
codelion
asankhaya
AI & ML interests
Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and PTS. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.
Recent Activity
reacted
to
their
post
with ❤️
1 day ago
Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora
reacted
to
their
post
with 👀
1 day ago
Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora
reacted
to
their
post
with 🚀
1 day ago
Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora
View all activity
Organizations
codelion
's datasets
28
Sort: Recently updated
codelion/gemma-3-270m-icm-dpo
Viewer
•
Updated
2 days ago
•
1.11k
•
71
•
1
codelion/Qwen2.5-Coder-0.5B-Instruct-security-preference
Viewer
•
Updated
Aug 2
•
245
•
120
codelion/Qwen2.5-Coder-0.5B-Instruct-progressive-2M-context
Viewer
•
Updated
Jul 20
•
400
•
23
codelion/Llama-3.2-1B-Instruct-magpie-tool-calling
Viewer
•
Updated
Jul 18
•
1.2k
•
41
•
1
codelion/Qwen3-0.6B-icm-dpo-pairs
Viewer
•
Updated
Jul 18
•
122
•
26
codelion/Qwen3-0.6B-icm
Viewer
•
Updated
Jul 18
•
500
•
19
•
1
codelion/gemma-3-1b-it-magpie-reasoning
Viewer
•
Updated
Jul 18
•
131
•
85
•
1
codelion/Qwen3-0.6B-magpie
Viewer
•
Updated
Jul 12
•
735
•
39
•
1
codelion/Qwen3-0.6B-pts-thought-anchors
Viewer
•
Updated
Jul 10
•
148
•
37
•
2
codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-thought-anchors
Viewer
•
Updated
Jul 10
•
110
•
26
•
1
codelion/fineweb-edu-1B
Viewer
•
Updated
Jul 7
•
970k
•
132
codelion/dclm-baseline-1B
Viewer
•
Updated
Jul 7
•
774k
•
29
codelion/fineweb-edu-100M
Viewer
•
Updated
Jul 6
•
115k
•
151
codelion/dclm-baseline-100M
Viewer
•
Updated
Jul 6
•
77.2k
•
22
codelion/fineweb-edu-10M
Viewer
•
Updated
Jul 6
•
9.46k
•
28
codelion/dclm-baseline-10M
Viewer
•
Updated
Jul 6
•
7.95k
•
20
codelion/Qwen3-0.6B-pts-dpo-pairs
Viewer
•
Updated
May 19
•
681
•
36
•
2
codelion/Qwen3-0.6B-pts-steering-vectors
Viewer
•
Updated
May 19
•
1.38k
•
20
•
4
codelion/Qwen3-0.6B-pts
Viewer
•
Updated
May 19
•
1.38k
•
27
•
2
codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-steering-vectors
Preview
•
Updated
May 13
•
6
•
1
codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts
Preview
•
Updated
May 13
•
17
•
1
codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-dpo-pairs
Preview
•
Updated
May 13
•
6
•
1
codelion/math500-cot-experiment
Viewer
•
Updated
Apr 30
•
1.5k
•
27
•
5
codelion/optillmbench
Viewer
•
Updated
Apr 15
•
500
•
87
•
5
codelion/distilled-QwQ-32B-fineweb-edu
Preview
•
Updated
Apr 13
•
1
•
1
codelion/optillm-router-dataset
Viewer
•
Updated
Apr 12
•
2.81k
•
26
•
6
codelion/Sky-T1_data_17k
Viewer
•
Updated
Jan 11
•
16.4k
•
6
•
1
codelion/worker-safety-qa-eval
Viewer
•
Updated
Jun 20, 2024
•
34
•
71
•
3