Nikita Kezins's picture

Nikita Kezins

entfane

·

AI & ML interests

LLM post-training, adversarial training, safety, knowledge transfer

Recent Activity

updated a model 1 day ago

entfane/toxic_gemma2b_classifier

published a model 1 day ago

entfane/toxic_gemma2b_classifier

upvoted a paper 13 days ago

Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards

View all activity

Organizations

New activity in huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated 19 days ago

Как создавать изображения ?

#9 opened 20 days ago by

New activity in mistralai/Voxtral-Mini-4B-Realtime-2602 25 days ago

How to add another language ?

#22 opened about 1 month ago by

TheRealTancrede

New activity in lmstudio-community/DeepSeek-R1-Distill-Qwen-7B-GGUF 4 months ago

🚩 Report: Ethical issue(s)

#4 opened about 1 year ago by

New activity in openai/gpt-oss-20b 4 months ago

so much censorship

#48 opened 8 months ago by

New activity in moonshotai/Kimi-K2-Thinking 4 months ago

Token Count Calculation in SFT Data Distribution Curation

#31 opened 4 months ago by

New activity in Qwen/Qwen2.5-3B 4 months ago

Is it actually a base model?

#6 opened 4 months ago by

New activity in openai/gpt-oss-20b 7 months ago

CUDA out of memory issues when running gptoss model on colab T4

#99 opened 8 months ago by

Not able to deploy gpt-oss-20b model in A100s

#124 opened 7 months ago by

Unable to load gpt-oss-20b on dual L40 (48GB) GPUs with vLLM

#136 opened 7 months ago by

New activity in ethicalabs/computer-says-no 7 months ago

Diversity of responses

#2 opened 7 months ago by

New activity in yasserrmd/gpt-oss-coder-20b 7 months ago

Reasoning effort during training

#1 opened 7 months ago by

New activity in openai/gpt-oss-20b 7 months ago

NVIDIA L40S GPU's for MXFP4 quantization

#100 opened 7 months ago by

New activity in openai/gpt-oss-20b 8 months ago

question: setting reasoning effort

#66 opened 8 months ago by

New activity in QuixiAI/dolphin-r1 8 months ago

creation process?

#7 opened about 1 year ago by

New activity in openai/gpt-oss-20b 8 months ago

Thinking but no solution?

#54 opened 8 months ago by

OOM on 3090

#60 opened 8 months ago by

New activity in suriya7/t5-base-text-to-sql 8 months ago

french to sql model

#2 opened 9 months ago by

New activity in Qwen/Qwen3-Reranker-0.6B 8 months ago

reranker0.6b and embedding0.6b are the same model weights？

#6 opened 10 months ago by

New activity in ScienceOne-AI/S1-Base-8B 8 months ago

Benchmarks

#1 opened 8 months ago by

New activity in HuggingFaceTB/SmolLM2-135M-Instruct 8 months ago

Release of SFT tuned model

#8 opened about 1 year ago by