Personal Wiki
Search
Search
Dark mode
Light mode
Explorer
Tag: fine-tuning
2 items with this tag.
Apr 24, 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
reinforcement-learning
llm
deep-learning
chain-of-thought
grpo
reward-modeling
reasoning
fine-tuning
Apr 24, 2026
Generative AI LLM Exam Study Guide
transformer
llm
deep-learning
rag
fine-tuning
peft
rlhf
inference-optimization
quantization
trustworthy-ai