Personal Wiki
Search
Search
Dark mode
Light mode
Explorer
Tag: fine-tuning
8 items with this tag.
May 27, 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
reinforcement-learning
llm
deep-learning
chain-of-thought
grpo
reward-modeling
reasoning
fine-tuning
May 27, 2026
Generative AI with Diffusion Models
deep-learning
neural-network
transformer
tutorial
fine-tuning
May 27, 2026
Software Development
deep-learning
transformer
gpu-acceleration
llm
fine-tuning
trustworthy-ai
May 27, 2026
An Introduction to Large Language Models: Prompt Engineering and P-Tuning
llm
prompt-engineering
chain-of-thought
reasoning
fine-tuning
peft
nvidia-nemo
zero-shot-learning
May 27, 2026
An Introduction to Large Language Models: Prompt Engineering and P-Tuning (Cognition)
llm
prompt-engineering
chain-of-thought
reasoning
fine-tuning
peft
nvidia-nemo
zero-shot-learning
May 27, 2026
Are Large Language Models In-Context Graph Learners?
llm
rag
prompt-engineering
zero-shot-learning
chain-of-thought
fine-tuning
May 27, 2026
Data Flywheel: What It Is and How It Works
data-flywheel
fine-tuning
lora
peft
llm
rag
guardrails
nvidia-nemo
nvidia-nim
nvidia-blueprints
agent-evaluation
agentic-ai
human-in-the-loop
knowledge-distillation
llmops
May 27, 2026
How to Make Your LLM More Accurate with RAG & Fine-Tuning
rag
fine-tuning
llm
vector-search
embedding
langchain
lora
hallucination
peft