Personal Wiki

Tag: fine-tuning

8 items with this tag.

  • May 27, 2026

    DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    • reinforcement-learning
    • llm
    • deep-learning
    • chain-of-thought
    • grpo
    • reward-modeling
    • reasoning
    • fine-tuning
  • May 27, 2026

    Generative AI with Diffusion Models

    • deep-learning
    • neural-network
    • transformer
    • tutorial
    • fine-tuning
  • May 27, 2026

    Software Development

    • deep-learning
    • transformer
    • gpu-acceleration
    • llm
    • fine-tuning
    • trustworthy-ai
  • May 27, 2026

    An Introduction to Large Language Models: Prompt Engineering and P-Tuning

    • llm
    • prompt-engineering
    • chain-of-thought
    • reasoning
    • fine-tuning
    • peft
    • nvidia-nemo
    • zero-shot-learning
  • May 27, 2026

    An Introduction to Large Language Models: Prompt Engineering and P-Tuning (Cognition)

    • llm
    • prompt-engineering
    • chain-of-thought
    • reasoning
    • fine-tuning
    • peft
    • nvidia-nemo
    • zero-shot-learning
  • May 27, 2026

    Are Large Language Models In-Context Graph Learners?

    • llm
    • rag
    • prompt-engineering
    • zero-shot-learning
    • chain-of-thought
    • fine-tuning
  • May 27, 2026

    Data Flywheel: What It Is and How It Works

    • data-flywheel
    • fine-tuning
    • lora
    • peft
    • llm
    • rag
    • guardrails
    • nvidia-nemo
    • nvidia-nim
    • nvidia-blueprints
    • agent-evaluation
    • agentic-ai
    • human-in-the-loop
    • knowledge-distillation
    • llmops
  • May 27, 2026

    How to Make Your LLM More Accurate with RAG & Fine-Tuning

    • rag
    • fine-tuning
    • llm
    • vector-search
    • embedding
    • langchain
    • lora
    • hallucination
    • peft
