Personal Wiki
Tag: llm
135 items with this tag.
May 27, 2026
Building Agentic AI Applications with LLMs
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Generative AI LLM Exam Study Guide
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Experimentation
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
NCP-AAI Part 0: Agentic AI — Foundations, Architecture, and Ethics
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
NVIDIA DLI: Building Agentic AI Applications with LLMs
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
NCP-AAI Part 4 — Building Retriever Nodes: Hands-On Assessment Study Guide
agentic-ai
rag
structured-output
pydantic
prompt-engineering
llm
langchain
python
tool-calling
vector-search
May 27, 2026
NCP-AAI Part 1 Exam Prep — Simple LLM Agent Systems: Full Study Guide
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Understanding the planning of LLM agents: A survey
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Sec. 1 — Foundations and Responsible AI
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 2 — Course Preamble: Foundations and Responsible AI
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 3 — Simple LLM Agent Systems
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 4 — Notebook 1: Making A Simple Agent
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 5 — Basics of CrewAI
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 6 — Limitations of LLM
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 7 — Control, Structure, and Tooling
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 8 — Structuring Outputs
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 9 — Tooling Your LLMs
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 10 — Server-Side Tooling
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 11 — Caching and Retrieval
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 12 — Data Flywheel
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 13 — Notebook 2: Structuring Thoughts and Outputs
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
Sec. 14 — Notebook 2t: Tooling-Enabled LLM Systems
agentic-ai
llm
responsible-ai
crewai
structured-output
multi-agent
tool-calling
May 27, 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
reinforcement-learning
llm
deep-learning
chain-of-thought
grpo
reward-modeling
reasoning
fine-tuning
May 27, 2026
Ch. 1 — Introduction
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 2 — DeepSeek-R1-Zero
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 3 — DeepSeek-R1
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 4 — Experiment
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 5 — Ethics and Safety Statement
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 6 — Conclusion, Limitation, and Future Work
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 7 — Author List
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 8 — Background
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 9 — Training Details
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 10 — Self-Evolution of DeepSeek-R1-Zero
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 11 — Evaluation of DeepSeek-R1
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 12 — More Analysis
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 13 — DeepSeek-R1 Distillation
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 14 — Discussion
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 15 — Related Work
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 16 — Open Weights, Code, and Data
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Ch. 17 — Evaluation Prompts and Settings
reinforcement-learning
grpo
knowledge-distillation
chain-of-thought
reasoning
llm
May 27, 2026
Training Compute-Optimal Large Language Models
llm
transformer
deep-learning
distributed-training
mixed-precision
May 27, 2026
Ch. 1 — Introduction
llm
transformer
deep-learning
May 27, 2026
Ch. 2 — Related Work
llm
transformer
deep-learning
May 27, 2026
Ch. 3 — Estimating Optimal Parameter/Token Allocation
llm
transformer
deep-learning
May 27, 2026
Ch. 4 — Chinchilla Training and Results
llm
transformer
deep-learning
distributed-training
mixed-precision
May 27, 2026
Ch. 5 — Discussion and Conclusion
llm
transformer
deep-learning
May 27, 2026
Ch. 6 — Appendices (A–J)
llm
transformer
deep-learning
distributed-training
May 27, 2026
KV Caching Explained: Optimizing Transformer Inference Efficiency
kv-cache
inference-optimization
transformer
self-attention
inference-latency
context-window
llm
May 27, 2026
Optimizing Inference for Long Context and Large Batch Sizes with NVFP4 KV Cache
kv-cache
quantization
inference-latency
time-to-first-token
serving-throughput
gpu-memory-bandwidth
tensorrt-llm
llm
inference-optimization
mixture-of-experts
May 27, 2026
Sec. 1 — Core Machine Learning and AI Knowledge
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 2 — Data Analysis
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 3 — Experimentation
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 4 — LLM training, customization, and inference
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 5 — Mastering LLM Techniques: Customization
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 6 — Mastering LLM Techniques: Inference Optimization
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 7 — Software Development
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 8 — RAG
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 9 — Trustworthy AI
llm
transformer
peft
inference-optimization
rag
guardrails
trustworthy-ai
rlhf
May 27, 2026
Sec. 1 — CLIP: Connecting text and images
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 2 — Data Visualization
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 3 — Essential chart types for data visualization
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 4 — 7 Ways to Handle Missing Values in Machine Learning
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 5 — Guide To Data Cleaning
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 6 — A Complete Guide to Data Augmentation
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 7 — Basics of Speech Recognition and Customization of Riva ASR
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 8 — The effects of racially biased AI
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Sec. 9 — GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium
deep-learning
transformer
trustworthy-ai
neural-network
llm
May 27, 2026
Software Development
deep-learning
transformer
gpu-acceleration
llm
fine-tuning
trustworthy-ai
May 27, 2026
Ch. 1 — Document Overview
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 2 — What Is Agentic AI
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 3 — Agent Principles and Characteristics
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 4 — Agent Architecture Components
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 5 — Types of AI Agents
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 6 — Responsible AI Principles
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 7 — Resources and References
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 8 — Practice Questions
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 9 — Answer Key
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 10 — One-Page Quick Reference Summary
agentic-ai
agent-architecture
perceive-reason-act
responsible-ai
llm
rag
data-flywheel
guardrails
May 27, 2026
Ch. 1 — NVIDIA DLI: Building Agentic AI Applications with LLMs
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 2 — 0: Foundational Concepts
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 3 — Key Concepts and Principles
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 4 — Self-Assessment Checklist
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 5 — Practice Questions
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 6 — Answer Key with Explanations
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 7 — Quick Reference Summary
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 8 — Study Tips and Exam Strategy
agentic-ai
react-loop
structured-output
data-flywheel
guardrails
canvasing
llm
prompt-engineering
May 27, 2026
Ch. 1 — Document Overview
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 2 — Deep Learning and Function Approximation
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 3 — LLM Architecture
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 4 — LLMs as Semantic Reasoners
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 5 — Persona Agents and Chat Systems
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 6 — CrewAI Framework
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 7 — LLM Limitations and Context Management
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 8 — Practice Questions
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 9 — Answer Key
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Ch. 10 — One-Page Quick Reference Summary
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
May 27, 2026
Harness, Scaffold, and the AI Agent Terms Worth Getting Right
agentic-ai
agent-architecture
multi-agent
tool-calling
react-loop
memory-augmentation
llm-orchestration
reinforcement-learning
reward-modeling
grpo
llm
perceive-reason-act
May 27, 2026
Building Autonomous AI with NVIDIA Agentic NeMo
agentic-ai
llm
rag
guardrails
agent-architecture
tool-calling
inference-optimization
llm-orchestration
state-management
lora
perceive-reason-act
nvidia-nemo
memory-augmentation
deployment-scaling
May 27, 2026
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
agentic-ai
rag
nvidia-nim
nvidia-nemo
nvidia-blueprints
langgraph
llm
tool-calling
memory-augmentation
agent-architecture
embedding
llm-orchestration
observability
May 27, 2026
What are AI Agents?
agentic-ai
multi-agent
agent-architecture
llm
rag
guardrails
llm-orchestration
tool-calling
chain-of-thought
react-loop
human-in-the-loop
May 27, 2026
An Introduction to Large Language Models: Prompt Engineering and P-Tuning
llm
prompt-engineering
chain-of-thought
reasoning
fine-tuning
peft
nvidia-nemo
zero-shot-learning
May 27, 2026
Design Considerations of Advanced Agentic AI for Real-World Applications
agentic-ai
multi-agent
agent-architecture
state-management
langchain
tool-calling
llm-orchestration
react-loop
memory-augmentation
vector-search
embedding
llm
May 27, 2026
An Introduction to Large Language Models: Prompt Engineering and P-Tuning (Cognition)
llm
prompt-engineering
chain-of-thought
reasoning
fine-tuning
peft
nvidia-nemo
zero-shot-learning
May 27, 2026
Are Large Language Models In-Context Graph Learners?
llm
rag
prompt-engineering
zero-shot-learning
chain-of-thought
fine-tuning
May 27, 2026
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
llm
survey
memory-augmentation
rag
kv-cache
context-window
agentic-ai
May 27, 2026
Jamba 1.5 LLMs Leverage Hybrid Architecture to Deliver Superior Reasoning and Long Context Handling
llm
transformer
mixture-of-experts
rag
context-window
tool-calling
May 27, 2026
Ch. 1 — Introduction
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 2 — Taxonomy
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 3 — Task Decomposition
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 4 — Multi-Plan Selection
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 5 — External Planner-Aided Planning
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 6 — Reflection and Refinement
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 7 — Memory-Augmented Planning
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 8 — Evaluation
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Ch. 9 — Conclusions and Future Directions
llm
agent-architecture
memory-augmentation
rag
peft
hallucination
survey
react-loop
May 27, 2026
Performance Tuning Guide — Megatron-Bridge LLM Training (Deployment and Scaling)
distributed-training
mixed-precision
quantization
nvidia-nemo
llm
deployment-scaling
May 27, 2026
Measure and Improve AI Workload Performance with NVIDIA DGX Cloud Benchmarking
deployment-scaling
inference-optimization
nvidia-nemo
mixed-precision
gpu-acceleration
llm
distributed-training
quantization
May 27, 2026
Performance Analysis — TensorRT LLM
inference-optimization
deployment-scaling
observability
cuda
gpu-acceleration
llm
mixture-of-experts
May 27, 2026
Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes
deployment-scaling
inference-optimization
llm
gpu-acceleration
observability
multi-tenancy
kv-cache
May 27, 2026
AI Agents in Production: Observability & Evaluation
agent-evaluation
observability
llmops
agentic-ai
llm-as-a-judge
rag
human-in-the-loop
llm
May 27, 2026
Data Flywheel: What It Is and How It Works
data-flywheel
fine-tuning
lora
peft
llm
rag
guardrails
nvidia-nemo
nvidia-nim
nvidia-blueprints
agent-evaluation
agentic-ai
human-in-the-loop
knowledge-distillation
llmops
May 27, 2026
Successful Agentic AI: Model Logic, Data Considerations and Manpower
agentic-ai
llm
rag
guardrails
responsible-ai
hallucination
multi-agent
data-preprocessing
May 27, 2026
Chain of Thought Prompting Explained (with examples)
chain-of-thought
prompt-engineering
llm
reasoning
langchain
May 27, 2026
Human in the Loop AI: Keeping AI Aligned with Human Values
human-in-the-loop
responsible-ai
trustworthy-ai
agentic-ai
llm
rlhf
May 27, 2026
Understanding Why AI Guardrails Are Necessary: Ensuring Ethical and Responsible AI Use
guardrails
responsible-ai
trustworthy-ai
hallucination
llm
rag
prompt-engineering
May 27, 2026
How to Make Your LLM More Accurate with RAG & Fine-Tuning
rag
fine-tuning
llm
vector-search
embedding
langchain
lora
hallucination
peft
May 27, 2026
Performance Tuning Guide — Megatron-Bridge LLM Training
distributed-training
mixed-precision
quantization
nvidia-nemo
llm
deployment-scaling
inference-optimization
May 27, 2026
Measure and Improve AI Workload Performance with NVIDIA DGX Cloud Benchmarking (NVIDIA Platform)
deployment-scaling
inference-optimization
nvidia-nemo
mixed-precision
gpu-acceleration
llm
distributed-training
quantization
May 27, 2026
Performance Analysis — TensorRT LLM (NVIDIA Platform)
inference-optimization
deployment-scaling
observability
cuda
gpu-acceleration
llm
mixture-of-experts
May 27, 2026
Scaling LLMs with NVIDIA Triton and TensorRT-LLM Using Kubernetes (NVIDIA Platform)
deployment-scaling
inference-optimization
llm
gpu-acceleration
observability
multi-tenancy
kv-cache
May 27, 2026
Agentic or Tool use
agentic-ai
agent-evaluation
tool-calling
rag
llm
May 27, 2026
Building Safer LLM Apps with LangChain Templates and NVIDIA NeMo Guardrails
llm
langchain
nvidia-nemo
rag
guardrails
hallucination
responsible-ai
May 27, 2026
Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails
llm
guardrails
nvidia-nim
nvidia-nemo
langchain
trustworthy-ai
May 27, 2026
Document Overview
agentic-ai
llm
crewai
context-window
deep-learning
multi-agent
tool-calling