- Product extraction: Get prices, names, ratings, and availability
- Content scraping: Extract articles, reviews, or listings
- Data collection: Structured data from tables, lists, or complex layouts
- API-like responses: Transform HTML into clean JSON objects
Two modes
Parsing Schema offers two approaches to extract structured data:- Schema Mode
- LLM Mode
Schema Mode
Define exact data structure using parsers and CSS selectors for predictable, low-cost extraction. Best for:- Stable page structures
- Cost-sensitive applications (lower pricing)
- Precise control over data extraction
- High-volume, consistent extraction
- Create a parser with CSS selectors mapping to your data model
- Define data types and structure
- Nimble extracts data exactly as specified
Schema Mode
Precise extraction with CSS selectors and parsers
LLM Mode
Describe what data you need in natural language or provide a loose schema. AI dynamically analyzes each page to extract and structure information. Best for:- Pages with varying structures
- Quick setup without selector maintenance
- Complex extraction requirements
- Adaptive data extraction
- Define a data model (optional) or describe what to extract
- AI analyzes the page and identifies relevant data
- Data is extracted and structured automatically
LLM Mode
AI-powered extraction with natural language prompts
Quick comparison
| Feature | Schema Mode | LLM Mode |
|---|---|---|
| Setup | Manual parser creation | Natural language or loose schema |
| Cost | Lower (driver only) | Higher (driver + tokens) |
| Accuracy | Precise (if maintained) | Adaptive |
| Maintenance | Update when page changes | Self-healing |
| Speed | Faster (no inference) | Slower (LLM processing) |
| Control | Full control over selectors | AI-determined extraction |
| Best for | Stable pages, cost control | Varying pages, quick setup |
Both modes extract data within the same
extract method call. Choose the approach that matches your needs for cost, control, and maintenance.Choosing the right mode
Use Schema Mode when:- You need low-cost, predictable extraction
- Page structure is stable and CSS selectors work reliably
- You want complete control over what gets extracted
- Processing high volumes with consistent structure
- You want zero-setup, hands-free extraction
- Pages have varying structures or layouts
- Maintaining CSS selectors is challenging
- Higher cost is acceptable for reduced maintenance

