Writing Extractors¶
Define parsing rules to extract structured data from HTML:
- Use CSS selectors and patterns to capture fields
- Normalize and clean text
- Emit JSON for downstream processing
Integrate with parser::extractors and parser::schema for reusable rulesets.