Skip to content

Writing Extractors

Define parsing rules to extract structured data from HTML:

  • Use CSS selectors and patterns to capture fields
  • Normalize and clean text
  • Emit JSON for downstream processing

Integrate with parser::extractors and parser::schema for reusable rulesets.