Project Description
An open-source Python crawler script that can automatically extract data from HTML pages based on machine learning. After providing the crawler with output results as examples, it will automatically extract rules and crawl page data without specifying CSS selectors.