WebScrapThings is a collection of Python bots used by the ParalegApp system to automatically monitor judicial case updates in Chilean courts. Each bot targets a different court portal, parsing case data and extracting changes to keep lawyers informed.
Legal professionals typically need to check court websites manually for updates. WebScrapThings automates this:
- Contains multiple scraper scripts, each configured for a specific court portal.
- Periodically fetches current case data.
- Compares with previous versions to identify changes (rulings, new filings, status updates).
- Stores changes for ParalegApp to generate alerts and notifications.
- Modular scraper for multiple court websites
- Site-specific parsing using Requests and BeautifulSoup
- Detects new or updated entries efficiently
- Central storage of scraped data for downstream processing
- Each script reads a list of cases and relevant URLs
- Executes periodic HTTP requests to fetch case pages
- Parses HTML to extract relevant content
- Compares the parsed content against the versions stored in the ParalegApp database
- Flags and logs differences for notification workflows
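
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code of any actual bot in this repository: the function names, the `table.movements` selector, and the idea of storing SHA-256 fingerprints of entries are all assumptions made for the example. Each real court portal has its own HTML layout, and the real scripts compare against ParalegApp's Django models rather than an in-memory set.

```python
import hashlib
import logging

import requests
from bs4 import BeautifulSoup

logger = logging.getLogger("webscrapthings.sketch")  # hypothetical logger name


def fetch_case_page(url, timeout=30):
    """Download one case page and return its HTML text."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # fail loudly on 4xx/5xx responses
    return response.text


def parse_entries(html):
    """Return one text entry per data row of the (assumed) movements table."""
    soup = BeautifulSoup(html, "html.parser")
    entries = []
    for row in soup.select("table.movements tr"):  # selector is an assumption
        cells = [cell.get_text(strip=True) for cell in row.find_all("td")]
        if cells:  # header rows hold only <th> cells, so they are skipped
            entries.append(" | ".join(cells))
    return entries


def fingerprint(entry):
    """Stable hash of an entry, so only fingerprints need to be stored."""
    return hashlib.sha256(entry.encode("utf-8")).hexdigest()


def find_new_entries(current_entries, stored_fingerprints):
    """Return entries whose fingerprint has not been seen before."""
    return [
        entry
        for entry in current_entries
        if fingerprint(entry) not in stored_fingerprints
    ]


def check_case(case_id, url, stored_fingerprints):
    """Fetch, parse, compare, and log differences for a single case."""
    entries = parse_entries(fetch_case_page(url))
    new_entries = find_new_entries(entries, stored_fingerprints)
    for entry in new_entries:
        logger.info("Case %s changed: %s", case_id, entry)
    return new_entries
```

With this approach an entry whose text was edited on the portal also shows up as new, because its fingerprint no longer matches the stored one. In a real bot, `stored_fingerprints` would be loaded from and written back to the database between runs.
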
- Python 3
- Requests / BeautifulSoup
- Integrated with ParalegApp’s Django models
- Supports cron or scheduler execution
- Multiple scrapers operational for different court systems
- Reliable pattern detection of case changes
- Regular maintenance required due to site structure updates
Developed by @pepelisto