Automate Your City Data with Python
December 14, 2023
18 min
Free
django
python
data-automation
pdf-processing
python-data-tooling
civic-data
api-interaction
etl
datasette
ocr
tesseract
web-scraping
github-actions
Description
In this talk, Philip James demonstrates how to automate the extraction, transformation, and loading (ETL) of city data using Python. He explains how to access data from various civic government websites, process PDFs through OCR to convert them into text, and load this structured data into a searchable database using Datasette. The presentation covers techniques for handling data from different formats and sources, making civic decisions more transparent and accessible.