TXTRNZ is a text-only site which serves news items from the RNZ website.
Built with Python and GitHub Actions, this web app scrapes and builds static versions of RNZ news articles every 6 hours. As well as stripping non-text media, it uses system fonts to prioritise speed and accessibility for the end user. Should users wish to view the full original article, there are links at the top of every page that takes them to the source URL as well.
This was my first major foray into using the BeautifulSoup Python package and web scraping in general. My knowledge of Python is limited, so I hope to continue improving this site over time.
I would like thank RNZ for allowing me to be able to scrape their site freely without limiting or blocking my connections.