Recovering Anthony Bourdain's (really) lost Li.st's

https://news.ycombinator.com/rss Hits: 24
Summary

Loved reading through GReg TeChnoLogY's "Anthony Bourdain's Lost Li.st's", and seeing the list of lost Anthony Bourdain li.st's made me wonder whether we could recover at least some of them. I have worked in the security and crawling space for the majority of my career (I have neither the access nor the permission to use the proprietary storages), so I thought we might be able to find something in publicly available crawl archives.

Common Crawl

If Internet Archive had the partial list that Greg published, what about the Common Crawl? Reading through their documentation, it seems straightforward enough to get the prefix index for Tony's lists and grep for any sub-paths.

Putting something together with the help of Claude to prove my theory, we have commoncrawl_search.py, which makes a single index request to a specific dataset and, if any hits are discovered, retrieves them from the public S3 bucket. Since they are small, straight-up HTML documents, this seems even more feasible than I had initially thought.

Simply have a Python version around 3.14.2 and install the dependencies from requirements.txt. Run the below and we are in business. Now, below, you'll find the command I ran and then some manual archeological effort to prettify the findings.

NOTE Images have been lost. Other avenues struck no luck. I'll try again later.

Any and all emphasis, missing punctuation, and cool grammar is all by Anthony Bourdain. The only modifications I have made are to the layout, to represent li.st as closely as possible, with no changes to the content.

NOTE If you see these blocks, that's me commenting if pictures have been lost.

Recovering what we lost

From Greg's page, let's go and try each entry one by one. I'll put up the table of what I wasn't able to find in Common Crawl but would assume exists elsewhere; I'd be happy to take another look.

And no, none of the above has been written by AI, only the code, since I don't really care about warcio encoding or writing the same Python requests method for the Nth time. Enjoy!
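For readers who want to see the shape of the two steps described above (one prefix query against a crawl's index, then a ranged download of each hit), here is a minimal sketch. It is not the author's commoncrawl_search.py, which is not reproduced here. The function names are made up for illustration, the crawl ID in the usage note is a placeholder, and it fetches over the HTTPS front end at data.commoncrawl.org rather than talking to the S3 bucket directly. It assumes the index returns one JSON object per line with `filename`, `offset` and `length` fields.

```python
import gzip
import json
import urllib.request
from urllib.parse import urlencode

INDEX_HOST = "https://index.commoncrawl.org"
DATA_HOST = "https://data.commoncrawl.org"  # HTTPS access to the public bucket


def build_index_url(crawl_id, url_prefix):
    """Build a prefix query URL for one crawl's index (hypothetical helper)."""
    # A trailing "*" asks the index for every capture under the prefix.
    query = urlencode({"url": url_prefix.rstrip("*") + "*", "output": "json"})
    return f"{INDEX_HOST}/{crawl_id}-index?{query}"


def parse_index_lines(body):
    """The index answers with one JSON object per line; skip blank lines."""
    return [json.loads(line) for line in body.splitlines() if line.strip()]


def warc_range_request(record):
    """Turn an index record into the URL and Range header for its WARC slice."""
    offset = int(record["offset"])
    length = int(record["length"])
    url = f"{DATA_HOST}/{record['filename']}"
    # HTTP byte ranges are inclusive on both ends.
    headers = {"Range": f"bytes={offset}-{offset + length - 1}"}
    return url, headers


def fetch_record(record):
    """Download one capture and return the decompressed WARC record bytes."""
    url, headers = warc_range_request(record)
    request = urllib.request.Request(url, headers=headers)
    with urllib.request.urlopen(request, timeout=60) as response:
        # Each capture is stored as its own gzip member, so the slice
        # decompresses on its own.
        return gzip.decompress(response.read())
```

To use it, pass a real crawl ID in place of a placeholder such as `CC-MAIN-0000-00` to `build_index_url`, fetch that URL, feed the response text to `parse_index_lines`, and call `fetch_record` on each hit. The returned bytes still contain the WARC and HTTP headers ahead of the HTML, so a WARC parser such as warcio is the more robust choice for anything beyond a quick look.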
Things I No L...

First seen: 2025-12-13 21:52

Last seen: 2025-12-14 20:55