In my pet project eurlex.nu I find a lot of weird stuff when scraping documents from the official website eur-lex.europa.eu. The most recent specimen - Final adoption of amending budget No 4 of the European Union for the financial year 2008 - has the publish date 80/80/2200. That’s almost two hundred years into the future with an invalid day/month combo on top. This leads me to believe that the system is in such a broken state that even simple date validation isn’t implemented.

Someone delivered a really poor software project for our tax money. I would love to redo the european legal information website with proper standards (e.g. validating HTML, RDF and proper semantics).

Oh well…