Dmoz-tddli.rar May 2026

“Getting a website listed in DMOZ can be very frustrating... but being listed will probably help our Google rankings.” WebWorkshop

URL Classification Dataset [DMOZ] - Kaggle

Highly recommended for researchers looking to train text-classification models or explore the historical structure of the early-to-mid-2000s internet.

Community Perspectives

“DMOZ — the Open Directory Project — officially closed today. It marks the end of an era of humans trying to catalog the entire web.” Search Engine Land · 9 years ago

Unlike machine-generated lists, DMOZ data was curated by over 90,000 volunteer editors, making the classifications highly accurate for its time.

The data includes deep taxonomic paths (e.g., Science/Technology/Space), which make it well suited to testing multi-level classification algorithms.

Weaknesses: