I need the most recent alexa 1m database in csv and xml format with the following criteria:
Columns:
- Alexa rank
- Url
- Url shortened to just "[login to view URL]" ex [login to view URL]
- category tree separated into individual columns for each sub category
- Meta title
- Meta description
- Foreign sites removed, so if the sitenames, titles, or description are not in english then the whole row should be removed.
If the categories cannot be pulled from alexa, then they should be pulled from dmoz and still fulfill the above requirements.