Taxonomy fun facts (as of today!)

Taxonomy DNA for The New York Times – Tech section

In these 2 recent posts, we announced our AI-operated Taxonomy…

Introducing Taxonomy DNA

Taxonomy DNA (cont.) – comparing a specialist vs a generalist

… time now to share some fun facts about it:

10,000,000 words

is the dictionary of words used for the qualification of our taxonomy classifications. Those words were precisely selected to be meaningful for each of our taxonomy classifications (leaves).

100,000 new article abstracts collected daily.

Every day, 100k article abstracts are collected. This number should grow to 1 million a day within 3 months.

75,000 operations per article

… to classify within our taxonomy every single article for every single day for every single feed for every single media.

8 Billions classification operations daily

This is growing daily and should reach 50 to 70B shortly.

Allowing for sophisticated Taxonomy classifications filters.

Thereafter is an example of how to filter classifications and depth of specialization per classification (we’ll dig into this more in a coming post) for your corpus:

Corpus creation and maintenance (may change)

Of course, should you have questions, let us know!

Share this post via:

Published by

Freddy Mini

CEO & Co-founder