AI-powered classifications vs Keywords. Part 1/2: Editorial Orientations detection. [updated]

Going beyond a 1-dimensional access to knowledge.

For years, access to knowledge has been ruled by keyword presence: search engines, corpus selection for business intelligence, DSPs for online advertising, Brand Safety, watch alerts…

It is all about the presence or absence of keywords to trigger the selection of content: a 1-dimensional, keyword-based access to knowledge. Linear. Limited to 0 (absent) or 1 (present).

Keyword presence does not sense the angles, subtleties and orientations taken by the author. Nor is it sensitive to time: for a keyword, today’s meaning is the same as on any other day.

For example, the presence of “Christmas gift” might be “ok”, but what if it appears in a context of “Military defense” and “Weapon”? Can you maintain queries that exclude every related, always-evolving dictionary of synonyms and still be sure your brand won’t be exposed?

After all, a word can have several meanings depending on its context and the time at which it is read. AI-Powered Classifications are the solution:

AI-Powered Classifications add 2 more dimensions: editorial orientations and timing context.

Today, we will focus on editorial orientations detection.

Next week, we will explain the sensitivity to the time of publication.

[update] The second part is now published:

AI-powered classifications vs Keywords. Part 2/2: Evolution over time.

Both AI-Powered Classifications and keyword-based selections are unbiased, universal and up-to-date. Because TrustedOut is AI-powered, our machine learning guarantees the same non-human, machine-powered benefits.

Editorial Orientations Example:
1 event, 2 countries, 3 articles, 10 classifications.

The event: let’s take North Korea announcing a special Gift to the US.

The 2 countries: We then selected 3 articles from a Google Search on “North Korea Gift” for the US and “Coree du Nord Cadeau” for France.

The 3 articles: we randomly picked USAToday, CBSNews and Le Figaro.

Here are the top 10 classifications TrustedOut came up with. For each, we’ve added how the media is rated for its Political Orientations (beta).

USAToday

Vase or missiles? US awaits Christmas ‘gift’ from North Korea’s Kim

1 General › Politics › Diplomacy
2 General › Politics › International
3 Industries › Aerospace And Defense › Weapon
4 General › Politics › Military Defense
5 General › Politics › Civil Defense
6 Industries › Energy › Nuclear Power
7 Industries › Aerospace And Defense › Naval System
8 General › Politics › Administration
9 Industries › Aerospace And Defense › Aerospace Systems
10 General › Politics › Government

CBSNews

No sign of “Christmas gift” from North Korea yet, but deadline looms

1 General › Politics › Military Defense
2 Industries › Aerospace And Defense › Weapon
3 General › Politics › Diplomacy
4 General › Politics › International
5 Industries › Aerospace And Defense › Naval System
6 Industries › Aerospace And Defense › Aerospace Systems
7 Industries › Aerospace And Defense › Missiles And Rockets
8 Industries › Energy › Nuclear Power
9 Industries › Aerospace And Defense › Satellite
10 Industries › Transportation › Ship

Le Figaro

Trump is hoping for a “nice vase” instead of a North Korean missile for Christmas. (Trump espère un «beau vase» au lieu d’un missile nord-coréen pour Noël)

1 General › Politics › Diplomacy
2 Industries › Aerospace And Defense › Weapon
3 Industries › Aerospace And Defense › Aerospace Systems
4 Industries › Aerospace And Defense › Missiles And Rockets
5 General › Politics › International
6 General › Politics › Military Defense
7 People › Society › Opinion And Idea
8 Industries › Aerospace And Defense › Satellite
9 General › Law › International
10 Industries › Aerospace And Defense › Aircraft

Editorial Angles

Here’s a summary of the classifications for the 3 articles:

A few remarks:

  • The top classification for USAToday and Le Figaro is Diplomacy. For CBSNews, it is Military Defense.

  • The 2 US articles have the same top 4 (in a different order).

  • Le Figaro does not have Nuclear Power in its Top 10

  • All have Military Defense. Only USAToday has Civil Defense

  • All have Aerospace and Defense > Weapon in their top 3

  • Only Le Figaro has Society > Opinion and Idea and Law > International in its top 10

  • For Industries › Aerospace And Defense, USAToday has 3, CBSNews has 5 and Le Figaro has 5 in their Top 10.
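These remarks can be checked directly against the three top 10 lists above. The following is only a minimal sketch: the labels are copied from the lists in this post, while the helper name `family` and the variable names are ours for illustration and are not part of TrustedOut.

```python
# Top 10 classifications per article, copied from the lists in this post.
TOP10 = {
    "USAToday": [
        "General › Politics › Diplomacy",
        "General › Politics › International",
        "Industries › Aerospace And Defense › Weapon",
        "General › Politics › Military Defense",
        "General › Politics › Civil Defense",
        "Industries › Energy › Nuclear Power",
        "Industries › Aerospace And Defense › Naval System",
        "General › Politics › Administration",
        "Industries › Aerospace And Defense › Aerospace Systems",
        "General › Politics › Government",
    ],
    "CBSNews": [
        "General › Politics › Military Defense",
        "Industries › Aerospace And Defense › Weapon",
        "General › Politics › Diplomacy",
        "General › Politics › International",
        "Industries › Aerospace And Defense › Naval System",
        "Industries › Aerospace And Defense › Aerospace Systems",
        "Industries › Aerospace And Defense › Missiles And Rockets",
        "Industries › Energy › Nuclear Power",
        "Industries › Aerospace And Defense › Satellite",
        "Industries › Transportation › Ship",
    ],
    "Le Figaro": [
        "General › Politics › Diplomacy",
        "Industries › Aerospace And Defense › Weapon",
        "Industries › Aerospace And Defense › Aerospace Systems",
        "Industries › Aerospace And Defense › Missiles And Rockets",
        "General › Politics › International",
        "General › Politics › Military Defense",
        "People › Society › Opinion And Idea",
        "Industries › Aerospace And Defense › Satellite",
        "General › Law › International",
        "Industries › Aerospace And Defense › Aircraft",
    ],
}


def family(label, depth=2):
    """Return the first `depth` levels of a classification path."""
    return " › ".join(label.split(" › ")[:depth])


# How many of each top 10 fall under Industries › Aerospace And Defense.
aero = {
    name: sum(family(c) == "Industries › Aerospace And Defense" for c in labels)
    for name, labels in TOP10.items()
}

# Classifications present in all three top 10 lists.
shared = set.intersection(*(set(labels) for labels in TOP10.values()))

# Classifications that only Le Figaro has.
only_figaro = set(TOP10["Le Figaro"]) - set(TOP10["USAToday"]) - set(TOP10["CBSNews"])

# The first 4 classifications of the two US articles, ignoring order.
same_us_top4 = set(TOP10["USAToday"][:4]) == set(TOP10["CBSNews"][:4])

print(aero, sorted(shared), sorted(only_figaro), same_us_top4)
```

A keyword query on “Christmas gift” would return all three articles as identical matches. The comparison above is what the classifications add.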

Here’s how TrustedOut saw the Aerospace and Defense Industry, back in October:

Corpus Intelligence for an Industry: Aerospace & Defense – October 2019

Next: Evolution over time.

How AI-Powered Classifications are sensitive to the time of publication: classifications evolve over time as our “bags of words” are permanently updated, and why it matters… Continue to part 2/2

Questions? Ask!

AI-powered classifications vs Keywords. Part 2/2: Evolution over time.

For content selection: AI-powered classifications can sense Editorial Orientations AND Evolution over time. Keywords cannot.

For years, access to knowledge was all about the presence or absence of keywords to trigger the selection of content: a 1-dimensional, keyword-based access to knowledge. Linear. Limited to 0 (absent) or 1 (present).

Last week, we covered the first advantage of AI-Powered Classifications over keyword-based selection, Editorial Orientations, and showed how the same event, covered in 3 different publications, can have different Editorial Orientations.

This is an additional dimension to access knowledge.

Let’s now have a look at a 3rd dimension: Sensitivity over time.

Perception of an event evolves with time, so do our AI-Powered classifications.

France has been through a lot of social movements with the pension reform the French government is pushing for.

From the beginning of the protests until now, the perception has evolved.

Let’s look at the same article and how AI classifies it at two different times.

This article was published on Dec 10th 2019:

Pension reform: “Talking only about fragmented careers would be a misdiagnosis.” (Réforme des retraites: “Ne parler que de parcours hachés serait une erreur de diagnostic”)

On Dec 10th, top classification was:

We are at the beginning of the movement, and Employment and Unemployment is the top classification.

On Dec 31st, top classifications are now:

3 weeks later, the very same article with the very same content is classified as Senior first, then Social Assistance and, now in 3rd place, Employment and Unemployment.

Clearly, after 3 weeks of protests, the Aging and Social dimensions now rank above the Employment dimension.

How can AI-Powered Classification do this?

In a previous post, we explained how our AI worked:

How our AI-powered classification works.

Every new article is classified as follows:

This means that on the day an article is published, we use the Classification Datasets (aka bags of words) as they stand on that very day.

Classification Datasets are also updated to stay in sync with every single classification and to sense the depth of expertise over time. Words can therefore move in and out of a dataset, and carry a different weight, over time. Classifications are set, by default, for the day an article is published, but they can be re-run on a different day and produce a different result. As in real life, your perception of something evolves with time.
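To make the idea concrete, here is a toy sketch of dated, weighted bags of words. It is not TrustedOut's actual model: the words, the weights, the `DATASETS` structure and the `classify` function are all assumptions made up for illustration. Only the classification names and the two dates come from the example above.

```python
from datetime import date

# Hypothetical dated datasets: for each day, classification -> {word: weight}.
# Words and weights are invented; they only illustrate weights shifting over time.
DATASETS = {
    date(2019, 12, 10): {
        "Employment And Unemployment": {"pension": 1.0, "career": 2.0, "strike": 1.0},
        "Senior": {"pension": 1.0, "retirement": 1.0},
    },
    date(2019, 12, 31): {
        "Employment And Unemployment": {"pension": 0.5, "career": 1.0, "strike": 0.5},
        "Senior": {"pension": 2.0, "retirement": 2.0},
    },
}


def classify(text, on):
    """Rank classifications for `text` using the dataset as it stood on day `on`."""
    words = text.lower().split()
    scores = {
        label: sum(weights.get(word, 0.0) for word in words)
        for label, weights in DATASETS[on].items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# The very same text, classified with the datasets of two different days.
article = "pension reform career retirement"
at_publication = classify(article, on=date(2019, 12, 10))
three_weeks_later = classify(article, on=date(2019, 12, 31))
print(at_publication[0], "->", three_weeks_later[0])
```

The text never changes between the two calls. Only the dataset does, which is why the ranking moves.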

Why it matters.

Simply because time is a vital dimension of perception.

Relying simply on the presence of keywords to select content for analytics, to expose your brand via advertising, etc. is dangerous.

What’s true at publication time might not be true at analytics time, or advertising time…

In the example above, you may or may not want articles about “Seniors”. At publication time, the article was under the radar; 3 weeks later, it is classified as “Seniors”. Is this still where your brand wants to be exposed? Is this the content you want to analyze today? Do those articles matter for the education of your teams?

Relying on keywords, which stay in the content forever, not only fails to give you the orientation of the content but is also insensitive to the evolution of perception. And as we know, in Marketing:

Perception is reality.

Questions? Ask!

 

Listen and watch content you trust – TrustedOut and RSS Readers.

Read what’s happening in your Corpus.

Let’s say you’d like to listen to and watch Car Racing in Specialized sources in the USA.

Your Corpus query will look like this in TrustedOut:

Click on [Get] to have a look at those 21 feeds (sources) from 10 media

And download the OPML file of your Corpus

You will get this file (download it to play with it)
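If you would rather inspect the file before importing it, an OPML file is plain XML in which each feed is an `outline` element carrying an `xmlUrl` attribute. The sketch below uses only the Python standard library. The sample content and the `list_feeds` helper are placeholders we made up, not the actual TrustedOut export.

```python
import xml.etree.ElementTree as ET

# Made-up sample standing in for a downloaded Corpus OPML file.
SAMPLE_OPML = """<opml version="2.0">
  <head><title>Example corpus</title></head>
  <body>
    <outline text="Example Racing News" type="rss" xmlUrl="https://example.com/racing/feed.xml"/>
    <outline text="Example group">
      <outline text="Example Motorsport Blog" type="rss" xmlUrl="https://example.org/motorsport/rss"/>
    </outline>
  </body>
</opml>"""


def list_feeds(opml_text):
    """Return (title, feed URL) pairs for every outline that points to a feed."""
    root = ET.fromstring(opml_text)
    return [
        (outline.get("text"), outline.get("xmlUrl"))
        for outline in root.iter("outline")
        if outline.get("xmlUrl")  # grouping outlines have no feed URL
    ]


feeds = list_feeds(SAMPLE_OPML)
for title, url in feeds:
    print(title, url)
```

For a file on disk, `ET.parse(path).getroot()` can replace `ET.fromstring(...)`.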

Read your Corpus with your favorite RSS Reader

There are plenty of excellent RSS Readers. Here are 2 examples:

Example #1: Feedly

Find “Organize Sources” and click on “Import OPML”

Select the OPML file from above and enjoy reading…

Example #2: Inoreader

Once signed in, go to Subscriptions > Manage Subscription > Import/Export and select the OPML of your Corpus.

Enjoy reading…

Search within articles, alerts, newsletters…

Our 2 examples above both offer searches (Inoreader offers it with the free account), alerts and even team newsletters.

Get your selection of sources you trust with TrustedOut and enjoy reading, searching, alerting and spreading with your choice of RSS Reader…

Questions? Shoot!

 

 

Trusted Content as a Utility


Distrust in Media is a major, major issue.

Distrust in media is an issue everywhere. While trust in media in the US has stabilized at around 40% of people who trust it, it was in the 70% range in 1970. The situation is also getting worse in Europe, as this Liberation article says:

“While there is also a downward trend of between 2% and 4% in most European countries, France is experiencing the largest drop in confidence. Above all, with only 24% of French people trusting the media, the country is 37th out of 38, just ahead of South Korea (22%). By way of comparison, the confidence rate is 47% in Germany or 40% in the United Kingdom.”

No trust in content, No trust in decisions made from it.

Can you imagine betting your future, the future of your business, on content you do not trust?

Can you imagine displaying your brand, the brand you’ve spent years building respect and trust for, in environments you don’t know, that do not fit your brand’s values?

Can you imagine having your PR and watch teams listening to media without understanding the profile of those media?

It is the motto of TrustedOut, and the reason for our name: “If it’s not trusted in, it cannot be trusted out.”

Trusted Content should be like water or electricity: A Utility.

You need water. You open the faucet. You do not test the water. You trust it. You simply use it when you need it. Anytime. All the time.

You need electricity. You switch it on. You do not test the electricity. You simply use it when you need it. Anytime. All the time.

Imagine Trusted Content the same way.

You need trusted content. You open TrustedOut, define what you trust so you can trust the content you get. You simply use it when you need it. Anytime. All the time.

We now offer Unlimited access to TrustedOut, so access to content you trust is totally frictionless.

Define the content you trust for every segment of your business.

Below is an example of an Enterprise organized by Industry. The same applies to any other type of business organization.

Want to give it a try?
Contact us!

Should governments deal with fake news?

In this article, “Singapore just used its fake news law. Critics say it’s just what they feared“, CNN Business explains why the new anti-fake news law in Singapore produced what they feared most: “increased censorship and official overreach in a country where freedom of expression is already under pressure.”, adding: “This week’s events suggest those fears may be justified.”

“as required by Singaporean law.”

We won’t debate the two articles under the scrutiny of the Singapore government, but rather focus on one thing that is very important to us:

Censorship must be and remain personal.

It is always dangerous to leave it to someone else to decide what you can and cannot read.

CNN reports: “Government ministers can decide whether to order something deemed fake news to be taken down, or require a correction to be put up alongside it. They can also order companies such as Facebook (FB) and Google (GOOGL) — both of which opposed the bill — to block accounts or sites spreading false information.

The government can also prosecute individuals with fines of up to 50,000 Singapore dollars (about $36,000) and/or up to five years in prison. If the alleged falsehood is posted using “an inauthentic online account or controlled by a bot,” the potential fine rises to 100,000 Singapore dollars (around $73,000), and/or up to 10 years in prison.

Companies found guilty of spreading so-called fake news can face fines of up to 1 million Singapore dollars (roughly $735,000).”

Again, we at TrustedOut do not defend the spread of fake news or any offensive content, but we believe that, for the most part, news can be seen as fake by some people and not by others. Thus, censorship should be and remain personal.

Get information from Traditional Media, have conversations on Social Media. Not the other way around.

In a previous post, we wrote:
“Misinformation and biases infect social media, both intentionally and accidentally. This highly recommended article from The Conversation exposes 3 types of bias identified by Indiana University. Hereafter are our takeaways… Continue reading

Trust, Media and Democracy

Related to this matter, we also wrote on the excellent Knight Foundation Report.

The Aspen Institute and the Knight Foundation recently released a report on a commission they organized about Trust, Media and Democracy. While it comes from America, we believe most of it applies more widely.

If you don’t have the time for the lengthy report, this Medium page is very interesting. Here are our takeaways in the light of our previous posts, regrouped in 3 main categories:

10 ways to rebuild trust in media and democracy…  Continue reading


Talk the Google talk.

In this Wired article, Devin Nunes and the Power of Keyword Signaling, the author explains how political speeches can be tweaked to play with Search engines.

Talk Google.

Quotes using parts of those speeches will, to be accurate, reuse keywords optimized for search engines such as Google. Those keywords should prompt results where the political candidate and party rank better: higher, and on the first page.

Propaganda landing pages.

Those keywords, when searched in Google, will either be a competitor’s, where confusion can be introduced, or be usual in a given context, out of date, or simply fabricated. The more unique they are, the better SEO will operate. Because they are rare, those keywords, when searched, will return few, low-inventory pages, most of which the candidate’s crew will have prepared.

Talk the Google talk.

Better, words used in a speech will be seen as belonging to a specific wing. This way, the result is optimal:

  • Push the candidate
  • Push the point of the candidate with differentiation vs others
  • Push the party and augment bipartisanism

The cure: Watch who’s publishing.

Getting news from a search engine is very risky. The SEO techniques involved are explained above and in this previous post:

Keywords (Data) Voids: Misinformations via Google and Bing.

The solution is the same as for Businesses: Get content, and thus education, from Media you trust.

Your business is all about the content you trust.

Questions? Contact us!

 

Marketers must-haves: Media Sources reports and comparisons

Let’s say you are looking for a media outlet in “Eating and drinking”, in France, to associate your brand with.

1. Select potential media partners

France and French. Taxonomy is Eating and Drinking. We will select “Covered” for media covering this subject and Past month, for a more stable taxonomy than the last 7 days.
TrustedOut offers 40 media and 60 sources, 214 articles per day and 43,000 articles in the archive. Corpus creation looks like this:

Click on “Get” and we get Media and Sources.

A click on a media shows how the media is perceived for toxic contents and political orientations.

2. Diving into your selected two.

From the media list, you want more info on two: Le Figaro Gastronomie and Elle A Table.

A click on a source shows the trends in classifications [ 1 ], week vs month and month vs quarter, as well as the top classifications per period of time.

3. Get an instant report on a Media Source

For each source, click on the “Report” button, [ 2 ] in the screenshot above, and receive the report in a PDF format:

Le Figaro – Gastronomie
ELLE – ELLE A TABLE

Caution: The report timestamp is very important, as data are permanently updated.

4. Comparing 2 sources profiles

[beta]

By comparing profiles, you can define what is best for your operations.

Questions? Contact us!

 

Create a corpus from a list of articles (ex. here: popular on Facebook).

You want to create a Corpus of Media for your analytics and/or a whitelist of media similar to a list of articles?

Here is how Corpus Intelligence can help in 3 steps:

Step 1. Collect Materials.  

Let’s start with a list of popular articles: Today, the Top 15 of the most engaged articles on Facebook in Sept 2019

In this article you can find the following top 15 articles:

Table to show the top 15 web stories on Facebook in September 2019, ranked by engagement

Step 2. Understand Profiles

From the list above, we’ve collected the profiles of  the corresponding media.

Here are the top 30 most popular classifications from our 3-level AI-powered taxonomy.  

This means the top 5 most popular media types are:

  1. Politics
  2. Law
  3. Entertainment and Leisure
  4. Lifestyle
  5. Society 

Computing C.scores gives precisely the classifications to shoot for.

Step 3. Build and manage Corpuses.

The hardest part is done. Let’s play with TrustedOut now:
Want all media in Politics and Law? Here it is:
Want to target the dedicated media in the 1st classification, International, in Politics? Voilà:

From here, feed your analytics tool and/or create a whitelist for your DSP.

Questions? Contact us!

TrustedOut: Intelligence you trust. Brand Safety you trust. PR you trust.

TrustedOut helps your business all the way, all the time.

Let’s say your company, SmartBizFurnitures, builds and markets high-end, classy business furniture. SmartBizFurnitures believes Lifestyle applies to Business environments.

Sources you trust to get reliable intelligence.

You must trust the content you use to trust any decision you make.

SmartBizFurnitures asks TrustedOut for all media sources covering:
People › Lifestyle › Decoration And Design And Architecture

Because they are looking for insights from their intelligence tool (Digimind, Netvibes, etc…), they decide to:

  • Solely select the classification their business is in: People › Lifestyle › Decoration And Design And Architecture
  • Pick “covered” to get a large but relevant number of sources for their analytics tools.
  • Go with the shortest period of taxonomy, the rolling “Past week”, because they want to be news-sensitive.

TrustedOut finds 481 sources. Click on “Get”…

Intelligence tools ready.

… download the CSV file of those 481 sources and import them into your semantic/social intelligence tool. Your tool will continue to listen to all new articles from those sources. Should you need article abstracts from the past, ask us, as we archive everything we’ve computed.

This download of sources is manual, so your list of sources won’t be updated unless you regularly download it again and re-import it into your intelligence tool. To avoid this and always analyze the freshest, most up-to-date sources, select “Connect”.

Media you trust to keep your brand safe.

Here you want the widest AND most relevant (read: consistent) list of media for the safest whitelist to feed your ad server with.

This time, ask TrustedOut for the whole classification group, Lifestyle: we want the largest but still secure list of media who have published on Lifestyle in the past 7 days.

TrustedOut finds 3,717 media (and 10,037 sources)


Brand cannot be with “Politics”

SmartBizFurnitures does not want to be associated with “Politics”, so those 7% should disappear from our Corpus map.

Let’s remove the whole classification “Politics”… with the 4th line: Media covering Politics in the past 7 days should be removed from the list:


Of course, we have fewer media: 936 instead of 3,717, but it’s the choice made by SmartBizFurnitures to keep their brand safe and consistent.
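The logic behind this whitelist is a simple include/exclude rule: keep media covering Lifestyle, then drop any that also cover Politics. Here is a minimal sketch of that rule. The media names, the `COVERAGE` mapping and the `build_whitelist` function are invented for illustration; in practice the coverage data comes from TrustedOut.

```python
# Hypothetical media -> classification groups covered in the past 7 days.
COVERAGE = {
    "example-design-magazine": {"Lifestyle"},
    "example-daily": {"Lifestyle", "Politics", "Sports"},
    "example-home-blog": {"Lifestyle", "Entertainment And Leisure"},
    "example-political-review": {"Politics"},
}


def build_whitelist(coverage, include, exclude):
    """Keep media that cover `include` and do not cover `exclude`."""
    return sorted(
        media
        for media, groups in coverage.items()
        if include in groups and exclude not in groups
    )


lifestyle_only = build_whitelist(COVERAGE, include="Lifestyle", exclude="Politics")
print(lifestyle_only)
```

As in the Corpus above, the exclusion shrinks the list: any media covering both groups is dropped, which is the price of keeping the brand away from Politics.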

DSP ready.

Same as with Sources. Here you will download the Media list as CSV but, as your brand must remain safe throughout an ad campaign, you will want to “Connect” so that the list of media, i.e. the whitelist, is up to date at all times.

Articles you trust to build on your PR efforts.

For our reading, we want the latest articles from the most relevant sources.

We will ask TrustedOut for the same classification as with Analytics, People › Lifestyle › Decoration And Design And Architecture, but since here it’s not a machine analyzing but humans, likely a PR team or executives directly, we will select: Dedicated over the past 7 days.


100+ a day. 20,000+ archived.

On average, TrustedOut will provide 102 article abstracts a day and can provide more than 20,000 from archives upon request.

A click on the right “Get” gives us:


Spot on for your PR efforts.

Click on the article abstract to read the article. Very interesting for SmartBizFurnitures. PR effort right here.


Watch tool ready.

Simple. Download: RSS.

Use this RSS as any RSS. Corporate pages, alerts, newsletters…

For example, in Netvibes:


or in Feedly:


Your business is made of the content you trust.

Your business is all about the content you trust.

Contact us: contact@trustedout.com

Media Coverage Market Shares US vs France – Source TrustedOut – 10/01/19

What Media are covering the most.

In the US.

As of today and over the past 90 days, here are the top “covered” group classifications, meaning the percentage of media in America covering each group classification (like a countrywide newsstand shelf).

In bold are the differences above 5%, meaning the group classifications that are more covered in the US than in France.

The top 3 most covered topics are very different from France’s

US Media cover much more Entertainment & Leisure, Society and Education and, to a lesser degree, Sports, Lifestyle and Medicine & Health.

In France.

In bold are the differences above 5%, meaning the group classifications that are more covered in France than in the US.

Greater coverage in Economy & Enterprise and Tech.

French Media cover much more Economy & Enterprise and Tech, neither of which is even in the US Top 10.

Largest Media Coverage differences.

As above, we deliberately set the cutoff at 5%.
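The 5% rule amounts to subtracting one country's coverage share from the other's, group by group, and keeping only the gaps larger than the cutoff. The sketch below shows that computation. The percentages are invented placeholders, not TrustedOut figures, and `coverage_gaps` is a hypothetical helper name.

```python
# Invented coverage shares (% of media covering each group classification).
# These are placeholders for illustration, not TrustedOut data.
US = {"Entertainment And Leisure": 30.0, "Society": 22.0,
      "Economy And Enterprise": 8.0, "Sports": 12.0}
FR = {"Entertainment And Leisure": 18.0, "Society": 15.0,
      "Economy And Enterprise": 20.0, "Sports": 10.0}


def coverage_gaps(a, b, cutoff=5.0):
    """Return group -> (a - b) for every group whose gap exceeds `cutoff` points."""
    gaps = {group: a.get(group, 0.0) - b.get(group, 0.0) for group in set(a) | set(b)}
    return {group: gap for group, gap in gaps.items() if abs(gap) > cutoff}


gaps = coverage_gaps(US, FR)
# Positive gaps: more covered in the first country. Negative: in the second.
for group, gap in sorted(gaps.items(), key=lambda item: item[1], reverse=True):
    print(f"{group}: {gap:+.1f}")
```

A group missing from one country's table is treated as 0% here, which is an assumption of this sketch.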

Comparing apples to apples.

The tables above show that coverage differs greatly between countries, meaning that feeding your intelligence tool and creating your whitelists based on random numbers of sources, media or articles will lead to unreliable, dangerous outcomes.

As demonstrated in this business case: Business Case #3. Country comparisons

Questions? Contact us!