News organizations such as the Associated Press, The New York Times and Thomson Reuters are teaching computers to categorize text and images by building robust taxonomies that their systems use to tag news content.
Adding digital information under the hood in this way helps link stories together and serve up relevant content to news audiences.
In a recent interview with Poynter, Associated Press staffers talked about the AP’s News Taxonomy and why a news organization might consider using it.
Taxonomy is the practice of classifying information. News organizations do this already: putting articles in the sports section instead of the business section is a way of classifying them. What’s different today is organizations are classifying articles using computers instead of human judgment.
Stuart Myles, director of information management at the AP, led the team that built the AP News Taxonomy with machine-learning and natural-language-processing tools to teach computers how to make decisions instead of having a person read every article or look up a caption on every photo. Once the computer decides the appropriate tags to add, those tags are attached to the article’s or photo’s metadata.
“We’ve created a system of rules that evaluate every single bit of English text we handle,” Myles told Poynter by phone.… Read more