Article
May 2, 2025
Welcome back to The Precision Content Series - our focus on the key areas that help deliver precision content. In our previous post, we introduced what we mean by precision content and outlined the five pillars required to achieve it:
Today, we spotlight the often unsung hero of quality localisation: Content Processing and Filtering.
While most organizations often focus much of their effort on translation mechanics, what happens before your content reaches translators or AI systems can dramatically impact quality, efficiency, and cost.
This crucial first step involves deciding what content to include or exclude for translation, identifying or generating additional context, and determining what adaptations are required before or after translation.
Consider the scenario, a global e-commerce company needs to translate its product catalog into 36 languages.
At this scale, organisations have typically invested in a Product Information Management System (PIM) and structured their content hierarchy to ease the management of the content at scale.
Without effective content processing, the following may happen:
As the content grows and markets scale, the challenges magnify exponentially, meaning any manual processes to support avoiding these becomes a more significant effort.
The good news is that much of this challenge can be overcome through effective Content Processing and Filtering.
Content Processing and Filtering encompasses several critical functions that set the stage for successful localization:
Before translation can begin, content must be properly segmented. This means it should be broken down into manageable, translatable units while preserving structural integrity.
Furthermore, not all content requires translation, and some requires special handling to ensure elements are protected and existing content re-used. Effective filtering involves:
By implementing robust filtering rules, organizations can reduce unnecessary translation volume, leading to significant time and cost savings.
Given the complexity of language, context is king in translation. To ensure a high-quality translation, processing should include:
Whilst some of the above may need up-front effort, often it can be derived at filtering time from the content itself, or and build on other assets such glossaries or context sources.
The best systems take advantage of the ability to generate file format specific context to enhance the translation process with minimal effort, providing a significant improvement from basic filtering.
Some elements require adaptation before or after translation, often automating steps that would otherwise require intervention:
The above are just some examples of automations that can save significant effort especially at scale. With the right framework, these automations can be chained together to create powerful pipelines.
The true power of content processing emerges when the above capabilities are implemented through automated, rules-based systems that can:
When we talk about content processing with others in the industry, often many are confused about why we focus so much effort on optimising this part of the process.
In its most basic form, ensuring the right content is extracted and protected is fundamental to translation, and is an element in every localisation solution.
Most of the solutions on the market provide filters that process a piece of content and a capability to protect pieces of text that shouldn’t be translated (e.g. placeholder variables or product SKUs).
Many however only provided limited customisation options within each filter type and off-load the rest of the adaptations required as added manual activities such as “Localization Engineering”, or leave them to the end-user outside of the tool.
Furthermore, often these rules and configurations are applied at a file type level - i.e. all Word or InDesign files - meaning that different formats across an organisation require additional effort to handle.
Whilst it may not be the most exciting feature to assess when reviewing a content platform, it can be one of the most impactful, with quality issues often tracing back to root causes triggered at this stage.
While content processing and filtering create the foundation of an effective translation process, it also serves as a crucial bridge to our next pillar: Context Enrichment.
These two pillars work in tandem, with effective processing creating opportunities for meaningful context enrichments.
Consider these connections:
The most sophisticated processing systems are designed with context enrichment in mind, creating structured opportunities for context insertion rather than treating it as an afterthought.
Content processing and filtering is evolving from a technical necessity to a strategic advantage. Organizations that invest in this foundation see compounding returns across their global content operations.
For business and localization leaders, the key implications are clear:
In our next installment of the Precision Content Series, we’ll explore the second pillar: Context Enrichment, in depth.
We’ll examine how context transforms translation quality, the different types of context that matter for different content, and how organizations can systematically implement context enrichment at scale.
If you’d like to stay up to date on our latest blog posts, you can subscribe to our mailing list and/or follow us on LinkedIn.