Lost in translation: AI language errors jeopardise asylum cases

Garbled translations could lead to application rejections

Immigrants queue around the government Home Office building of Becket House in London, England.  File photo: CHRIS J RATCLIFFE/GETTY IMAGES
Immigrants queue around the government Home Office building of Becket House in London, England. File photo: CHRIS J RATCLIFFE/GETTY IMAGES

Washington — Names translated as months of the year, incorrect time frames and mixed-up pronouns, the everyday failings of artificial intelligence (AI)-driven translation apps, are causing havoc in the US asylum system, say critics.

“We have countless examples of this nature,” said Ariel Koren, founder of Respond Crisis Translation, a global collective that has translated more than 13,000 asylum applications, warning that errors can lead to unfounded denials.

In one case, she said, attorneys missed a crucial detail in a woman’s account of domestic abuse because the translation app they were using kept breaking down, and they ran out of time.

“The machines themselves are not operating with even a fraction of the quality they need to be able to do case work that’s acceptable for someone in a high-stakes situation,” said Koren, who used to work for Google Translate.

She said a translator with the group estimated that 40% of Afghan asylum cases he worked on encountered problems due to machine translation. Cases involving Haitian Creole speakers also faced huge issues.

Government contractors and large aid organisations are using AI machine translation tools increasingly due to “an immense amount of incentive to cut costs”, said Koren.

The extent to which such tools are being used in US immigration processing is unclear, however, amid a broad lack of transparency, said Aliya Bhatia, a policy analyst with the Center for Democracy and Technology (CDT) think-tank.

“We know governments and asylum lobbies around the world ... are moving towards using automated technology,” said Bhatia.

A 2019 report from investigative news outlet ProPublica found that immigration officials were being directed to use Google Translate to “vet” social media use for refugee applications.

The US justice department and the Immigration and Customs Enforcement (ICE) agency did not respond to requests for comment, and nor did the White House, which recently released a national “blueprint” on AI guidelines.

Asked about concern about the use of machine translation in asylum cases, a Google spokesperson said its Google Translate tool underwent strict quality controls, pointing out that it was offered free of charge.

“We rigorously train and test our systems to ensure each of the 133 languages we support meets a high standard for translation quality,” said the spokesperson.

Training gap

A major shortcoming of translation tools’ use in asylum cases stems from difficulties in building in checks, said Gabe Nicholas, a research fellow at the CDT and co-author with Bhatia on a May paper on the models being used for machine translation.

“Because the person speaks only one language, the potential for mistakes and errors to go uncaught is really, really high,” he said.

Machine translation made huge progress in recent years, according to Nicholas and Bhatia, but it is still nowhere near good enough to be relied on in complex, high-stakes situations such as the asylum process.

A core problem is how the apps are trained in the first place: on digitised text, for which there are masses available for English but far less for other languages.

This results in less nuanced or simply incorrect translations, and it also means English or some other high-resource languages become “intermediaries though which these models view the world”, said Bhatia.

The result is Anglo-centric translations that often fail to capture crucial details accurately. 

Like many other sectors, the translation industry was upended in recent months by the release of “generative” AI tools such as ChatGPT.

“ChatGPT and AI are now on everybody’s minds,” said Jill Kushner Bishop, founder and CEO of Multilingual Connections, a company based in the Chicago area.

“There are cases for it, and those are more and more compelling all the time. But it’s still not ready in most cases to be used with the training wheels off and without a human involved,” said Bishop.

Production director Katie Baumann said tests tools and different languages regularly, but continues to find problems with text translations involving, say, Turkish or Japanese, or AI-driven audio transcriptions with background noise.

“We’ve run tests of extracts of law-enforcement interviews, processing and putting it through machine translation — a lot of it is nonsense. It wouldn’t save you any time, so we wouldn’t use it,” said Baumann.

So even as Multilingual Connections uses machine translation increasingly, a human is always involved. “You don’t know what you don’t know. So for someone who is not a speaker of the language ... you don’t know where the mistakes will be,” said Bishop.

“Think about asylum cases ... and what might be misunderstood without a human verifying,” she said.

OpenAI, which developed ChatGPT, declined to comment, but a spokesperson pointed to policies that bar use for “high risk government decision-making” including law enforcement, criminal justice, migration and asylum.

‘Terrible mess’

At Respond Crisis Translation, the shortcomings of AI-driven translation tools are also creating an extra layer of work for Koren and her colleagues.

“The people who need to clean up the mess are human translators,” she said.

One of the collective’s translators, Samara Zuza, has been working for three years with a Brazilian asylum seeker whose papers were poorly translated by an AI app while he was in immigration detention in California, she said.

The application was “full of insane mistakes”, said Zuza. “The names of the city and state are wrong. The sentences are reversed — and that’s the form that was sent to the court.”

She thinks it was these inaccuracies that resulted in the rejection of initial attempts to secure the man’s release. The man, who asked to be identified only as Carlos, a pseudonym, was eventually released in May 2020 after the two started working together.

“The language was the worst aspect for me,” Carlos said of his six months in immigration detention after he fled gang activity in Brazil.

He spoke by phone from Massachusetts, where he is now living as he applies for US residency.

Carlos, who is illiterate and speaks Brazilian Portuguese, said he could not communicate with immigration officials and other detainees for months.

To fill out his asylum paperwork, he relied on a tablet computer’s voice recorder coupled with an app that used machine translation.

“So many of the words were being wrongly translated,” he said. “My asylum papers were a terrible mess.”

Thomson Reuters Foundation

Would you like to comment on this article?
Sign up (it's quick and free) or sign in now.

Comment icon