About This Project

We are providing an automated method to detect ambiguous sentences based on machine translation. The proposed method works regardless of the semantics and therefore, is highly generalizable.

Potential practical applications are automatic translation and detection of ambiguity of critical texts such as in law or politics. Furthermore, it can be applied in fine-tuning multilingual LLMs and development of a ambiguity-free global human language.

Ask the Scientists

Join The Discussion

What is the context of this research?

Detecting ambiguity in text prevents automatic machine translators from misleading the reader of the target translation in understanding the text. Furthermore, by discovering a function that detects ambiguity, we would be able to fine-tune LLM-based machine translators to preserve the ambiguity in ambiguous sentences in translation.

Apart from that, this project, is part of a bigger project at Synaptosearch which tries to optimize an AI-generated global human language. By finding the ambiguity detection function, we would be able to use it as part of the optimization criteria to achieve an ambiguity-free language.

What is the significance of this project?

Language ambiguity brings up misunderstandings and conflicts in real world interactions such as legal, political, commercial and cultural interactions. This misunderstanding can lead to either waste of huge amount of time in negotiation between the parties for conflict resolution or even in worse case results in conflicting actions.

Detecting ambiguous statements in global interactions can be a critical problem that requires scientific research in providing solutions. One major organization that can benefit from the proposed research is the United Nations (UN) where different countries with different languages interact with each other.

What are the goals of the project?

The goal of this project is to propose a generalizable method to detect ambiguity in sentences, regardless of the semantics. In order to do so, we consider machine translation, as potential actions performed on single text. Conflict in different actions, accordingly, specifies ambiguity in the text.

In order to provide a semantic-free approach, we propose investigating how a text is translated to many target language and then translated back to the source language. By mapping the two step translations together and studying the properties of this mapping, we would be able to detect ambiguity, regardless of the semantics of the sentence.

Budget

Please wait...

The project initiates by a literature review on ambiguity in NLP, machine translation and LLMs. Later on, the prototype is implemented and tested. After that experiments are designed, implemented and conducted. Data annotation is performed in order to understand the source of ambiguities in translation. Finally the results and findings are submitted in well known journals.

Along the project, the management process verifies the steps and budget, handles the meetings, etc. Miscellaneous budget items including subscriptions such as overleaf, AWS, etc., are also considered for the project.

Research on this project will help us equip automatic machine translators with the capability of detecting ambiguity n the input text to prevent misunderstanding and mistakes in translation of critical texts such as legal or political documents. Furthermore, the project is part of a bigger project, focusing on developing AI-generated global human language, planned at Synaptosearch.

Endorsed by

This is a great project. I am excited to read the proposal. It is an ambitious project but once it is completed it will have a lot of utility.

Behnaam Aazhang

J.S. Abercrombie Professor

Rice University

Project Timeline

The expected timeline for the project is about 4 months which consists of research, implementation, experiments, annotation and finalization.

Mar 15, 2024

Literature review

Apr 15, 2024

Prototype

May 01, 2024

Experiments

May 15, 2024

Annotation

Jun 10, 2024

Project Launched

Show more events

Meet the Team

Behrang Mehrparvar

Doctor

Affiliates

Synaptosearch

View Profile

Team Bio

Synaptosearch is a freelance company started in Amsterdam, the Netherlands, in 2024. The main activity of the company is in domains of Cognitive Science, Artificial Intelligence, Data Science and partially Software Engineering. The company’s mission is to serve with research services and consultancy.

Behrang Mehrparvar

Passionate about collaborative interdisciplinary research, mainly in cognitive science, artificial intelligence, data science and software engineering.

Lab Notes

Nothing posted yet.

Project Backers

2Backers
1%Funded
$60Total Donations
$30.00Average Donation

Please wait...

Detecting and translating language ambiguity

About This Project

Ask the Scientists

What is the context of this research?

What is the significance of this project?

What are the goals of the project?

Budget

Endorsed by

Project Timeline

Mar 15, 2024

Literature review

Apr 15, 2024

Prototype

May 01, 2024

Experiments

May 15, 2024

Annotation

Jun 10, 2024

Project Launched

Jul 01, 2024

Final report

Meet the Team

Affiliates

Team Bio

Behrang Mehrparvar

Lab Notes

Project Backers