About This Project
We are providing an automated method to detect ambiguous sentences based on machine translation. The proposed method works regardless of the semantics and therefore, is highly generalizable.
Potential practical applications are automatic translation and detection of ambiguity of critical texts such as in law or politics. Furthermore, it can be applied in fine-tuning multilingual LLMs and development of a ambiguity-free global human language.
Ask the Scientists
Join The DiscussionWhat is the context of this research?
Detecting ambiguity in text prevents automatic machine translators from misleading the reader of the target translation in understanding the text. Furthermore, by discovering a function that detects ambiguity, we would be able to fine-tune LLM-based machine translators to preserve the ambiguity in ambiguous sentences in translation.
Apart from that, this project, is part of a bigger project at Synaptosearch which tries to optimize an AI-generated global human language. By finding the ambiguity detection function, we would be able to use it as part of the optimization criteria to achieve an ambiguity-free language.
What is the significance of this project?
Language ambiguity brings up misunderstandings and conflicts in real world interactions such as legal, political, commercial and cultural interactions. This misunderstanding can lead to either waste of huge amount of time in negotiation between the parties for conflict resolution or even in worse case results in conflicting actions.
Detecting ambiguous statements in global interactions can be a critical problem that requires scientific research in providing solutions. One major organization that can benefit from the proposed research is the United Nations (UN) where different countries with different languages interact with each other.
What are the goals of the project?
The goal of this project is to propose a generalizable method to detect ambiguity in sentences, regardless of the semantics. In order to do so, we consider machine translation, as potential actions performed on single text. Conflict in different actions, accordingly, specifies ambiguity in the text.
In order to provide a semantic-free approach, we propose investigating how a text is translated to many target language and then translated back to the source language. By mapping the two step translations together and studying the properties of this mapping, we would be able to detect ambiguity, regardless of the semantics of the sentence.
Budget
The project initiates by a literature review on ambiguity in NLP, machine translation and LLMs. Later on, the prototype is implemented and tested. After that experiments are designed, implemented and conducted. Data annotation is performed in order to understand the source of ambiguities in translation. Finally the results and findings are submitted in well known journals.
Along the project, the management process verifies the steps and budget, handles the meetings, etc. Miscellaneous budget items including subscriptions such as overleaf, AWS, etc., are also considered for the project.
Research on this project will help us equip automatic machine translators with the capability of detecting ambiguity n the input text to prevent misunderstanding and mistakes in translation of critical texts such as legal or political documents. Furthermore, the project is part of a bigger project, focusing on developing AI-generated global human language, planned at Synaptosearch.
Endorsed by
Project Timeline
The expected timeline for the project is about 4 months which consists of research, implementation, experiments, annotation and finalization.
Mar 15, 2024
Literature review
Apr 15, 2024
Prototype
May 01, 2024
Experiments
May 15, 2024
Annotation
Jun 10, 2024
Project Launched
Meet the Team
Team Bio
Synaptosearch is a freelance company started in Amsterdam, the Netherlands, in 2024. The main activity of the company is in domains of Cognitive Science, Artificial Intelligence, Data Science and partially Software Engineering. The company’s mission is to serve with research services and consultancy.
Behrang Mehrparvar
Passionate about collaborative interdisciplinary research, mainly in cognitive science, artificial intelligence, data science and software engineering.
Lab Notes
Nothing posted yet.
Project Backers
- 2Backers
- 1%Funded
- $60Total Donations
- $30.00Average Donation