Analyzing the impact of language clarity on scientific literature quality.

$311 raised of $8,800 goal (4% funded)
Campaign ended on 11/24/23

About This Project

Tortured phrases, characterized by complex and ambiguous language, obscure authors' real meanings and pose a growing threat to scholarly credibility. For instance, some articles use "counterfeit neural organizations" instead of "Artificial Neural Networks" and "portrayal learning" instead of "representation learning." To counter this issue, we will analyze 15,000 articles, investigating how tortured phrases correlate with article retractions and author ethics, and how they affect publishers and institutions.

What is the context of this research?

While investigating various research papers, we encountered tortured phrases that challenged our understanding. For example, in a 2021 article, we read the sentences: "The secret segments move dependent upon the disease. Coronary vein disease, stroke, and periphery supply course ailment incorporate atherosclerosis." Upon further examination, we realized the intended statement was: "The underlying mechanisms vary depending on the disease. Coronary artery disease, stroke, and peripheral artery disease involve atherosclerosis." Sentences like these made it hard to grasp the author's intended meaning. We therefore aim to understand the impact of tortured phrases on scholarly credibility, retractions, ethical concerns, and the reputation of publishers and institutions.

What is the significance of this project?

Our project holds significant implications for scholarly research and communication, prioritizing clarity and transparency in scientific literature. We seek to enhance comprehension among peers and the public by addressing the challenge of tortured phrases and complex language. Through meticulous analysis of 15,000 articles, we plan to identify and replace tortured phrases, collaborating with authors, publishers, and institutions for meaningful change. By establishing guidelines and encouraging a cultural shift towards more explicit language, our work aims to transform scholarly communication, elevating accessibility and the impact of scientific knowledge for a lasting effect.

What are the goals of the project?

We will use the funds to pay for server services, model training, students, and volunteers. The research follows four steps. First, we gather preprints from Research Square, arXiv, and PubMed using Talbots. Second, we identify and quantify tortured phrases by calculating the ratio of tortured phrases to total pages. Third, we collect other quality metrics (Interest Score, citations, and Altmetric index) to evaluate preprint quality. Finally, we analyze the ratio of tortured phrases against the quality score to check how quality is affected by the presence or absence of tortured phrases. Computer Science, Engineering, and Education are our fields of expertise, so we focus only on them in this research. We will share our findings with the community through publication on arXiv.
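
As a rough illustration of the second and fourth steps, the sketch below counts tortured phrases in a text, divides by the page count, and correlates that ratio with a quality score. It is only a minimal example under stated assumptions, not the Talbots implementation: the phrase table holds just the two pairs quoted in this description, the function names are hypothetical, and Pearson correlation is one plausible way to relate the ratio to a quality score, since the description does not name a statistic.

```python
import math
import re

# Assumption: a tiny lookup table built from the two examples quoted in this
# description. A real study would need a far larger, curated list.
TORTURED_PHRASES = {
    "counterfeit neural organizations": "artificial neural networks",
    "portrayal learning": "representation learning",
}


def count_tortured_phrases(text):
    """Count case-insensitive occurrences of the known tortured phrases."""
    lowered = text.lower()
    return sum(
        len(re.findall(re.escape(phrase), lowered))
        for phrase in TORTURED_PHRASES
    )


def tortured_ratio(text, num_pages):
    """Ratio of tortured phrases to total pages (hypothetical helper)."""
    if num_pages <= 0:
        raise ValueError("num_pages must be positive")
    return count_tortured_phrases(text) / num_pages


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (spread_x * spread_y)
```

Given one ratio and one quality score per preprint, `pearson(ratios, scores)` returns a value between -1 and 1. A clearly negative value would suggest that preprints with more tortured phrases per page tend to score lower, although a correlation alone would not show that the phrases cause the lower quality.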



We have completed Phase 1 of our research with approximately $3,500 from the personal funds of our leader. The outcome of this phase has been the development of basic features for the Talbots platform. Our manuscript has been submitted to a journal and is currently under review. We have now exhausted our funding and are struggling even to sustain the FREE Talbots platform for the user community.

The budget presented here is allocated towards maintaining the Talbots platform and completing the research over the next six months. Our desired outcome is a comprehensive report on tortured phrases in 15,000 scientific articles. We plan to publish it on arXiv, making it easily accessible to everyone interested.

Our ultimate goal is to keep the Talbots platform FREE for the community at all times. We also pledge to maintain a real-time list of sponsors on our project's website to express our gratitude towards them.

Endorsed by

I wholeheartedly endorse the Scholarly Clarity Project, addressing the critical issue of "tortured phrases" in academic literature. Their analysis of 15,000 articles to unravel complex language and enhance scholarly credibility is commendable. Their commitment to transparency and improving the accessibility of scientific knowledge is poised to foster a significant cultural shift in academic communications. The detailed plan and budget reflect their dedication to keeping resources accessible, making this initiative both impactful and necessary.
What a wonderful chatbot that can help me classify scientific articles so that I can choose good quality articles. You can search for any article by DOI link.
I am really excited about this project. Also, I believe this project will bring benefits to the field and open opportunities for further research in the near future.
I am really excited about this project. I believe it will answer critical questions in this field of study. This researcher is the best person to answer these questions.

Project Timeline

We outline a carefully structured plan in five distinct phases. Phase 1 (one month) focuses on updating the Talbots Library on the PyPI platform. Phase 2 (one month) involves refining and deploying the Talbots website. Phase 3 (two months) collects data via the Talbots website. Phase 4 (1.5 months) involves processing and analyzing the accumulated data. Lastly, Phase 5 (0.5 months) centers on composing a scholarly manuscript and submitting it to arXiv.

Oct 25, 2023

Project Launched

Nov 30, 2023

Updating the Talbots Library on the PyPI Platform.

Dec 31, 2023

Correcting and Deploying the Talbots Website.

Mar 06, 2024

Collecting the Data with the Talbots website.

Apr 17, 2024

Processing and Analyzing the Data.

Meet the Team

Tan H. Nguyen


University of the People

Tan H. Nguyen

Hello everyone

I'm Tan H. Nguyen, an experienced researcher with almost a decade of hands-on engagement in AI and its applications. My journey spans roles at FPT Education Organization, FPT Telecom, and volunteering at the Vietnam Ministry of Health. My focus areas include algorithms, computer vision, natural language processing, and robotics.

I'm currently pursuing my second Bachelor's degree at the University of the People. I've contributed to impactful research projects, from image processing to co-authoring textbooks and translating "Think Python: How to Think Like a Computer Scientist" into Vietnamese.

Noteworthy accomplishments include recognition at the National Youth Creativity Festival in 2015 and consecutive Dean's List and President's List placements (2021-2023) at UoPeople. is my gateway to share my research endeavors with the wider scientific community. Through this platform, I aim to foster connections, learn from fellow researchers, and collectively propel the frontiers of science and technology.

Explore with me on Learn more about me on LinkedIn:

Best regards,
Tan H. Nguyen

Additional Information

To learn more about our research, please visit the following links:

1. Talbots website:

2. Head of Research Team:

Project Backers

  • 8 Backers
  • 4% Funded
  • $311 Total Donations
  • $38.88 Average Donation