AI vs AI: Scientists Develop Neural Networks to Detect Generated Text Insertions
A research team, including Alexander Shirnin from HSE University, has developed two models designed to detect AI-generated insertions in scientific texts. The AIpom system integrates two types of models: a decoder and an encoder. The Papilusion system is designed to detect modifications through synonyms and summarisation by neural networks, using one type of models: encoders. In the future, these models will assist in verifying the originality and credibility of scientific publications. Articles describing the Papilusion and AIpom systems have been published in the ACL Anthology Digital Archive.
As language models like ChatGPT and GigaChat become more popular and widely used, it becomes increasingly challenging to distinguish original human-written text from AI-generated content. Artificial intelligence is already being used to write scientific publications and graduation papers. Therefore, it is crucial to develop tools capable of identifying AI-generated insertions in texts. A research team, including scientists from HSE University, presented their solutions at the SemEval 2024 and DAGPap24 international scientific competitions.
The AIpom model was used to identify the boundaries between original and generated fragments in scientific papers. In each paper, the proportion of machine-generated text to the author's text varied. To train the models, the organisers provided texts on the same topic. However, during the verification stage, the topics changed, making the task more challenging.
Alexander Shirnin
'Models perform well on familiar topics, but their performance declines when presented with new topics,' according to Alexander Shirnin, co-author of the paper and Research Assistant at the Laboratory for Models and Methods of Computational Pragmatics, HSE Faculty of Computer Science. 'It's like a student who, having learned how to solve one type of problem, struggles to solve a problem on an unfamiliar topic or from a different subject as easily or accurately.'
To improve the system's performance, the researchers combined two models: a decoder and an encoder. At the first stage, a neural network decoder was used, with the input consisting of an instruction and the source text, and the output being a text fragment presumably generated by AI. Next, in the original text, the area where the model predicted the beginning of a generated fragment was highlighted using a special <BREAK> token. The encoder then processed the text marked up in the first stage and refined the decoder's predictions. To do this, it categorised each token—the smallest unit of text, such as a word or part of a word—and identified whether it was written by a human or generated by AI. This approach improved accuracy compared to systems that used only one type of model: AIpom ranked second at the SemEval-2024 competition.
The Papilusion model also distinguished between written text and generated text. Using Papilusion, sections of the text were classified into four categories: written by a human, modified with synonyms, generated, or summarised by a model. The task was to accurately identify each category. The number of categories and the length of insertions in the texts varied.
In this case, the developers used three models, all of the same type: encoders. They were trained to predict one of the four categories for each token in the text, with each model trained independently of the others. When a model made an error, a cost was applied, and the model was retrained with the lower layers frozen.
'Each model has a different number of layers, depending on its architecture. When training a model, we can leave the first ten or so layers unchanged and adjust only the parameters in the last two layers. This is done to prevent losing important data embedded in the first layers during training,' explains Alexander Shirnin. 'It can be compared to an athlete who makes an error in the movement of their hand. We only need to explain this part to them, rather than resetting their entire learning and retraining them, as they might forget how to move correctly overall. The same logic applies here. The method is not universal and may not work with all models, but in our case, it was effective.'
The three encoders independently determined the category for each token (word). The system's final prediction was based on the category that received the most points. Papilusion ranked sixth out of 30 in the competition.
According to the researchers, current AI detection models perform reasonably well but still have limitations. Primarily, they struggle to process data beyond what they were trained on, and overall, there is a lack of diverse data to train the models effectively.
'To obtain more data, we need to focus on collecting it. Both companies and laboratories have been doing this. Specifically for this type of task, it is necessary to collect datasets that include texts modified using multiple AI models and modification methods,' the researcher comments. 'Instead of continuing a text using just one model, more realistic scenarios should be created, such as asking the model to add to the text, rewrite the beginning for better coherence, remove parts of it, or generate a portion of the text in a new style using a different prompt. Of course, it is also important to collect data in different languages and on a variety of topics.'
See also:
Acoustic Battles for the Harem: How the Calls of Siberian Wapiti Reveal Their Status and Individuality
Researchers at HSE University, Lomonosov Moscow State University, and the A.N. Severtsov Institute of Ecology and Evolution of the Russian Academy of Sciences have studied the distinctive vocalisations of Siberian wapiti (Cervus canadensis sibiricus) stags during the peak of the mating season, when males produce rutting calls (bugles) to attract females (hinds) and deter rivals. The scientists have discovered how the acoustic parameters of these rutting calls reflect the stag's status—whether he currently holds a harem or is still attempting to acquire one—as well as his individual characteristics. The study has been published in Journal of Zoology.
Z-Flipons: How Specific DNA Regions Help Regulate Gene Function
Researchers at HSE University and InsideOutBio have applied machine learning to identify the location and functions of mirror-twisted DNA structures, known as Z-flipons, in human and mouse genomes. The scientists discovered which Z-DNA regions were conserved in both species throughout evolution and demonstrated for the first time that Z-DNA accelerates the process of creating RNA copies of genes. The findings will contribute to the development of new treatments for genetic diseases. The study has been published in Scientific Reports.
HSE Researchers Develop Python Library for Analysing Eye Movements
A research team at HSE University has developed EyeFeatures, a Python library for analysing and modelling eye movement data. This tool is designed to simplify the work of scientists and developers by enabling them to efficiently process complex data and create predictive models.
Scientists Identify Fifteen Key Motives Driving Human Behaviour
Researchers at HSE University and the London School of Hygiene and Tropical Medicine have identified 15 key motives that drive human behaviour. By analysing people's views, preferences, and actions through an evolutionary lens, they demonstrated how these motives intertwine to shape habits and interpersonal relationships. The findings have been published in Personality and Individual Differences.
HSE Neurolinguists Create Russian Adaptation of Classic Verbal Memory Test
Researchers at the HSE Centre for Language and Brain and Psychiatric Hospital No. 1 Named after N.A. Alexeev have developed a Russian-language adaptation of the Rey Auditory Verbal Learning Test. This classic neuropsychological test evaluates various aspects of auditory verbal memory in adults and is widely used in both clinical diagnostics and research. The study findings have been published in The Clinical Neuropsychologist.
Tickling the Nerves: Why Crime Content is Popular
Consumers of content about serial killers watch and read it to experience intense emotions that are often lacking in everyday life and to understand the reasons that drive people to commit crimes. However, such content does not contribute to increased aggression. These conclusions were drawn by sociologists from HSE University. The results of their study have been published in Crime, Media, Culture: An International Journal.
HSE Researchers Prove the Existence of Nash Equilibrium for a New Class of Problems in Game Theory
Researchers at HSE University's St Petersburg School of Economics and Management have been exploring methods for the efficient allocation of resources in systems involving multiple players. The scientists have proven the existence of strategies for optimal decision-making in competition for limited, discrete resources in four different cases. The developed mathematical model can be applied in various fields, ranging from education and medicine to managing networks and computing power. The paper has been published in Games and Economic Behaviour.
Researchers at HSE Centre for Language and Brain Reveal Key Factors Determining Language Recovery in Patients After Brain Tumour Resection
Alina Minnigulova and Maria Khudyakova at the HSE Centre for Language and Brain have presented the latest research findings on the linguistic and neural mechanisms of language impairments and their progression in patients following neurosurgery. The scientists shared insights gained from over five years of research on the dynamics of language impairment and recovery.
Neuroscientists Reveal Anna Karenina Principle in Brain's Response to Persuasion
A team of researchers at HSE University investigated the neural mechanisms involved in how the brain processes persuasive messages. Using functional MRI, the researchers recorded how the participants' brains reacted to expert arguments about the harmful health effects of sugar consumption. The findings revealed that all unpersuaded individuals' brains responded to the messages in a similar manner, whereas each persuaded individual produced a unique neural response. This suggests that successful persuasive messages influence opinions in a highly individual manner, appearing to find a unique key to each person's brain. The study findings have been published in PNAS.
Russian Scientists Improve Water Purification Membranes Using Metal Ions
Researchers have proposed using polymer membranes modified with copper, zinc, and chromium metal ions for water purification. These polymers were used for the first time in water purification via electrodialysis. Copper-based membranes demonstrated record selectivity for monovalent ions, opening new possibilities for sustainable water recycling. The study has been published in the Journal of Membrane Science.