Code for "Detection of LLM-Generated Java Code Using Discretized Nested Bigrams" (arXiv:2502.15740). Achieves state-of-the-art performance in distinguishing human vs. LLM-written Java.
-
Updated
May 15, 2025 - Java
Code for "Detection of LLM-Generated Java Code Using Discretized Nested Bigrams" (arXiv:2502.15740). Achieves state-of-the-art performance in distinguishing human vs. LLM-written Java.
Text analysis to determine rank-frequency curves for words and bigrams, and vocabulary growth curves.
Application that automatically extracts collocations from the Google 2-grams dataset using Amazon Elastic Map Reduce
Project for Parallel Computing course. Sequential and parallel implementations of Bigrams search algorithm in Java.
A Java implementation of different probabilistic part-of-speech tagging techniques.
Add a description, image, and links to the bigrams topic page so that developers can more easily learn about it.
To associate your repository with the bigrams topic, visit your repo's landing page and select "manage topics."