Text Classification
Transformers
PyTorch
Safetensors
English
roberta
mgt-detection
ai-detection
text-embeddings-inference
Instructions to use andreas122001/roberta-wiki-detector with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use andreas122001/roberta-wiki-detector with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="andreas122001/roberta-wiki-detector")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("andreas122001/roberta-wiki-detector") model = AutoModelForSequenceClassification.from_pretrained("andreas122001/roberta-wiki-detector") - Notebooks
- Google Colab
- Kaggle
| license: openrail | |
| widget: | |
| - text: I am totally a human, trust me bro. | |
| example_title: default | |
| - text: >- | |
| In Finnish folklore, all places and things, and also human beings, have a | |
| haltija (a genius, guardian spirit) of their own. One such haltija is called | |
| etiäinen—an image, doppelgänger, or just an impression that goes ahead of a | |
| person, doing things the person in question later does. For example, people | |
| waiting at home might hear the door close or even see a shadow or a | |
| silhouette, only to realize that no one has yet arrived. Etiäinen can also | |
| refer to some kind of a feeling that something is going to happen. Sometimes | |
| it could, for example, warn of a bad year coming. In modern Finnish, the | |
| term has detached from its shamanistic origins and refers to premonition. | |
| Unlike clairvoyance, divination, and similar practices, etiäiset (plural) | |
| are spontaneous and can't be induced. Quite the opposite, they may be | |
| unwanted and cause anxiety, like ghosts. Etiäiset need not be too dramatic | |
| and may concern everyday events, although ones related to e.g. deaths are | |
| common. As these phenomena are still reported today, they can be considered | |
| a living tradition, as a way to explain the psychological experience of | |
| premonition. | |
| example_title: real wikipedia | |
| - text: >- | |
| In Finnish folklore, all places and things, animate or inanimate, have a | |
| spirit or "etiäinen" that lives there. Etiäinen can manifest in many forms, | |
| but is usually described as a kind, elderly woman with white hair. She is | |
| the guardian of natural places and often helps people in need. Etiäinen has | |
| been a part of Finnish culture for centuries and is still widely believed in | |
| today. Folklorists study etiäinen to understand Finnish traditions and how | |
| they have changed over time. | |
| example_title: generated wikipedia | |
| - text: >- | |
| This paper presents a novel framework for sparsity-certifying graph | |
| decompositions, which are important tools in various areas of computer | |
| science, including algorithm design, complexity theory, and optimization. | |
| Our approach is based on the concept of "cut sparsifiers," which are sparse | |
| graphs that preserve the cut structure of the original graph up to a certain | |
| error bound. We show that cut sparsifiers can be efficiently constructed | |
| using a combination of spectral techniques and random sampling, and we use | |
| them to develop new algorithms for decomposing graphs into sparse subgraphs. | |
| example_title: from ChatGPT | |
| - text: >- | |
| Recent work has demonstrated substantial gains on many NLP tasks and | |
| benchmarks by pre-training on a large corpus of text followed by fine-tuning | |
| on a specific task. While typically task-agnostic in architecture, this | |
| method still requires task-specific fine-tuning datasets of thousands or | |
| tens of thousands of examples. By contrast, humans can generally perform a | |
| new language task from only a few examples or from simple instructions - | |
| something which current NLP systems still largely struggle to do. Here we | |
| show that scaling up language models greatly improves task-agnostic, | |
| few-shot performance, sometimes even reaching competitiveness with prior | |
| state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an | |
| autoregressive language model with 175 billion parameters, 10x more than any | |
| previous non-sparse language model, and test its performance in the few-shot | |
| setting. For all tasks, GPT-3 is applied without any gradient updates or | |
| fine-tuning, with tasks and few-shot demonstrations specified purely via | |
| text interaction with the model. GPT-3 achieves strong performance on many | |
| NLP datasets, including translation, question-answering, and cloze tasks, as | |
| well as several tasks that require on-the-fly reasoning or domain | |
| adaptation, such as unscrambling words, using a novel word in a sentence, or | |
| performing 3-digit arithmetic. At the same time, we also identify some | |
| datasets where GPT-3's few-shot learning still struggles, as well as some | |
| datasets where GPT-3 faces methodological issues related to training on | |
| large web corpora. Finally, we find that GPT-3 can generate samples of news | |
| articles which human evaluators have difficulty distinguishing from articles | |
| written by humans. We discuss broader societal impacts of this finding and | |
| of GPT-3 in general. | |
| example_title: GPT-3 paper | |
| datasets: | |
| - NicolaiSivesind/human-vs-machine | |
| - gfissore/arxiv-abstracts-2021 | |
| language: | |
| - en | |
| pipeline_tag: text-classification | |
| tags: | |
| - mgt-detection | |
| - ai-detection | |
| Machine-generated text-detection by fine-tuning of language models | |
| === | |
| This project is related to a bachelor's thesis with the title "*Turning Poachers into Gamekeepers: Detecting Machine-Generated Text in Academia using Large Language Models*" (see [here](https://ntnuopen.ntnu.no/ntnu-xmlui/handle/11250/3078096)) written by *Nicolai Thorer Sivesind* and *Andreas Bentzen Winje* at the *Department of Computer Science* at the *Norwegian University of Science and Technology*. | |
| It contains text classification models trained to distinguish human-written text from text generated by language models like ChatGPT and GPT-3. The best models were able to achieve an accuracy of 100% on real and *GPT-3*-generated wikipedia articles (4500 samples), and an accuracy of 98.4% on real and *ChatGPT*-generated research abstracts (3000 samples). | |
| The dataset card for the dataset that was created in relation to this project can be found [here](https://huggingface.co/datasets/NicolaiSivesind/human-vs-machine). | |
| **NOTE**: the hosted inference on this site only works for the RoBERTa-models, and not for the Bloomz-models. The Bloomz-models otherwise can produce wrong predictions when not explicitly providing the attention mask from the tokenizer to the model for inference. To be sure, the [pipeline](https://huggingface.co/docs/transformers/main_classes/pipelines)-library seems to produce the most consistent results. | |
| ## Fine-tuned detectors | |
| This project includes 12 fine-tuned models based on the RoBERTa-base model, and three sizes of the bloomz-models. | |
| | Base-model | RoBERTa-base | Bloomz-560m | Bloomz-1b7 | Bloomz-3b | | |
| |------------|--------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------| | |
| | Wiki | [roberta-wiki](https://huggingface.co/andreas122001/roberta-wiki-detector) | [Bloomz-560m-wiki](https://huggingface.co/andreas122001/bloomz-560m-wiki-detector) | [Bloomz-1b7-wiki](https://huggingface.co/andreas122001/bloomz-1b7-wiki-detector) | [Bloomz-3b-wiki](https://huggingface.co/andreas122001/bloomz-3b-wiki-detector) | | |
| | Academic | [roberta-academic](https://huggingface.co/andreas122001/roberta-academic-detector) | [Bloomz-560m-academic](https://huggingface.co/andreas122001/bloomz-560m-academic-detector) | [Bloomz-1b7-academic](https://huggingface.co/andreas122001/bloomz-1b7-academic-detector) | [Bloomz-3b-academic](https://huggingface.co/andreas122001/bloomz-3b-academic-detector) | | |
| | Mixed | [roberta-mixed](https://huggingface.co/andreas122001/roberta-mixed-detector) | [Bloomz-560m-mixed](https://huggingface.co/andreas122001/bloomz-560m-mixed-detector) | [Bloomz-1b7-mixed](https://huggingface.co/andreas122001/bloomz-1b7-mixed-detector) | [Bloomz-3b-mixed](https://huggingface.co/andreas122001/bloomz-3b-mixed-detector) | | |
| ### Datasets | |
| The models were trained on selections from the [GPT-wiki-intros]() and [ChatGPT-Research-Abstracts](), and are separated into three types, **wiki**-detectors, **academic**-detectors and **mixed**-detectors, respectively. | |
| - **Wiki-detectors**: | |
| - Trained on 30'000 datapoints (10%) of GPT-wiki-intros. | |
| - Best model (in-domain) is Bloomz-3b-wiki, with an accuracy of 100%. | |
| - **Academic-detectors**: | |
| - Trained on 20'000 datapoints (100%) of ChatGPT-Research-Abstracts. | |
| - Best model (in-domain) is Bloomz-3b-academic, with an accuracy of 98.4% | |
| - **Mixed-detectors**: | |
| - Trained on 15'000 datapoints (5%) of GPT-wiki-intros and 10'000 datapoints (50%) of ChatGPT-Research-Abstracts. | |
| - Best model (in-domain) is RoBERTa-mixed, with an F1-score of 99.3%. | |
| ### Hyperparameters | |
| All models were trained using the same hyperparameters: | |
| ```python | |
| { | |
| "num_train_epochs": 1, | |
| "adam_beta1": 0.9, | |
| "adam_beta2": 0.999, | |
| "batch_size": 8, | |
| "adam_epsilon": 1e-08 | |
| "optim": "adamw_torch" # the optimizer (AdamW) | |
| "learning_rate": 5e-05, # (LR) | |
| "lr_scheduler_type": "linear", # scheduler type for LR | |
| "seed": 42, # seed for PyTorch RNG-generator. | |
| } | |
| ``` | |
| ### Metrics | |
| Metrics can be found at https://wandb.ai/idatt2900-072/IDATT2900-072. | |
| In-domain performance of wiki-detectors: | |
| | Base model | Accuracy | Precision | Recall | F1-score | | |
| |-------------|----------|-----------|--------|----------| | |
| | Bloomz-560m | 0.973 | *1.000 | 0.945 | 0.972 | | |
| | Bloomz-1b7 | 0.972 | *1.000 | 0.945 | 0.972 | | |
| | Bloomz-3b | *1.000 | *1.000 | *1.000 | *1.000 | | |
| | RoBERTa | 0.998 | 0.999 | 0.997 | 0.998 | | |
| In-domain peformance of academic-detectors: | |
| | Base model | Accuracy | Precision | Recall | F1-score | | |
| |-------------|----------|-----------|--------|----------| | |
| | Bloomz-560m | 0.964 | 0.963 | 0.965 | 0.964 | | |
| | Bloomz-1b7 | 0.946 | 0.941 | 0.951 | 0.946 | | |
| | Bloomz-3b | *0.984 | *0.983 | 0.985 | *0.984 | | |
| | RoBERTa | 0.982 | 0.968 | *0.997 | 0.982 | | |
| F1-scores of the mixed-detectors on all three datasets: | |
| | Base model | Mixed | Wiki | CRA | | |
| |-------------|--------|--------|--------| | |
| | Bloomz-560m | 0.948 | 0.972 | *0.848 | | |
| | Bloomz-1b7 | 0.929 | 0.964 | 0.816 | | |
| | Bloomz-3b | 0.988 | 0.996 | 0.772 | | |
| | RoBERTa | *0.993 | *0.997 | 0.829 | | |
| ## Credits | |
| - [GPT-wiki-intro](https://huggingface.co/datasets/aadityaubhat/GPT-wiki-intro), by Aaditya Bhat | |
| - [arxiv-abstracts-2021](https://huggingface.co/datasets/gfissore/arxiv-abstracts-2021), by Giancarlo | |
| - [Bloomz](bigscience/bloomz), by BigScience | |
| - [RoBERTa](https://huggingface.co/roberta-base), by Liu et. al. | |
| ## Citation | |
| Please use the following citation: | |
| ``` | |
| @misc {sivesind_2023, | |
| author = { {Nicolai Thorer Sivesind} and {Andreas Bentzen Winje} }, | |
| title = { Machine-generated text-detection by fine-tuning of language models }, | |
| url = { https://huggingface.co/andreas122001/roberta-academic-detector }, | |
| year = 2023, | |
| publisher = { Hugging Face } | |
| } | |
| ``` |