-
Notifications
You must be signed in to change notification settings - Fork 0
Description
Background
I am a college student currently working on implementing the ConDA-gen-text-detection code from Amrita Bhattacharjee's GitHub repository. During this process, I have encountered some issues and would appreciate some guidance and assistance.
Process
I downloaded the data from TuringBench, specifically the TuringBench.zip file, and successfully processed all CSV files into real_dataset.jsonl and fake_dataset.jsonl files.
After completing the preprocessing steps, I generated the relevant scrambled text and proceeded to run the multi_domain_runner.py script, resulting in the creation of a .pt model file.
Problem
However, when attempting to evaluate the model using the evaluation.py script, I did not obtain the F1 score mentioned in Amrita Bhattacharjee's paper. I am using the robert-base coding tool for this implementation.
Request
I would greatly appreciate some insights into the correct approach for obtaining the results mentioned in the paper. There may be specific parameters or steps that I overlooked during the process.
Relevant Information
Encoder tool used: robert-base
Data source: TuringBench.zip file
Processed files: real_dataset.jsonl and fake_dataset.jsonl
Contact Information
Name: Ruifan Zhao
University: Mongolian University
Email: zhaomr314@gmail.com
Thank you for your time and assistance. Looking forward to your response.