Skip to content

Assistance Request for ConDA-gen-text-detection Code Implementation #1

@Doran-fans

Description

@Doran-fans

Background
I am a college student currently working on implementing the ConDA-gen-text-detection code from Amrita Bhattacharjee's GitHub repository. During this process, I have encountered some issues and would appreciate some guidance and assistance.

Process
I downloaded the data from TuringBench, specifically the TuringBench.zip file, and successfully processed all CSV files into real_dataset.jsonl and fake_dataset.jsonl files.
After completing the preprocessing steps, I generated the relevant scrambled text and proceeded to run the multi_domain_runner.py script, resulting in the creation of a .pt model file.
Problem
However, when attempting to evaluate the model using the evaluation.py script, I did not obtain the F1 score mentioned in Amrita Bhattacharjee's paper. I am using the robert-base coding tool for this implementation.

Request
I would greatly appreciate some insights into the correct approach for obtaining the results mentioned in the paper. There may be specific parameters or steps that I overlooked during the process.

Relevant Information
Encoder tool used: robert-base
Data source: TuringBench.zip file
Processed files: real_dataset.jsonl and fake_dataset.jsonl
Contact Information
Name: Ruifan Zhao
University: Mongolian University
Email: zhaomr314@gmail.com
Thank you for your time and assistance. Looking forward to your response.

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions