The following is from example.py
if args.model_size == '7B':
    adjusted_token_ids = [1] + adjusted_token_ids # Adjusting to recover the first token_ids of the sentences
adjusted_token_ids = torch.tensor([adjusted_token_ids])
labeled_token_indices = labeled_token_indices[2:] # Adjusting to skip the first two separators (beginning and ending of the problems)
My question is: when you prepend [1] to "adjusted_token_ids", every token in "adjusted_token_ids" shifts one position to the right, so the positions that "labeled_token_indices" point to change as well. As I understand it, "labeled_token_indices" would then also need to shift right by one position, but the code does not seem to do that. This is what confuses me.
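Here is a toy sketch of the shift I mean. The token ids, the separator id `SEP_ID`, and the variable names `indices_before` / `indices_after` are made up for illustration and are not taken from example.py. I am also assuming the indices were computed before the [1] was prepended, which may not be how example.py actually works.

```python
# Toy illustration with made-up values; not the actual data from example.py.
SEP_ID = 99  # hypothetical separator token id
BOS_ID = 1   # the id that is prepended in the 7B branch

adjusted_token_ids = [5, SEP_ID, 6, 7, SEP_ID]

# Positions of the separator BEFORE prepending BOS_ID.
indices_before = [i for i, t in enumerate(adjusted_token_ids) if t == SEP_ID]

# Prepend BOS_ID, as the 7B branch does.
shifted_token_ids = [BOS_ID] + adjusted_token_ids

# Positions of the separator AFTER prepending BOS_ID.
indices_after = [i for i, t in enumerate(shifted_token_ids) if t == SEP_ID]

print(indices_before)  # [1, 4]
print(indices_after)   # [2, 5]
```

If "labeled_token_indices" corresponds to `indices_before`, it would point one position too far left after the prepend. If it was already computed against the sequence that includes the leading token, then no extra shift would be needed.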