textrec

Experiments in recommending text for composition tasks.

Workflows

Train the model

(TODO: write up)

See what tasks look like:

Demos:

http://mbp-zt.megacomplete.net:3000/?idx

Post writing task

Deploy code to production: make deploy. Test on server.
Post HIT (HITs/2018-04-09.html).

Run analysis

make get-data
make get-completed-participants
make data/analyzed/trial_cueX.csv

Qualify workers who completed the writing task

Download MTurk metadata: scripts/backup_mturk.py.
Run script on downloaded data: scripts/assign_qualification_to_workers.py.

TODO: update this for the cue writing tasks.

Make a hotfix for analysis of a deployed frontend build

Make a new branch from the frontend commit (git_rev in the "init" request): git branch tmp $frontend_commit
Make the hotfix there (perhaps by cherry-picking or copying from master).
Note the current hash.
Check out master and edit analysis_util.py to add an entry to rev_overrides mapping the frontend commit to the hash recorded in the previous step.
Incorporate the hotfix commit into the git history: git merge -s ours tmp.

You can use a worktree to make this easier:

git worktree add tmp-worktree -b tmp $frontend_commit
then when you're done: git worktree remove tmp-worktree.
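
As a rough illustration of the rev_overrides entry described in the steps above, the mapping could be a plain dictionary from the deployed frontend commit to the hotfix commit. This is a sketch only: the hashes are placeholders, and the resolve_rev helper is hypothetical (the actual lookup code in analysis_util.py may differ).

```python
# Hypothetical shape of the mapping in analysis_util.py.
# Key: frontend commit reported as git_rev in the "init" request.
# Value: hash of the hotfix commit to use for analysis instead.
# Both hashes below are placeholders, not real commits.
rev_overrides = {
    "aaaaaaa": "bbbbbbb",
}


def resolve_rev(git_rev):
    """Return the override for git_rev if one exists, else git_rev unchanged.

    Hypothetical helper, shown only to illustrate how the mapping could be used.
    """
    return rev_overrides.get(git_rev, git_rev)
```

With this shape, a log recorded under the placeholder commit "aaaaaaa" would be analyzed with "bbbbbbb", while any commit without an entry is used as-is.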

Approach for counterbalancing

Random assignment to condition ordering does not suffice to ensure adequate counterbalancing. So when a new participant connects, we assign them the condition ordering with the lowest expected number of completed trials given the assignments and completions so far. This expectation is a sum of indicators: 1 for any already-completed trial, and the probability of completion for any assigned-but-incomplete trial. We take the probability of completion of an incomplete trial to be 0.5.
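
The assignment rule above can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code: the function names, the representation of an ordering as a tuple, and the tie-breaking rule (first ordering in the list wins) are all assumptions.

```python
# Probability that an assigned-but-incomplete trial is eventually completed
# (0.5, as stated in the text above).
P_COMPLETE_IF_INCOMPLETE = 0.5


def expected_completions(assignments, orderings):
    """Expected number of completed trials for each candidate ordering.

    `assignments` is an iterable of (ordering, completed) pairs, one per
    participant already in the batch (hypothetical representation).
    """
    expected = {ordering: 0.0 for ordering in orderings}
    for ordering, completed in assignments:
        if ordering not in expected:
            continue  # ignore orderings that are not candidates
        # Completed trials count as 1; incomplete ones count as their
        # assumed probability of completion.
        expected[ordering] += 1.0 if completed else P_COMPLETE_IF_INCOMPLETE
    return expected


def pick_ordering(assignments, orderings):
    """Pick the ordering with the lowest expected number of completions.

    Ties go to the earliest entry in `orderings` (an assumption; the text
    does not specify tie-breaking).
    """
    expected = expected_completions(assignments, orderings)
    return min(orderings, key=lambda ordering: expected[ordering])


# Usage example with three hypothetical orderings of conditions a, b, c.
orderings = [("a", "b", "c"), ("b", "c", "a"), ("c", "a", "b")]
assignments = [
    (("a", "b", "c"), True),   # completed: contributes 1
    (("b", "c", "a"), False),  # in progress: contributes 0.5
    (("b", "c", "a"), False),  # in progress: contributes 0.5
    (("c", "a", "b"), False),  # in progress: contributes 0.5
]
next_ordering = pick_ordering(assignments, orderings)
```

In this example the expected counts are 1.0, 1.0, and 0.5, so the new participant would be assigned the third ordering.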

This algorithm requires knowing, for each participant within the batch, what ordering they were assigned and whether or not they completed. We take the simplest possible approach for both of these: we read the ordering out of the login entry (the first line) of each log, and we create a new file ('{participant_id}.completed') for a participant who completed.
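
A sketch of this file-based bookkeeping follows. Only the '{participant_id}.completed' marker file and the use of the first log line come from the description above. The log file name ('{participant_id}.jsonl'), the JSON encoding of log entries, and the "conditions" field are assumptions made for illustration.

```python
import json
from pathlib import Path


def load_batch_state(log_dir, participant_ids):
    """Return a list of (ordering, completed) pairs for a batch.

    Assumes each log is a JSON-lines file named '{participant_id}.jsonl'
    whose first line is the login entry with a "conditions" list. Both
    the file name and the field name are hypothetical.
    """
    log_dir = Path(log_dir)
    state = []
    for participant_id in participant_ids:
        log_path = log_dir / f"{participant_id}.jsonl"
        if not log_path.exists():
            continue  # participant never connected
        with log_path.open() as f:
            first_line = f.readline()
        if not first_line.strip():
            continue  # empty log: no login entry yet
        login = json.loads(first_line)
        ordering = tuple(login["conditions"])
        # Completion is signalled by the presence of a marker file.
        completed = (log_dir / f"{participant_id}.completed").exists()
        state.append((ordering, completed))
    return state


def mark_completed(log_dir, participant_id):
    """Create the '{participant_id}.completed' marker file."""
    (Path(log_dir) / f"{participant_id}.completed").touch()
```

Using marker files keeps the completion check to a single existence test per participant, at the cost of one extra file each.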

Project based on the cookiecutter data science project template. #cookiecutterdatascience
