the_data.Rd
This data frame contains the pre-processed data used for the bulk of the empirical analyses in Crump, Lai, & Brosowsky. The data was originally collected by Behmer & Crump (2017).
This data frame contains individual keystroke typing data from 346 typists who copy-typed five English paragraphs. It has been filtered to remove keystrokes for capitalized letters.
the_data
A data frame with 1017214 rows and 27 variables:
Subject
integer A unique number coding each subject
ParagraphType
character "N" stands for normal English. This data frame only contains data from typing English paragraphs. The raw data also contains typing data for random letter strings and English-like strings.
IKSIs
integer Interkeystroke interval in milliseconds, the time elapsed between the previous keystroke and the current keystroke.
Letters
character the character typed
PredBigram
character bigram showing the previous character and current character
SuccBigram
character bigram showing the current character and the next character
PredBigramCorrect
character Accuracy codes for the previous letter and current letter, 1 = correct, 0 = typo
SuccBigramCorrect
character Accuracy codes for the current letter and next letter, 1 = correct, 0 = typo
PredTrigram
character trigram for the previous two letters and current letter
SuccTrigram
character trigram for the current letter and next two letters
PredTrigramCorrect
character accuracy codes for previous two letters and current letter, 1 = correct, 0 = typo
SuccTrigramCorrect
character accuracy codes for current letter and next two letters, 1 = correct, 0 = typo
GUT
integer Letter frequencies estimated from the Gutenberg corpus
PredBigramMLE
double Probability of current letter given previous letter, from the Gutenberg corpus
SuccBigramMLE
double Probability of current letter given next letter, from the Gutenberg corpus
PredBigramFreq
integer preceding bigram frequency from the Gutenberg corpus
SuccBigramFreq
integer succeeding bigram frequency from the Gutenberg corpus
PredTrigramMLE
double probability of current letter given preceding trigram
PredTrigramFreq
integer preceding trigram frequency from the Gutenberg corpus
SuccTrigramMLE
double probability of current letter given succeeding trigram
SuccTrigramFreq
integer succeeding trigram frequency from the Gutenberg corpus
let_pos
integer serial position of letter within word
word_lengths
integer number of letters in word
probs
double need to double-check what this is
whole_word
character the current word. Note that any word that begins with a capital letter is displayed with the capital letter removed
word_prob
double need to double-check what this is.
whole_word_acc
character 1 = the whole word was typed correctly, 0 = there was at least one error somewhere in the word
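A minimal sketch of how the columns above might be combined, for example to get mean interkeystroke intervals by letter position and word length for correctly typed words. The toy data frame only borrows column names from this documentation. Its values are invented, and coding whole_word_acc as the strings "1" and "0" is an assumption based on the character type listed above. This is not necessarily the analysis pipeline used by the authors.

```r
# Toy stand-in for the_data: column names follow the documentation above,
# but every value here is made up for illustration.
toy <- data.frame(
  Subject        = c(1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L),
  IKSIs          = c(250L, 120L, 110L, 300L, 280L, 150L, 140L, 900L),
  let_pos        = c(1L, 2L, 3L, 1L, 1L, 2L, 3L, 1L),
  word_lengths   = c(3L, 3L, 3L, 1L, 3L, 3L, 3L, 1L),
  whole_word_acc = c("1", "1", "1", "1", "1", "1", "1", "0"),  # assumed coding
  stringsAsFactors = FALSE
)

# Keep keystrokes from correctly typed words only
correct <- toy[toy$whole_word_acc == "1", ]

# Mean IKSI per subject at each letter position and word length
subject_means <- aggregate(IKSIs ~ Subject + let_pos + word_lengths,
                           data = correct, FUN = mean)

# Average the subject means to get one value per position/length cell
cell_means <- aggregate(IKSIs ~ let_pos + word_lengths,
                        data = subject_means, FUN = mean)
print(cell_means)
```

Averaging within subjects first keeps typists with many keystrokes from dominating the cell means. With the real data, the_data could be substituted for toy once the coding of whole_word_acc has been checked.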