List of resources that I found useful:
NINJAL: web interface for searching Balanced Corpus of Contemporary Written Japanese: BCCWJ
Coca corpus: browser-searchable Corpus of Contemporary America English
English Lexicon Project: English lexical database, good for finding lexical characteristics (frequency, orthographic neighbors, average lexical decision time etc)
Wordnet: English lexical database, good for finding lexical relationships.
International Picture Naming Project at UCSD: object and action pictures, with behavioral data associated with them.
Psychopy: free, flexible, cross-platform stimulus presentation software.
R: free cross-platform data analysis software