Open corpora

Corpus name                                     Language        Size
Croatian Wiki                                   Croatian  14,657,355
ACL Anthology Reference Corpus (ARC)            English   49,348,397
British Academic Spoken English Corpus (BASE)   English    1,252,256
British Academic Written English Corpus (BAWE)  English    8,336,262
BIBLE Polish, Swahili-Polish                    Polish       169,934
Serbian Wiki                                    Serbian   17,806,808
Swedish web corpus[2M]                          Swedish    1,671,318
BIBLE Swahili, Swahili-Polish                   Swahili      169,612