Cantonese pronunciation
If Cantonese is your native tongue: record the pronunciation of a Cantonese word that has no audio yet — your voice is the one piece of data AI can't fake (six+ tones; TTS is genuinely weak here).
The menu · pick something you can do
Lacuna helps anyone — no PhD required — use AI to make a small, real, verifiable contribution to science. Science databases are full of "gaps": facts published in papers but never entered into the canonical database. We find these gaps, fill them from the original sources, double-check them, and submit through official channels. You bring curiosity and judgment; the AI does the heavy lifting.
Everything below is a real gap in a database scientists rely on. We sort them by one thing: how ready each is for you to jump in today.
Self-serve, no approval needed — you could start today.
If Cantonese is your native tongue: record the pronunciation of a Cantonese word that has no audio yet — your voice is the one piece of data AI can't fake (six+ tones; TTS is genuinely weak here).
Find a building registered as a heritage monument but with no photo on Wikidata, go photograph it, and upload — your camera and your presence are the data AI can't fabricate.
The answer already sits in the data, so AI can find and fix it end to end. There's little only a human can add here.
Pick a type of fact missing across thousands of entries (e.g. scientists with no birth date), find the answer in a reliable source, and add it.
Scan classic Greek/Latin texts for markup or metadata errors (e.g. deprecated language codes, transcription slips) and submit a fix.
Real potential, but a channel or a clear task definition we haven't pinned down yet.
Find a fossil species scientists named in a paper but never added to the global fossil database, then pull its dig-site, rock layer, and age from the original paper and complete the record.
Find a star or galaxy whose recent papers were never linked to its database entry, or whose aliases are incomplete, and connect them.
Find species whose names changed (split/merged) in recent papers while the biodiversity database still uses the old classification, and flag it.
Find a protein whose function was newly characterized in a paper but whose database entry wasn't updated, and report the evidence.
Find a compound's measured property or bioactivity published in a paper but missing from the chemistry database, and add it.
Real unknowns and from-scratch documentation. The highest human value, and the hardest game.
(Phase 3 · fieldwork) If Wu/Shanghainese is your native tongue: document vocabulary, pronunciation, usage. Not "add audio to a known word" — it's recording from scratch a mostly-spoken, orthography-less endangered variety.
(Phase 3 · real unknowns) Pick a recognized open problem (e.g. an Erdős conjecture), make a rigorous AI-assisted attempt, and honestly log the process and outcome — solved / partial / dead-end / disproved. What's verified is the attempt's rigor and reproducibility, not acceptance of an answer.