Jianfa Tsai’s Input
“46:18 If you check with scientists, experienced educators, psychiatrists, and psychologists, they will generally agree that simultaneous multi-mode learning yields twice the learning effectiveness with half the effort. This is evidenced by the ultra-wealthy trillion-dollar tech companies (Apple, Google, Amazon, and Kobo) that offer eBooks with synced audio narration (done and paid for by the research for you) or audiobooks. This highlights the limitations of Anki as a heavy visual learning tool that lacks audio narration (imagine student doctors practising Anki hands and visual free by listening and speech feedback to app so they can exercise, do housework and perform activities of daily living – all doctors and other mere mortals have the same 24/7 worth of time), which slows learning progress for student doctors and experienced doctors refreshing their knowledge. Why not provide feedback to Anki to incorporate Gemini AI and Speechify into it, so student doctors can have live audio and visual (simultaneous) conversations with the Anki AI to test themselves? Naturally, there are errors in the training data and in AI capabilities; therefore, the AI reporting or recommendations on the learner’s test should not be taken into consideration (unless you build offline AI into computer desktops with its own server room in the medicine faculty library, where doctors have to move their bag of bones to the library to learn from the desktop PC). However, the productivity gains of multi-visual, audio and tactile channels for learning or self-testing will be noteworthy. Last but not least, doctors are often not good salespeople. They have access to ultra-wealthy and powerful clients or patients, yet little effort is made to influence or persuade the potential investors to invest in upskilling and improving the learning effectiveness and efficiency of medical students and experienced doctors. What could be done? https://youtu.be/_wCPO0B4Pyo?si=fBm7J_YJSMPpiYYg“
ELI5
Learning things by looking and listening at the exact same time helps our brains remember twice as much without working twice as hard, which is why big companies sync words with voices. Right now, medical students use an app called Anki to look at flashcards, but they cannot talk to it while doing chores or exercising, which wastes precious time. By adding clever talking AI tools into flashcards, students could study out loud without needing their hands, while medical schools can find wealthy investors to fund these helpful tools by teaching doctors how to share their vision clearly.
Most Important Point
Integrating multimodal AI audio narration and speech feedback into spatial repetition tools like Anki can double the efficiency of medical training, provided that medical institutions systematically train physicians to effectively persuade high-net-worth donors to fund these pedagogical innovations.
Enhancing Spatial Repetition with Multimodal AI Integration
The use of active recall and spaced repetition via flashcards is highly valued globally by medical students [07:09], yet traditional platforms rely heavily on manual, visual interaction. Research in educational psychology confirms that simultaneous multi-mode learning (combining visual, auditory, and tactile channels) optimizes cognitive load and accelerates data retention.
To overcome these constraints and unlock true hands-free productivity for busy clinicians, developers and institutions should implement specific software enhancements:
- Audio-Visual Synchronization: Developing open-source add-ons or submitting developer feedback to Anki to natively incorporate text-to-speech APIs (such as Speechify) and conversational large language models (such as Gemini AI).
- Hands-Free Auditory Testing: Transitioning flashcard review sessions into live audio-visual dialogues, allowing students to verbally answer cards and receive synchronized speech feedback while performing activities of daily living.
- Air-Gapped Local Infrastructure: To mitigate data privacy concerns and algorithmic hallucinations, medical faculties can establish on-premise, offline AI server environments within library infrastructures, maintaining data integrity during high-stakes testing.
Strategic Philanthropy and Investor Mobilization in Medicine
While medical professionals regularly engage with ultra-wealthy patients, a systemic gap exists in leveraging these networks to fund educational technology. Medical institutions can bridge this gap and secure significant capital for upskilling through targeted structural changes:
- Medical Philanthropy Curriculum: Integrating basic training in venture capital procurement, scientific storytelling, and persuasive communication into the medical curriculum so doctors can advocate for student upskilling.
- Grants and Endowments Realignment: Utilizing existing institutional frameworks—such as undergraduate research grants, medical education funds, and university innovation hubs [05:16], [53:16]—to pitch advanced multimodal learning platforms directly to tech-focused investors and alumni network boards.
Action Steps
- For Personal Life: Maximize daily time management by pairing mechanical tasks (e.g., household chores, exercise) with auditory learning tools or text-to-speech software to mimic multimodal efficiency.
- For Academic Life: Initiate or support student-led developer feedback groups within your university library networks to advocate for the implementation of offline, localized AI tools in research databases and study rooms [02:29].
- For Work Life: Develop concise, impactful project pitches that emphasize long-term societal and productivity gains, ensuring that communication with stakeholders outlines clear educational benefits.
Date
Friday, June 12, 2026 at 9:36 PM AEST
Authors
Jianfa Tsai (https://orcid.org/0009-0006-1809-1686) in collaboration with Gemini AI Pro.
References
NUS Medicine. (2026, June 11). Surviving and thriving in medical school | First Year To First Call Ep 1 [Video]. YouTube. https://youtu.be/_wCPO0B4Pyo?si=fBm7J_YJSMPpiYYg