The COVID-19 pandemic has significantly spurred the development and innovation of the point-of-care testing (POCT) field, highlighting the urgent need for rapid, accurate, and accessible diagnostic tools. Integration of contemporary technologies, such as the Internet of Things (IoT) and speech recognition, can greatly enhance the functionality and usability of POCT devices. Herein, we constructed a voice-controllable molecular diagnostic platform for rapid SARS-CoV-2 diagnosis using reverse transcription loop-mediated isothermal amplification (RT-LAMP) assay. The platform is a complete system comprised of a POCT device, a microfluidic chip, and a web app developed for smartphones. The microfluidic chip, designed with pneumatic microvalves, enables automatic dispensing of the RT-LAMP mixture into four reaction chambers for amplifying and analyzing different target genes of SARS-CoV-2, ensuring accurate diagnostic results. The POCT device has a compact design including modules for valve control, thermal regulation, fluid manipulation, and real-time fluorescence detection. The smartphone web app integrates the speech recognition feature for voice-controllable operations and provides an intuitive interface for displaying and interpreting results in real time. Featuring sophisticated algorithms for real-time dynamic Ct calculation and advanced image processing, the platform detects SARS-CoV-2 with a low limit-of-detection (LOD) of 100 genomic copies/μL within 35 min with high specificity. By leveraging the IoT for wireless data control and transfer, coupled with a speech-recognition feature, the platform offers hands-free operation to streamline the diagnostic process and minimize potential contamination risks while enhancing user convenience.