PRESENTERS

Fall 2025 Dates - Every Tuesday

Sept 16, 23, 30
Oct 7, 14, 21, 28
Nov 4, 11, 18, 25
Dec 2, 9

Colloquia (mandatory for all researchers)

Tuesdays Every Week @ 7:00 PM - 8:30 PM (EVERY WEEK!)

https://us06web.zoom.us/j/83346956991?pwd=STJ1SGFUK1VtMjdNRThLKy9KdHNlZz09

Meeting ID: 833 4695 6991 Passcode: 699214

Check out the latest Colloquia uploaded to our YouTube Channel!

November 18, Colloquia Presenters

Department of Computer Science & Engineering

Using Quantum Neural Networks (QNNs), Quantum Vision Transformers (QVT), and the Mathematical Morphological Reconstruction Algorithm (MMR) for Brain Tumor Detection

Brain tumors affect millions around the world, so detection is critical to helping doctors determine treatment. Currently, radiologists manually identify tumors through MRI (Magnetic Resonance Imaging) scans; however, this poses several limitations: it creates a heavy reliance on the experience of radiologists, has become increasingly costly and time-consuming, and is not as accessible to areas that lack the necessary resources and doctors. With the advancement of deep learning algorithms, a more accessible and efficient solution is possible. Given the existing research in classical Convolutional Neural Networks (CNNs) for tumor detection, Quantum Convolutional Neural Networks (QCNNs) and Quantum Vision Transformers (QVT) offer a promising approach to the problem. Mathematical Morphological Reconstruction (MMR), another image processing method, provides a relative metric for success in the QCNN, and is another classical alternative to CNNs. This research compares the accuracy and computational speed of the MMR, QCNN, QVT, and CNN algorithms to determine whether introducing a quantum aspect presents any noticeable advantage. To build these models, extensive datasets of MRI brain scans were collected. The MMR algorithm involved applying various techniques such as dilation, erosion, and skull stripping through OpenCV2's morphology functions. The QCNN algorithm utilizes quantum power to encode the data into a parametrized quantum circuit and apply convolutional and pooling layers. In terms of future steps, QVTs will be implemented with QCNNs for higher spatial understanding. So far, our results indicate that the MMR algorithm achieved up to 92% accuracy. These results will be compared with the accuracy of the QCNN, QVT, and CNN algorithms.

RESEARCHERS: Ekansh Samanta, Amador Valley High School '27; Riddhi Sharma, Evergreen Valley High School '27; Rutvi Mudalagi, Amador Valley High School '27; Shivansh Grover, Mission San Jose High School '28; Vivaan Sheoran, Leigh High School '28

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: QCNN | QVT | Brain tumor

Department of Computer Science & Engineering

Detection of Surface Level Cyanobacterial Algae in Freshwater Lakes using Cost Effective RGB Autonomous Unmanned Aerial Vehicles

Toxic cyanobacteria pose a significant health risk to humans through the consumption of poisoned fish or shellfish and hinder recreational activities. Harmful algae blooms (HABs) categorized by a deviation of regular algal biomass, develop when toxic or nontoxic cyanobacteria are exposed to a combination of nutrient runoffs and increased water temperature. Algae blooms deplete the lake of oxygen causing areas of hypoxia, killing or harming aquatic animals that require aquatic respiration and killing underwater plants by obstructing sunlight. The rise of algal blooms in frequency and intensity force lake managers and government bodies to constantly monitor algal levels and concentrations in their lakes. We aim to create a cheaper, simple, autonomous solution to monitoring algal surface coverage by using solely an RGB camera and a drone to replace high cost sensors, satellite, and image analysis used in previous studies. Our drone will follow preplanned waypoints over an entire lake while taking pictures at set intervals. It will return to upload the images it took onto a neural network and upload its results to a separate, public algal dashboard. This autonomous solution uses a weaker processor to reduce battery consumption, allowing for longer range with the same battery compared to a real-time detection system. Ultimately, our research contributes to the growing body of autonomous UAVs for algal detection through the creation of a more practical solution, saving lake managers and governmental bodies time, costs, and personnel.

RESEARCHERS: Matthew Chang, The King's Academy '27; Jeremiah Welch, Credo High School '28; Shreya Bansal, Emerald High School '28

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

McMahan Lab - Quantum Computing & Computer Science

Ekansh Samanta, Amador Valley High School '27

Riddhi Sharma, Evergreen Valley High School '27

Rutvi Mudalagi, Amador Valley High School '27

Shivansh Grover, Mission San Jose High School '28

Vivaan Sheoran, Leigh High School '28

McMahan Lab - Quantum Computing & Computer Science

Matthew Chang, The King's Academy '27

Jeremiah Welch, Credo High School '28

Shreya Bansal, Emerald High School '28

November 11, Colloquia Presenters

Department of Biological, Human & Life Sciences

Ensemble Learning Algorithms to Predict Scaffold/Matrix Attachment Regions

Scaffold matrix attachment regions (SMARs) are genomic elements that anchor DNA to the nuclear matrix, organizing chromatin into structural and functional domains. SMARs play a key role in regulation, DNA replication, retroviral integration, and the epithelial-mesenchymal transition. While previous studies have identified various DNA motifs and sequences associated with SMARs, current research on these regions remains limited, with no comprehensive or up-to-date database of SMARs in the human genome. A computational mechanism of identifying SMAR sequences would assist scientists studying metastasis and viral integration, potentially leading to therapeutic implications such as improving gene therapy by enabling the design of episomal expression vectors that use SMAR elements to enhance transgene retention, sustain long-term expression, and prevent genomic silencing. To produce this, we implemented multiple machine learning algorithms, including Random Forest, K-Nearest Neighbors, and XGBoost, trained on 497 DNA sequences from experimentally derived HeLa cell SMARs from the ENCODE project, SMOTE techniques, and inter-SMAR sequences. The Random Forest model achieved an 88.5% accuracy, the XGBoost model achieved an 81.6% accuracy, and the KNN achieved a 60.3% accuracy. A finalized R package will include DNA sequences in addition to other data types impacting chromatin accessibility. Furthermore, the UI tool will allow users to select model training datasets from multiple species. Our final product would increase the accessibility of SMARs-related research, facilitate studies on transcriptional regulation, and advance research on SMARs-related diseases by providing a method to identify novel SMAR loci.

RESEARCHERS: Sathvega Somasundaram, Evergreen Valley High School '26; Adithi Aia, Portola High School '26; Shreya Krishnakumar, Emerald High School '27; Akhilesh Kuppili, Portola High School '28; Dhriti Bharadwaj, Archbishop Mitty High School '28

ADVISOR: Cunha Lab, Bioinformatics and Cancer Biology

KEYWORDS: Ensemble algorithms | Computational Biology | Machine learning | Genetics

Department of Computer Science & Engineering

Designing amyloid-β inhibitor molecules using a hybrid quantum-classical generative model

The modern process of drug discovery and development requires large financial investment and time, which is why researchers are utilizing new computational chemistry methods like machine learning to speed up molecular synthesis pathways. Alzheimer's disease is a neurodegenerative disease associated with the aggregation of the protein amyloid-β (Aβ). The treatment requires molecules that can penetrate the extremely selective blood-brain barrier (BBB) and inhibit Aβ. This architecture utilizes a Wasserstein generative adversarial network (GAN) with a quantum generator and classical discriminator and a gradient penalty. It was trained on the IPAD-DB database of Alzheimer's protein inhibitors, to generate chemically feasible molecules that checks for size, lipophilicity, planarity, aromatic rings, as well as other functional groups critical for determining effective molecules that target amyloid-β. Out of complete generation of 100 molecules, 100% were small, 88% had amine groups, 15% passed BBB permeability checkers, and 87% had low aggregation potential. From docking, 22% were able to interact, with the binding affinity between the generated inhibitors to amyloid-β ranging -2.6 to -4.5, indicating weak binding capability. This approach offers the framework to accelerate the discovery of promising therapeutics for Alzheimer's disease by shortening the amount of time needed in the lab to find a molecule with optimized properties for blood-brain barrier penetration and amyloid beta inhibition.

RESEARCHERS: Leena Adwankar, Irvington High School '27; Nitya Pisolkar, Archbishop Mitty High School '27; Akhil Muthyala, Emerald High School '28

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: Drug Discovery | Computational Chemistry | Machine Learning | Generative Adversarial Network | Alzheimer's Disease

Department of Chemistry, Biochemistry & Physics

Bridging the broad spectrum of chemistry: Amino triester lipids as biodegradable surfactants for drug delivery and precise control of quantum dot formation

The delivery of anionic cargoes, including mRNA, small molecules, aptamers, and other oligonucleotide-based therapeutics, most fundamentally requires the enablement of a mono- or polyanionic payload to be delivered across a lipophilic membrane bilayer. Delivery systems typically rely on bifunctional cationic materials composed of a protonatable amine headgroup that electrostatically complexes with the desired anionic cargo, along with a lipid segment that permeates the hydrophobic lipid bilayer membrane. These materials may complex with anionic targets to form nanoparticles, liposomes, and other macromolecular structures, and these formulations have been previously described to be highly efficacious in mRNA vaccination, gene delivery, siRNA delivery, and small molecular drug delivery. Given this, we synthesized–under nonhazardous, mild conditions, four non-toxic triester lipids. Preliminary investigations demonstrated that these triester lipids do not induce cell lysis, indicating potential for use in larger drug delivery systems. We then complexed the triester lipids with calcium phosphate (CaP) in an attempt to create lipid-coated nanoparticles capable of localized drug delivery. Through a series of cell proliferation and fluorescence imaging assays conducted on lung and colorectal cancer cell lines, we evaluated the ability of our triester lipids and our lipid-CaP conjugates in delivering clinically prevalent cancer therapeutic cargoes across the cell membrane. Lastly, we extended our study into the domain of inorganic chemistry by investigating the usage of our triester lipids as surface ligands in the modulation of quantum dots. The lipids functioned as a stabilizing agent, decreasing the rate of degradation.

RESEARCHERS: Alina Liu, Homestead High School ‘27

KEYWORDS: Synthesis | Physical Organic Chemistry | Chemical Biology | Spectroscopy | Inorganic Chemistry

Cunha Lab - Bioinformatics and Cancer Biology

Sathvega Somasundaram, Evergreen Valley High School '26

Adithi Aia, Portola High School '26

Shreya Krishnakumar, Emerald High School '27

Akhilesh Kuppili, Portola High School '28

Dhriti Bharadwaj, Archbishop Mitty High School '28

McMahan Lab - Quantum Computing & Computer Science

Leena Adwankar, Irvington High School '27

Nitya Pisolkar, Archbishop Mitty High School '27

Akhil Muthyala, Emerald High School '28

Alina Liu, Homestead High School ‘27

November 4, Colloquia Presenters

Department of Computer Science & Engineering

Autonomous Drone-Based Object Detection for Vegetation Hazard Identification and Wildfire Prevention Near Power Lines

In recent years, the frequency and severity of wildfires in the United States have risen sharply, with many traced to damaged or downed power lines. This project presents an autonomous drone-based system designed to assess and mitigate such risks by monitoring vegetation near electrical infrastructure. A custom drone equipped with imaging, GPS, and telemetry components captures aerial video of target areas. The footage is then segmented into still frames and analyzed using a trained image processing model capable of detecting hazardous vegetation. When a potential risk is identified, the system geotags and transmits the data to a centralized database for review by local authorities. By enabling early detection and rapid response, this platform offers an effective approach to reducing wildfire occurrences. Future testing in controlled environments and public parks will validate the system’s accuracy, efficiency, and operational reliability.

RESEARCHERS: Nirupama Balaji, American High School '28; Elizabeth Ashley, Milpitas High School '26

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: Wildfire Prevention | Automated Drones | Image Processing | Power Line Monitoring

Department of Computer Science & Engineering

Using Quantum Neural Networks (QNNs), Quantum Vision Transformers (QVT), and the Mathematical Morphological Reconstruction Algorithm (MMR) for Brain Tumor Detection

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: Deep Learning | Quantum Computing | Computer Vision

McMahan Lab - Quantum Computing & Computer Science

Nirupama Balaji, American High School '28

Elizabeth Ashley, Milpitas High School '26

McMahan Lab - Quantum Computing & Computer Science

Ekansh Samanta, Amador Valley High School '27

Riddhi Sharma, Evergreen Valley High School '27

Rutvi Mudalagi, Amador Valley High School '27

Shivansh Grover, Mission San Jose High School '28

October 28, Colloquia Presenters

Department of Computer Science & Engineering

Simulating BB84 Protocol with Noise in Quantum Key Distribution Systems using IBM Qiskit

Quantum Key Distribution (QKD) provides a novel alternative to Classical Key Distribution in protecting secure communications, but poses challenges in terms of research and widespread adoption. While the resource-intensive nature of QKD setups limit experimental studies to specialized laboratories, we present an end-to-end simulation that models photon generation, transmission, and detection, thereby lowering the barrier to testing hypotheses regarding experimental setups under realistic conditions. In this study, a fiber optic-based simulation is utilized along with a quantum circuit and a detailed noise model within the BB84 protocol to recreate an experimental setup. Our BB84 simulation retained ≈50% of transmitted bits after sifting, matching the experimental expectations. Our model incorporates customizable noise, such as multi-photon emissions and channel disturbances, directly into the key exchange process, allowing for an approach that utilizes simulated fiber optics and theoretical noise models to replicate physical conditions. Our findings support the development of QKD as a scalable solution for securing the quantum internet and future communication networks.

RESEARCHERS: Aditya Das, American High School '26; Sai Sanjay Devi, American High School '28; Samiha Das, Archbishop Mitty High School '28

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: BB84 | Quantum Key Distribution | Cybersecurity

Department of Computer Science & Engineering

Measuring Air Pollution with the use of Unmanned Aerial Vehicles

Greenhouse gases such as CO₂, CH₄, and NOₓ are powerful drivers of climate change and loss of biodiversity despite monitoring networks now available being costly, sparse, and incapable of resolving fine‐scale complexities in emissions. Using small gas sensors on unmanned aerial vehicles (UAVs) with machine‐learning models is thwarted by difficulties in sensor calibration, processing data for flight operations, and adaptive navigation but allows rapid, high‐resolution mappings of pollutant concentrations. We present a UAV system for monitoring pollutants here in which ground‐truths are used to train machine‐learning models for converting raw sensor data to right gas concentrations, adaptive navigation for targeted hotspots in flight operations, and high‐resolution mappings of emissions with a scalable cost‑effective solution over labor‑based monitoring with recommendations for priority work towards mitigation.

RESEARCHERS: Anirudh Rao, American High School '28; Alyssa Kwon, Dougherty Valley High School ‘27; Rohit Pulle, Washington High School '28; Yash Shekhawat, Mission San Jose High School '28

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: Pollution, Autonomous Drones, Artificial Intelligence, Emission Reduction, Pollution Monitoring

Department of Chemistry, Biochemistry & Physics

Antioxidant Activity of Fluorinated Edaravone Analogs for the Treatment of Amyotrophic Lateral Sclerosis

Edaravone is an FDA-approved small-molecule drug used to treat amyotrophic lateral sclerosis (ALS), a progressive neurodegenerative disorder characterized by the degeneration of motor neurons, though its precise mechanism of action remains unclear. Given that the antioxidant activity of aromatic compounds is strongly influenced by the electronic effects of their substituents, we synthesized a library of edaravone analogs, including either a methylated or trifluoromethylated pyrazole bearing a para-substituted phenyl ring. In DPPH and ABTS assays, we found that compounds with electron-withdrawing substituents, particularly the trifluoromethyl and sulfonic acid derivatives, initially exhibited lower radical-quenching activity, though those differences diminished over longer time points. In contrast, in cell-based viability assays with HEK293 and Neuro2a cells under hydrogen peroxide, sodium bromate, and sodium percarbonate-induced oxidative stress, the trifluoromethylated analog demonstrated the strongest protection against oxidative stress, contrasting its performance in cell-free assays. Altogether, this work demonstrates that fluorination can enhance cellular antioxidant performance beyond direct radical scavenging and provides mechanistic insight into how electronic properties affect the antioxidant activity of edaravone.

RESEARCHERS: Selina Xi, Valley Christian High School '26; Rushika Raval, Irvington High School '26

KEYWORDS: Medicinal Chemistry | Chemical Biology | Antioxidants | Amyotrophic Lateral Sclerosis | Edaravone

McMahan Lab - Quantum Computing & Computer Science

Aditya Das, American High School '26

Sai Sanjay Devi, American High School '28

Samiha Das, Archbishop Mitty High School '28

McMahan Lab - Quantum Computing & Computer Science

Anirudh Rao, American High School '28

Alyssa Kwon, Dougherty Valley High School ‘27

Rohit Pulle, Washington High School '28

Yash Shekhawat, Mission San Jose High School '28

Selina Xi, Valley Christian High School '26

Rushika Raval, Irvington High School '26

October 21, Colloquia Presenters

Department of Computer Science & Engineering

Detecting and mitigating errors in quantum computing

Quantum computing has the potential to significantly improve computational tasks. Unfortunately errors due to outside interactions cause the data to be noisy. Our group is focusing on finding a way to decrease these errors by reverting the data back to what it was before the noise. To simulate realistic errors, we used the Qsurface surface code visualizer. We use both Convolutional Neural Networks (CNNs), for preprocessing and feature extracting, and Graph Neural Networks (GNNs), for predicting and correcting errors in the surface codes. By training and evaluating the CNNs and the GNNs, we can increase performance in quantum computing. These changes will allow for major developments in the field and will be a significant contribution to making an accurate quantum structure..

RESEARCHERS: Satvik Dronavalli, Independence High School '26; Akash Singh, BASIS Independent Fremont (Upper School) '26

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: Quantum Computing | Quantum Error Correction | Surface Codes | Graph Neural Networks | Convolutional Neural Networks

Department of Chemistry, Biochemistry & Physics

Antioxidant Activity of Fluorinated Edaravone Analogs for the Treatment of Amyotrophic Lateral Sclerosis

RESEARCHERS: Selina Xi, Valley Christian High School '26; Rushika Raval, Irvington High School '26

KEYWORDS: Medicinal Chemistry | Chemical Biology | Antioxidants | Amyotrophic Lateral Sclerosis | Edaravone

McMahan Lab - Quantum Computing & Computer Science

Satvik Dronavalli, Independence High School '26

Akash Singh, BASIS Independent Fremont '26

Selina Xi, Valley Christian High School '26

Rushika Raval, Irvington High School '26

October 14, Colloquia Presenters

Department of Computer Science & Engineering

Comparative study on three machine learning models in novel autonomous drone-based detection of invasive plant brassica nigra

California spends around $82 million to manage invasive plants each year. We propose a solution to automate the detection of invasive plant species by creating a machine learning model capable of identifying the presence of Brassica nigra—an annual herb which increases wildfire risk and produces chemicals that prevent the germination of native plants—from autonomous drone footage. We tested three different machine learning models for the detection of the invasive plant from our drone footage at different angles and distances. The three models were a Convolutional Neural Network (CNN), Stochastic Gradient Descent Classifier (SGDC), and eXtreme Gradient Boosting (XGBoost). The goal was to find the best model type for this application. We hypothesized that for the detection of invasive plant species from aerial autonomous drone images, a CNN model will outperform SGDC and XGBoost because of its ability to extract spatial features to find complex visual patterns. Additionally, we hypothesize that SGDC will perform better than XGBoost, as our data is linearly separable and SGDC has the ability to do limited feature extraction. Results analyzed by using the values of the heatmap of each model indicate that there is a statistically significant difference between the ability of the three models to find important features with the ANOVA test, achieving a p value of 9.2e-16 at an alpha level of 5%. We conclude that CNNs are the most suitable model for detecting invasive plants from drone footage surpassing the other two models with an accuracy of 99.4%.

RESEARCHERS: Chloe Ho, Basis Independent Silicon Valley '26; Sahiti Pantangi, Washington High School '28

ADVISOR: McMahan Lab, Quantum Computing & Computer Science

KEYWORDS: Invasive plant | Machine learning | Convolutional Neural Network | Autonomous Drone

Department of Chemistry, Biochemistry & Physics

Discovery of A4P1W1, a fluorinated atropisomeric arylisoxazole acrylamide covalent inhibitor for the treatment of cancers

5-methyl-3-phenylisoxazoles are privileged scaffolds in the design of antibiotics, antivirals, and anticancer compounds, previously demonstrating exceptional effectiveness in improving the clinical efficacy of β-lactam antibiotics such as Cloxacillin, Dicloxacillin, and Flucloxacillin, and Influenza inhibitor Nucleozin. Additionally, piperazyl and piperidyl acrylamides were found to be highly effective covalent warheads in the design of several anticancer agents, including the EGFR/BTK inhibitor, Ibrutinib, and more recently KRAS G12C inhibitors AMG-510 and MRTX-849. The collective clinical success of these motifs has inspired a hybridized approach towards the development of 5-methyl-3-arylisoxazoles bearing piperazyl acrylamides for the targeting of a variety of cancers. In the initiation of this campaign, we developed a high throughput mass spectrometry platform for synthetic optimization of a key amide bond formation reaction between various 5-methyl-3-arylisoxazole fragments and piperazine nucleophiles, where increasing equivalences of 4-(Dimethylamino)pyridine suppressed competitive acid anhydride formation, thereby promoting selective acylation of the piperazine nucleophile. This has enabled the preparation of twelve novel arylisoxazole piperazine acrylamide analogs bearing different aryl halogenation patterns and stereo-defined methylated piperazines. To interrogate the effect of more reactive warheads, we prepared four analogous piperazyl chloroacetamides and screened the library against HCT-116, CT-26, Calu-1, HT-29, MDA-MB-231 and HEK-293 cell lines. The chloroacetamides exhibit potent, non-selective cytotoxic properties while our twelve acrylamides were less potent. Among these acrylamides, the 2,6-chloro-fluoro-aryl isoxazole piperazine acrylate analog exhibited potent antiproliferative activity selective for human cancer cells. Collectively these discoveries enable further elaboration of the hit scaffold identified here for further development of selective anticancer agents.

RESEARCHERS: Lutecia Lam, BASIS Independent '28; Ieva Chepurna, San Mateo High School '26

KEYWORDS: Organic Synthesis | Medicinal Chemistry | Chemical Biology

McMahan Lab - Quantum Computing & Computer Science

Chloe Ho, Basis Independent Silicon Valley '26

Sahiti Pantangi, Washington High School '28

Lutecia Lam, BASIS Independent '28

Ieva Chepurna, San Mateo High School '26

October 7, Colloquia Presenters

Department of Computer Science & Engineering

Enhancing Document Search with Keyword-Based Image Retrieval

Searching for text within a PDF is a simple task, with commands such as CTRL-F providing quick searches, but searching for specific images in an academic document can take substantial time and work. While you could scroll through a document and look for an image, that is not the most efficient way to address the problem. In this project, we aim to create a service tool that can allow a reader of a PDF to search for images within a PDF, while only having to provide keywords that detail their search. Extensive evaluations of many different image recognition models and LLMs that build up the backend of this tool were done to provide accuracy of results and efficiency of the service.

RESEARCHERS: Aaron Ely, Basis Independent Fremont '28

ADVISOR: Liu Lab, Software Engineering

Department of Chemistry, Biochemistry & Physics

Evaluation of machine learning models for the classification of optimal coupling agents in amide coupling reactions

Reaction optimization is a very time, resource, and labor intensive process, as the optimal reaction conditions depend highly on substrate identity, and require extensive fine-tuning of synthetic conditions to arrive at the highest-yielding conversions. The multidimensionality of the data makes it suited for an approach involving machine learning, which could predictively identify optimal reaction conditions given a particular substrate feature set. Herein, we report a platform for standardizing and filtering open source reaction data from ORD (Open Reaction Database) and using this machine-readable dataset to train thirteen machine learning models, including linear, tree-based, kernel method, instance based, neural network, and ensemble architectures, in the yield prediction and classification of coupling agents in amide coupling reactions, which comprise a significant percentage of reactions performed in a medicinal chemistry setting. While yield prediction remained a difficult task for our models due to the complexity of our reaction data, our models performed with great accuracy when classifying reactions to their ideal coupling agent category, including carbodiimide-based, uronium salt, and phosphonium salt. To further validate this approach, we deployed our classification models on isoxazole coupling reaction data generated in our lab, and it successfully categorized the reactions by coupling agent type. Our results demonstrate that kernel methods and ensemble-based architectures perform significantly better than other models such as linear or single tree based. Additionally, molecular environment features, captured by XYZ coordinates, three-dimensional features, and Morgan Fingerprints around reactive functional groups, boosted model predictivity more than bulk material properties such as molecular weight, LogP, and SMILES.

RESEARCHERS: Abhinav Chalasani, Mission San Jose High School '26; Aarav Anand, Lynbrook High School '27

KEYWORDS: Amide Coupling Reactions | Reaction Optimization | Coupling Agent Classification | Reaction Yield Prediction | Machine Learning

Liu Lab - Software Engineering

Aaron Ely, Basis Independent Fremont '28

Abhinav Chalasani, Mission San Jose High School '26

Aarav Anand, Lynbrook High School '27

September 30, Colloquia Presenters

Department of Computer Science & Engineering

Evaluating the Impact of Playback Speed on Automatic Speech Recognition System (ASR) Transcription Accuracy

In speech-to-text (STT) systems, transcription efficiency and cost are strongly influenced by the duration of the input audio. Increasing playback speed shortens file length, which reduces both processing time and cost. However, increasing playback speed may come with a cost to accuracy. The metric we used to determine the optimal playback speed for STT models was Word Error Rate (WER), a standard Automatic Speech Recognition (ASR) metric that measures the proportions of substitutions, deletions, and insertions relative to the reference transcription (23). We hypothesized that playback speeds up to 1.25 times would maintain a WER below 0.2 (20%), while higher playback speeds would exceed a WER of 0.3 (30%). We sped up audio files by 4 factors (100%, 125%, 140%, and 150%) using the LibriSpeech dataset and tested them on 4 models: OpenAI Whisper, Microsoft Azure STT, Deepgram, and Google Cloud STT. For each playback speed, we split up our data into 5 Word Per Minute (WPM) bins using their original speed: 140-149, 150-159, 160-169, 170-179, and 180-18, allowing us to determine the cause of changes in the WER. Our results show that the WER remains stable up to 1.25 times for Azure and Deepgram, consistently maintaining a WER of less than 0.2, while Google and Whisper exceeded this threshold at the same speed. At higher playback speeds, all four systems showed a significant degradation in accuracy. These findings suggest that while 1.25 times playback offers a cost-efficient compromise for some systems, the “optimal” threshold varies depending on the model.

RESEARCHERS: Snata Mohanty, Dougherty Valley High School '26

ADVISOR: Liu Lab, Software Engineering

KEYWORDS: Automatic Speech Recognition (ASR), Word Error Rate (WER), Playback Speed, Speech-to-Text (STT), Transcription Accuracy, Model Robustness

Department of Computer Science & Engineering

DeepBERTa: A DeepSMILES Driven BERT Model for Molecular Property Prediction

Molecular machine learning is a field where computer science techniques are applied to solve chemical problems, such as predicting molecular properties or accelerating drug discovery. In recent years, deep learning models—including transformers, graph neural networks (GNNs), and recurrent neural networks (RNNs)—have shown strong performance in many chemical applications. These models typically rely on large molecular datasets (hundreds of millions to billions of compounds) and require a suitable molecular representation to process the input effectively. One of the most widely used representations is SMILES, a linear string notation for molecules. However, a newer variant called DeepSMILES simplifies the syntax and has been shown in recent studies to improve performance in some tasks. Despite the field's shift toward large-scale deep learning, little work has explored training transformer models directly on DeepSMILES. In this project, we introduce DeepBERTa, a DeepSMILES-based transformer built on the established ChemBERTa architecture and attempt to train it on millions of molecules to evaluate whether DeepSMILES can outperform SMILES in certain tasks and paradigms. Initial results suggest that DeepBERTa is comparable to ChemBERTa in a Blood-Brain Barrier Penetration classification task.

RESEARCHERS: Aayush Kothari, Mission San Jose High School '27; Sourish Rikkala, Fair Lawn High School '27

ADVISOR: Akl Lab, Machine Learning for Condensed Matter Physics

KEYWORDS: Molecular Machine Learning | Drug Discovery | DeepSMILES | DeepBERTa | Natural Language Processing

Department of Chemistry, Biochemistry & Physics

Scalable formal synthesis of (R)-(+)-etomoxir without pyrophoric reagents enabled by benchtop NMR

Etomoxir is a covalent inhibitor of CPT1, a transmembrane mitochondrial protein that acts as the rate-limiting enzyme for fatty acid oxidation. This enzyme plays a major role in metabolic diseases such as diabetes, where regulation of fatty acid biosynthesis and β-oxidation kinetics through CPT1 are effective treatments for such diseases. The 4-Cl phenolic ether on (R)-(+)-etomoxir is a key SAR hotspot for enabling isoform selective inhibition of CPT1. Previously reported syntheses either require early installation of a 4-Cl phenolic ether which precludes the potential for late stage aryl substitution, or employ large scale pyrophoric reactions in early synthetic operations which are challenging to scale. We demonstrate the scalability of a new synthetic route to intercept a late-stage allylic alcohol in route to (R)-(+)-etomoxir. Notably, our alternate retrosynthetic disconnection, which proceeds through a catalytic aerobic oxidation and a one-flask tandem aldol condensation- reduction sequence, to install a key allylic methylene, avoids pyrophoric materials such as n-butyllithium. With a scalable synthesis of a key diversifiable intermediate in hand, our laboratory is currently preparing a library of diverse (R)-(+)-etomoxir analogs to more fully interrogate the SAR of the aryl ring in CPT1 inhibitory activity.

RESEARCHERS: Jacqueline Shan, The King's Academy '26; Sophia Bagley, The Harker School '26

KEYWORDS: Formal Synthesis | Catalysis | Spectroscopy

Department of Biological, Human & Life Sciences

Comparative genomics to discover novel relationships between sharks and humans in the context of thymus development

Genomic studies have shown that sharks have mechanisms for greater cancer resistance than humans do. For example, these sharks have multiple overexpressed genes that are potential tumor suppressors, which downregulate cell proliferation in humans, thereby inhibiting oncogenesis. In this study, we compare the genomes, transcriptomes, and proteomes of commonly researched sharks to normal and cancerous human equivalents to determine whether there are any potential homologies between the genomes, which could indicate possible novel relationships. Sharks are an ideal organism to perform comparative genomics analysis on because of their increased cancer resistance from various factors like overexpression of tumor suppressor genes and mechanisms for greater genomic stability (which prevents cancer causing mutations). So far, we have used Ensembl to align tumor suppressor genes of different shark species to pinpoint a smaller region of interest for further analysis. With more specific DNA segments, we plan to use ortholog mapping, which traces genes from different species to a single gene from the most recent common ancestor, and protein-protein interaction networks, which reveal how genes are involved in the progression of disease. Additionally, we plan to use meta-RNAsequence analysis across both the shark and human transcriptomes. We hope to discover novel gene signatures in sharks and humans in colorectal cancer.

RESEARCHERS: Gautam Sharma, Mission San Jose High School '28; Jaanvi Dronamraju, Newark Memorial High School '27

ADVISOR: Cunha Lab, Bioinformatics and Cancer Biology

KEYWORDS: Cancer | DNA | Novel relationship | Genomes

Liu Lab - Software Engineering

Snata Mohanty, Dougherty Valley High School '26

Akl Lab - Machine Learning for Condensed Matter Physics

Aayush Kothari, Mission San Jose High School '27

Sourish Rikkala, Fair Lawn High School '27

Jacqueline Shan, The King's Academy '26

Sophia Bagley, The Harker School '26

Cunha Lab - AI & Machine Learning

Gautam Sharma, Mission San Jose High School '28

Jaanvi Dronamraju, Newark Memorial High School '27

September 23, Colloquia Presenters

Department of Computer Science & Engineering

ASDRP iOS App Development: Status & Roadmap

ASDRP faces challenges in managing its large-scale research program, including scattered information, an overload of calendar invites, and reliance on a costly attendance system. These disconnected tools lead to inconsistent communication and increased manual effort. To address this, we are developing the ASDRP Mobile App as a centralized platform to streamline program management. Tailored content ensures that students and advisors only see information relevant to their department and lab group, reducing noise from excess emails and invites. Key features include a student networking system for showcasing research, a location-based sign-in system to automate campus hour tracking, and an AI chatbot for program inquiries. Developed in Swift with Firebase as the backend, the app currently supports iOS with plans to expand to Android. By unifying ASDRP’s communication, scheduling, and attendance tracking, the app reduces administrative burden, enhances student engagement, and strengthens collaboration across the program.

RESEARCHERS: Grant Hur, The King's Academy PSP '26

ADVISOR: Liu Lab, Software Engineering

KEYWORDS: Mobile App Development | Swift (iOS) | Firebase Backend | Location-Based Sign-In | AI Chatbot

Department of Computer Science & Engineering

Evaluating the capabilities of Large Language Models to give Food Recommendations

Large Language Models (LLMs) are an emerging technology capable of recognition, summarization, translation, prediction, and content generation using extensive datasets. Several studies have explored different applications of LLMs such as translation, education, and healthcare. The purpose of this study is to explore a new area of personalized food and restaurant recommendations. We have created a framework and analyzed the potential of the top 3 of most advanced LLMs to provide reliable, detailed, and creative recommendations for food-related queries. Our finding show that LLMs are capable of outperforming traditional recommendation systems, with GPT scoring the highest overall score, followed by Gemini and DeepSeek (weakest performance). However, these LLMs still possess limitations such as inconsistent location accuracy, vague handling of affordability, and impractical suggestions for convenience. Our results highlight both the strengths and weaknesses of LLMs as restaurant recommendation systems.

RESEARCHERS: Myra Malhotra, Saint Francis High School '26; Vihaan Mittal, American High School '27; Saidhanush Gambhirrao, California High School '28; Soham Jani, Foothill High School '26

ADVISOR: Qin Lab, AI & Machine Learning

KEYWORDS: Large Language Models | Artificial Intelligence | Engineering

Department of Chemistry, Biochemistry & Physics

Anticancer Synthetic Arylsulfonamides with Wnt1-Modulating Activity

The Wnt1/β-catenin signaling pathway plays a vital role in embryonic development, organogenesis, tissue homeostasis, and cell survival, by carefully regulating dynamic homeostasis of the multifunctional protein β-catenin. Disruption of this regulation as a consequence of Axin and APC mutations can lead to abnormal β-catenin accumulation, a known driving factor in the development and progression of several human cancers. Previous studies have identified methyl 3-{[(4-methylphenyl)sulfonyl]amino}benzoate (MSAB) as a selective inhibitor of the Wnt1/β-catenin signaling pathway. To explore the structure-activity relationship on modifying the aniline and sulfonyl phenyl moiety of MSAB and their effects in vitro, we prepared a library of analogs with variously substituted phenyl, alkyl, heterocyclic, and saturated ring systems. Through MTT assays, we observed analogs with the methyl ester derivative showed significantly more activity than their ethyl ester counterparts and both 4-substituted esters exhibited significantly attenuated antiproliferative activity. We also observed that para-substitution of the sulfonyl phenyl moiety exhibited more dose-dependent inhibition of the Wnt1 pathway than their meta-substituted counterparts. Further, through a TCF/LEF-activated luciferase reporter cell assay, the 4-substituted methyl ester analogous to MSAB exhibited slightly reduced Wnt1-inhibitory activity, while 3- and 4-substituted ethyl esters exhibit minimal Wnt1-inhibitory activity. Additionally, we observed that para-substitution of the sulfonyl phenyl moiety exhibited more dose-dependent inhibition of the Wnt1 pathway than their meta-substituted counterparts. This difference in potency might be attributed to several factors that ultimately drive antiproliferative activity, prompting further investigation of these compounds as Wnt1-based antiproliferative agents.

RESEARCHERS: Lavernie Chen, Santa Clara High School '28; Allyson Yu, BASIS '27

KEYWORDS: Organic Synthesis | Arylsulfonamides | Wnt-1/β-catenin | Structure Activity Relationship | Medicinal Chemistry

Department of Biological, Human & Life Sciences

Bispecific antibody for AML therapy

Acute Myeloid Leukemia (AML) is an aggressive hematologic malignancy characterized by the expansion of abnormal myeloid progenitors in the bone marrow and bloodstream. Current therapies—including chemotherapy, FLT3 and IDH inhibitors, and antibody-drug conjugates—are often limited by relapse, toxicity, and poor long-term survival. Immunotherapies such as CAR-T cells show promise but face significant safety and scalability challenges. To address the urgent need for targeted strategies, we engineered bi- and trispecific antibodies designed to improve selectivity for AML cells while enhancing T cell–mediated cytotoxicity.

Our bispecific constructs pair an antigen-binding domain recognizing AML-associated markers (CLL-1 or TIM-3) with a CD3-binding arm that recruits T cells. In vitro assays using HL-60 and THP-1 leukemia cell lines confirmed strong binding affinity and demonstrated potent cytotoxicity against CLL-1⁺ and TIM-3⁺ populations, while sparing normal hematopoietic progenitors. Building on these results, we developed a trispecific antibody incorporating CLL-1, TIM-3, and CD3 recognition. This design exploits co-expression of CLL-1 and TIM-3 on leukemic stem cells, achieving higher potency against double-positive targets while reducing off-tumor effects. Preliminary data confirm that trispecific constructs enhance tumor cell killing, mitigate immune escape, and exhibit reduced toxicity compared to bispecific formats.

Future directions include in vivo efficacy studies and extension of this platform to other tumors such as ovarian cancer. Collectively, our findings highlight trispecific antibodies as a promising next-generation immunotherapy approach for AML, capable of integrating potency, selectivity, and safety into a single molecular design.

RESEARCHERS: TBH

ADVISOR: Wang Lab, Molecular & Cell Biology

Liu Lab - Software Engineering

Grant Hur, The King's Academy PSP '26

Qin Lab - AI & Machine Learning

Larry Xie, Milpitas High School '27

Saahithi Srikanth, Monta Vista High School '27

Kimberly Yashar, The Harker School '26

Gabriela Formanek, Notre Dame High School '26

Seoyeon Kim, Valley Christian High School '26

Lavernie Chen, Santa Clara High School '28; Allyson Yu, BASIS '27

Wang Lab - Molecular & Cell Biology

TBH

September 16, Colloquia Presenters

Department of Computer Science & Engineering

Influence of Chemical Etching on Twin Boundaries Dihedral Angle Measurements

Knowledge on interfacial free energies, or ratio of energies, of metals alloys is one the most sought after parameters in computational materials science and practical metallurgical applications. We propose the usage of an atomic force microscope (AFM) as a tool to evaluate the ratio of the twin boundaries to the surface free energy in copper. 3D printed models of twin boundaries were constructed on an atomic level scale. Heat treatment of "as received" copper samples was performed at 900º C and 800ºC for 1 hour to grow the copper's grains until it was suitable for observations. Metallurgically polished and etched samples were prepared in the ASDRP lab for optical, electron microscopy and AFM evaluations. We will discuss our results and future plans during the presentation.

RESEARCHERS: Larry Xie, Milpitas High School '27; Saahithi Srikanth, Monta Vista High School '27; Kimberly Yashar, The Harker School '26; Gabriela Formanek, Notre Dame High School '26; Seoyeon Kim, Valley Christian High School '26

ADVISOR: Starostina Lab, Materials Science

KEYWORDS: Grains l Twin Boundaries l Twinning Planes l Interfacial Free Energy l Microscopy l Copper l Fcc Metals

Department of Computer Science & Engineering

Toward Determination of Tabor Factor as a Function of Grain Size

Determining the Tabor factor in relation to microstructure and composition could pave the way for the creation and development of an inexpensive, non-destructive method for predicting the tensile properties of bulk materials using localized hardness measurements. This advancement is especially valuable for improving current preventative maintenance procedures and facilitating the upscaling of research and development in industrial settings.To start our research, we acquired CAD models and followed machine ASTM E8 standard tensile testing (TT) procedures for our copper samples. TT was performed at Santa Clara University, and five stress-strain (SS) diagrams were created and analyzed to determine their flow stress. Additionally, grain size was measured on both sides of the sample to be correlated alongside the flow stress. The microstructures and SS data will be shared and discussed in terms of the literature searches.

RESEARCHERS: Averi Mukhopadhyay, American High School '27; Ambar Vig, Los Altos High School '26; Nicholas Wong, Dougherty Valley High School '26; Anay Tailor, Dougherty Valley High School '27; Michael Tzeng, Mission San Jose High School '27; Tyler Buenaventura, Dougherty Valley High School '27

ADVISOR: Starostina Lab, Materials Science

KEYWORDS: Tabor Factor | Microstructure | Tensile Properties | Predictive Maintenance

Department of Computer Science & Engineering

Google Space Accountability Bot

Every semester, our lab faced the recurring challenge of holding students accountable for completing bi-weekly updates in a shared Google Sheet. While the process appeared straightforward, it often resulted in late-night manual reminders, repeated checks of the sheet, and unnecessary back-and-forth messages in Google Space. This manual oversight was inefficient. To address this, we developed an accountability chatbot integrated directly into Google Space, where students were already required to be active. The bot automates reminders, tracks completion, and enforces accountability through a transparent strike system. By embedding the chatbot into the existing communication platform, the system introduces no additional learning curve while ensuring consistent, automated accountability.

RESEARCHERS: Ayush Kansal, Irvington High School '26; Tithi Raval, Irvington High School '26

ADVISOR: Liu Lab, Software Engineering

KEYWORDS: Apps Script | Automation | Chatbot | Google Cloud | Python