Β Emergent Ability and Stability Analysis

Figure 6 shows the experiment results for emergent ability analysis and stability analysis. In Figure 6, (a)-(d) refer to the analysis of emergent abilities, and (c) refers to the evaluation of stability. We proved that single-cell LLMs have emergent abilities but the stability of single-cell LLMs should be improved.Β 

Figure 6. Different comparison groups for emergent ability analysis and stability analysis. (a): The model scale of different methods. (b): Accuracy of LLMs and vanilla NN in Cell-type Annotation task. The dataset here is the Pancreas cross dataset. (c): Accuracy of LLMs and vanilla NN in Cell-type Annotation task. The datasets here are MB Spatial and MCA. (d): Overall score comparison including ResPAN and different settings of scGPT. The dataset here is the human spatial transcriptomic dataset. (e): Different batch correction scores of different models based on changing random seeds (left) and different average classification scores of different models based on changing random seeds (right). The bold black line represents the median value while the length of each box can be interpreted as the variance level.

Emergent Ability Analysis

We discuss the emergent abilities of single-cell LLMs, with scBERT, Geneformer, and scGPT. We considered three scenarios to investigate the emergent abilities:

Stability Analysis