automatic pretraining
by predicting next word
$
\begin{array}{rl}
{\color{red}\equiv} & \mathrm{base\ mode\ (GPT\!-\!3)} \\[0.0ex]
{\color{red}+} & \mathrm{human\ supervised\
finetuning\ (SFT\ model)} \\[0.0ex]
{\color{red}+} & \mathrm{human\ supervised\ reinforement\
learning} \\[0.0ex]
{\color{red}\Rightarrow} & \mathrm{chat\ assistent\ (foundation\ model)}
\end{array}
{\color{red}+} & \mathrm{chain-of-thoughts training} \\[0.0ex]
$