摘要:At present, there are numerous intelligent teaching applications based on various advanced pattern recognition technologies, such as face sign-in, classroom action recognition, student facial expression recognition, and other systems which have been gradually applied to major schools. Speech emotion recognition can analyze the characteristics of current teaching emotions, so as to discover the rules and details of teachers’ grasp of emotions. In order to realize the task of emotion classification in the process of intelligent teaching through machine learning technology, a teaching speech emotion recognition method with multifeature fusion and deep learning is proposed. The proposed method front-end processes the speech spectrum features in a multifeature fusion manner and combines them with an artificial neural network classifier to form the final speech emotion recognition model. First, after speech preprocessing, three features are selected and fused features using a network structure trained by parallel subnetworks. Then, the classifier used is a hybrid neural network classifier combining a convolutional neural network and a recurrent neural network. Finally, 100 open web courses were used to train and test the model. The test results show that the hybrid neural network sound model using multifeature fusion has good teaching speech emotion recognition capability.