A method for unseen emotion class recognition comprises: receiving, with an emotion recognition model, a speech sample to be tested; calculating, with an encoder, a sample embedding to be tested of the speech sample to be tested; calculating a first distance metric between the sample embedding to be tested and a first registered emotion category representation, and a second distance metric between the sample embedding to be tested and a second registered emotion category representation, wherein the second registered emotion category is not included in a plurality of basic emotion categories; and determining an emotion category of the speech sample to be tested according to the first distance metric and the second distance metric.
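
The distance comparison step can be illustrated with a minimal sketch. It assumes the encoder has already produced fixed-length embeddings, that Euclidean distance serves as the distance metric, and that the category with the smallest distance is selected; the abstract specifies none of these choices. The function name `classify_by_distance`, the labels, and the vectors are hypothetical and chosen only for illustration.

```python
import math

def classify_by_distance(test_embedding, registered_representations):
    """Return the registered category nearest to the test embedding.

    Assumption: Euclidean distance and nearest-category selection; the
    source only states that the decision depends on the distance metrics.
    """
    distances = {
        label: math.dist(test_embedding, representation)
        for label, representation in registered_representations.items()
    }
    # Pick the category whose representation has the smallest distance.
    best_label = min(distances, key=distances.get)
    return best_label, distances

# Hypothetical registered emotion category representations.
registered = {
    "angry": [1.0, 0.0, 0.0],      # first registered category (a basic emotion)
    "sarcastic": [0.0, 1.0, 0.0],  # second registered category (not a basic emotion)
}

# Hypothetical sample embedding to be tested, as if produced by the encoder.
test_embedding = [0.1, 0.9, 0.0]

label, distances = classify_by_distance(test_embedding, registered)
print(label, distances)
```

In this sketch, a category outside the basic set is handled the same way as a basic one: it only needs a registered representation, so no retraining of the encoder is implied.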