TypeError ufunc add did not contain a loop with signature matching types

Question

I am creating bag of words representation of the sentence  Then taking the words that exist in the sentence to compare to the file  vectors txt   in order to get their embedding vectors  After getting vectors for each word that exists in the sentence  I am taking average of the vectors of the words in the sentence  This is my code    import nltk import numpy as np from nltk import FreqDist from nltk corpus import brown   news   brown words categories  news    news sents   brown sents categories  news     fdist   FreqDist w lower   for w in news   vocabulary    word for word    in fdist most common 10    num sents   len news sents    def averageEmbeddings sentenceTokens  embeddingLookupTable       listOfEmb        for token in sentenceTokens          embedding   embeddingLookupTable token           listOfEmb append embedding   return sum np asarray listOfEmb     float len listOfEmb    embeddingVectors       with open  D   Embedding  vectors txt   as file       for line in file          key   val    line split          embeddingVectors key    val  for i in range num sents        features          for word in vocabulary           features word    int word in news sents i               print features       print list features values       sentenceTokens       for key  value in features items         if value    1         sentenceTokens append key  sentenceTokens remove          print sentenceTokens          print averageEmbeddings sentenceTokens  embeddingVectors    print features keys       Not sure why  but I get this error   TypeError                                 Traceback  most recent call last   lt ipython-input-4-643ccd012438 gt  in  lt module gt     39     sentenceTokens remove       40     print sentenceTokens  --- gt  41     print averageEmbeddings sentenceTokens  embeddingVectors    42   43 print features keys       lt ipython-input-4-643ccd012438 gt  in averageEmbeddings sentenceTokens  embeddingLookupTable   18         listOfEmb append embedding   19  --- gt  20     return sum np asarray listOfEmb     float len listOfEmb    21   22 embeddingVectors       TypeError  ufunc  add  did not contain a loop with signature matching types dtype   lt U9   dtype   lt U9   dtype   lt U9     P S  Embedding Vector looks like   the 0 011384 0 010512 -0 008450 -0 007628 0 000360 -0 010121 0 004674 -0 000076  of 0 002954 0 004546 0 005513 -0 004026 0 002296 -0 016979 -0 011469 -0 009159  and 0 004691 -0 012989 -0 003122 0 004786 -0 002907 0 000526 -0 006146 -0 003058  one 0 014722 -0 000810 0 003737 -0 001110 -0 011229 0 001577 -0 007403 -0 005355  in -0 001046 -0 008302 0 010973 0 009608 0 009494 -0 008253 0 001744 0 003263    After using np sum I get this error   TypeError                                 Traceback  most recent call last   lt ipython-input-13-8a7edbb9d946 gt  in  lt module gt     40     sentenceTokens remove       41     print sentenceTokens  --- gt  42     print averageEmbeddings sentenceTokens  embeddingVectors    43   44 print features keys       lt ipython-input-13-8a7edbb9d946 gt  in averageEmbeddings sentenceTokens  embeddingLookupTable   18         listOfEmb append embedding   19  --- gt  20     return np sum np asarray listOfEmb     float len listOfEmb    21   22 embeddingVectors       C  Anaconda3 lib site-packages numpy core fromnumeric py in sum a  axis  dtype  out  keepdims     1829     else     1830         return  methods  sum a  axis axis  dtype dtype  - gt  1831                              out out  keepdims keepdims     1832     1833   C  Anaconda3 lib site-packages numpy core  methods py in  sum a  axis  dtype  out  keepdims   30   31 def  sum a  axis None  dtype None  out None  keepdims False   --- gt  32     return umr sum a  axis  dtype  out  keepdims   33   34 def  prod a  axis None  dtype None  out None  keepdims False    TypeError  cannot perform reduce with flexible type

User · Accepted Answer

You have a numpy array of strings  not floats  This is what is meant by dtype   lt U9   -- a little endian encoded unicode string with up to 9 characters   try   return sum np asarray listOfEmb  dtype float     float len listOfEmb     However  you don t need numpy here at all  You can really just do   return sum float embedding  for embedding in listOfEmb    len listOfEmb    Or if you re really set on using numpy   return np asarray listOfEmb  dtype float  mean

[python] TypeError: ufunc 'add' did not contain a loop with signature matching types

Examples related to python

Examples related to python-3.x