Coercing issue when the code is run

I am getting the following error when I run the code. Can you please help?

---

TypeError                                 Traceback (most recent call last)
<ipython-input-17-e97ecc47cb47> in <module>()
     60 #tries using all words as the feature selection mechanism
     61 print 'using all words as features'
---> 62 evaluate_features(make_full_dict)
     63 
     64 #scores words based on chi-squared test to show information gain (http://streamhacker.com/2010/06/16/text-classification-sentiment-analysis-eliminate-low-information-features/)

<ipython-input-17-e97ecc47cb47> in evaluate_features(feature_select)
     14         #http://stackoverflow.com/questions/367155/splitting-a-string-into-words-and-punctuation
     15         #breaks up the sentences into lists of individual words (as selected by the input mechanism) and appends 'pos' or 'neg' after each list
---> 16         with open(RT_POLARITY_POS_FILE, 'r') as posSentences:
     17                 for i in posSentences:
     18                         posWords = re.findall(r"[\w']+|[.,!?;]", i.rstrip())

TypeError: coercing to Unicode: need string or buffer, RDD found
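
A note on what the message seems to point to: `open()` expects a path string, but `RT_POLARITY_POS_FILE` appears to hold a Spark RDD at the time of the call (for example, if it was assigned the result of something like `sc.textFile(...)`; that is an assumption, since the assignment is not shown in the traceback). Below is a minimal sketch of one possible workaround, written in Python 3 syntax. The helpers `iter_lines` and `tokenize` are hypothetical names that are not part of the original code, and the only RDD method assumed is `toLocalIterator()`.

```python
import re


def iter_lines(source):
    """Yield text lines from either a file path or an RDD-like object.

    Hypothetical helper: `source` is assumed to be a path string or an
    object exposing `toLocalIterator()`, as a PySpark RDD does.
    """
    if isinstance(source, str):
        # Plain path: open() works as in the original code.
        with open(source, 'r') as handle:
            for line in handle:
                yield line
    elif hasattr(source, 'toLocalIterator'):
        # RDD-like object: stream its rows to the driver instead of
        # passing the object itself to open().
        for line in source.toLocalIterator():
            yield line
    else:
        raise TypeError('expected a path string or an RDD-like object, '
                        'got %s' % type(source).__name__)


def tokenize(line):
    """Split a sentence into words and punctuation (same regex as the traceback)."""
    return re.findall(r"[\w']+|[.,!?;]", line.rstrip())
```

With this in place, the loop at line 16 of the traceback could become `for i in iter_lines(RT_POLARITY_POS_FILE):` followed by `posWords = tokenize(i)`. If the data is small and lives on the local filesystem, keeping `RT_POLARITY_POS_FILE` as a plain path string and leaving the `open()` call unchanged may be the simpler option.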

