In this guide, we take a look at Natural Language Processing (NLP) in Python. Natural language processing is a very extensive topic, and with it you can build very powerful machine learning models.
This is useful in many areas of most industries; here we apply it to build an SMS spam filter.
Credits to Jose Portilla, creator of the Learning Python for Data Analysis and Visualization course on Udemy.
# conda install nltk  # at the command prompt, or here in Jupyter
# import nltk
# nltk.download()  # to download the necessary corpora (e.g., the stopwords we use below)
import nltk
# read the file in the smsspamcollection folder
with open('smsspamcollection/SMSSpamCollection') as f:
    messages = [line.rstrip() for line in f]
print(len(messages))
# let's check the first few messages in the file
for num, message in enumerate(messages[:10]):
    print(num, message)
    print('\n')
# let's read the spam file into a pandas DataFrame
import pandas as pd
messages = pd.read_csv('smsspamcollection/SMSSpamCollection', sep='\t', names=['labels', 'message'])  # pass names to label the columns
messages.head()
# let's check some statistics about our data
messages.describe()
messages.info()
# let's group by label and check the statistics for each class
messages.groupby('labels').describe()
# let's get the length of each message
messages['length'] = messages['message'].apply(len)
messages.head()
# let's visualize our data
import matplotlib.pyplot as plt
import seaborn as sns
%matplotlib inline
#%matplotlib qt
#check statistics to choose bin size for histogram
messages['length'].describe()
messages['length'].plot(bins=50, kind='hist')
# let's get the message with the longest text and view the text itself
messages[messages['length'] == messages['length'].max()]['message'].iloc[0]
# let's plot histograms of message length by label (notice that spam messages tend to be longer)
messages.hist(column='length', by='labels', bins=50, figsize=(20,8))
# our goal is to convert all the text in the message column of our DataFrame into some form of numerical vector so we can build our model
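Before we build the real thing, here is a tiny toy illustration (my own sketch, not part of the pipeline) of what turning text into a numerical vector means: each text becomes a vector of word counts over a shared vocabulary.
from collections import Counter
toy_texts = ['free prize call now', 'call me when you are free']  # made-up examples
vocabulary = sorted(set(' '.join(toy_texts).split()))
print(vocabulary)
for text in toy_texts:
    counts = Counter(text.split())
    print([counts[word] for word in vocabulary])  # one count vector per text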
import string
# let's try to get rid of punctuation in the message texts, first trying our hand on a small sample text
mess = 'Sample message! Notice: It contains punctuation.'
# let's check the punctuation characters provided by the string module
string.punctuation
nopunc = [char for char in mess if char not in string.punctuation]
nopunc
# now let's join the characters without the punctuation back into whole words
nopunc = ''.join(nopunc)
nopunc
# now let's remove stop words
from nltk.corpus import stopwords
# let's check some stopwords
stopwords.words('english')[0:10]
# now let's split the words in our nopunc message into a list
nopunc = nopunc.split()
nopunc
# now let's remove all stop words from this list
clean_mess = [word for word in nopunc if word.lower() not in stopwords.words('english')]
clean_mess
# let's put the above cleaning steps into a function
def text_process(mess):
    # check characters to see if they are in punctuation
    nopunc = [char for char in mess if char not in string.punctuation]
    # join the characters without the punctuation back into whole words
    nopunc = ''.join(nopunc)
    # now remove all stop words from the resulting list of words
    return [word for word in nopunc.split() if word.lower() not in stopwords.words('english')]
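As a quick sanity check (my addition), calling the function on the sample message should reproduce the cleaned list we built step by step above.
text_process(mess)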
# let's check our real messages
messages.head()
# let's tokenize our messages, that is, convert the text strings into lists of tokens (the words we want)
# let's check a quick example
messages['message'].head(5).apply(text_process)
# let's vectorize our text
from sklearn.feature_extraction.text import CountVectorizer
# let's call the CountVectorizer constructor and pass our function to it as the analyzer
bow_transformer = CountVectorizer(analyzer=text_process)
# fit the vectorizer to the messages (this builds the vocabulary)
bow_transformer.fit(messages['message'])
# let's look at a sample message
message4 = messages['message'][3]
message4
# let's check how message4 has been transformed
bag_of_words_4 = bow_transformer.transform([message4])
print(bag_of_words_4)
From the results above, each row of the sparse output has the form (document index, word index) followed by a count: the number next to the zero inside the parentheses is the index of a word in the vocabulary, and the number outside the parentheses is how many times that word appears in message 4. Each printed row therefore corresponds to one unique word in the message, and a count greater than 1 means that word occurs more than once.
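To make that mapping concrete, here is a small illustrative snippet (not from the original walkthrough) that prints each word in message 4 next to its count:
feature_names = bow_transformer.get_feature_names()
_, word_indices = bag_of_words_4.nonzero()  # column indices of the non-zero entries
for word_index in word_indices:
    print(feature_names[word_index], bag_of_words_4[0, word_index])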
# we can look up the actual words by their indices in the transformer's vocabulary
# (newer scikit-learn versions use get_feature_names_out() instead)
print(bow_transformer.get_feature_names()[9554])
print(bow_transformer.get_feature_names()[4068])
# now that we can see it works, let's apply it to all the messages in our DataFrame
messages_bow = bow_transformer.transform(messages['message'])
# let's check some features of the resulting matrix
print('Shape of sparse matrix: ', messages_bow.shape)
print('Amount of non-zero occurrences: ', messages_bow.nnz)
print('Sparsity : %.2f%%' % (100.0* messages_bow.nnz / (messages_bow.shape[0] * messages_bow.shape[1])))
#fit TfidfTransformer to our messages
from sklearn.feature_extraction.text import TfidfTransformer
tfidf_transformer = TfidfTransformer().fit(messages_bow)
# let's transform a single message and see
tfidf4 = tfidf_transformer.transform(bag_of_words_4)
print(tfidf4)
# let's check the inverse document frequency (IDF) of some words
print(tfidf_transformer.idf_[bow_transformer.vocabulary_['u']])
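As a sketch (assuming scikit-learn's default smooth_idf=True), we can reproduce that number by hand; the smoothed formula is idf(t) = ln((1 + n_documents) / (1 + document_frequency(t))) + 1.
import numpy as np
u_index = bow_transformer.vocabulary_['u']
n_documents = messages_bow.shape[0]
doc_freq = messages_bow[:, u_index].nnz  # number of messages containing 'u'
print(np.log((1 + n_documents) / (1 + doc_freq)) + 1)  # should match the idf_ value above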
# let's transform the entire set of messages
messages_tfidf = tfidf_transformer.transform(messages_bow)
print(messages_tfidf.shape)
#now we are going to train our model
from sklearn.naive_bayes import MultinomialNB
spam_detect_model = MultinomialNB().fit(messages_tfidf, messages['labels'])
# let's check the model's prediction on a single message
print('Predicted :', spam_detect_model.predict(tfidf4)[0])
print('Expected : ', messages['labels'][3])
# let's see how our model performs on the full set of messages
all_predictions = spam_detect_model.predict(messages_tfidf)
print(all_predictions)
# let's evaluate our model using precision and recall
from sklearn.metrics import classification_report
print(classification_report(messages['labels'], all_predictions))
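Precision and recall are derived from the confusion matrix, so as a quick illustration (my addition) we can print it for these in-sample predictions:
from sklearn.metrics import confusion_matrix
# rows are the true labels, columns are the predicted labels
print(confusion_matrix(messages['labels'], all_predictions))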
# in the model training above we validated on the same data we trained on;
# that is not good practice, so let's split our data into training and test sets
from sklearn.model_selection import train_test_split  # sklearn.cross_validation in older scikit-learn versions
msg_train, msg_test, label_train, label_test = train_test_split(messages['message'], messages['labels'], test_size=0.2)
# let's check the sizes of the training and test sets
print(len(msg_train), len(msg_test), len(msg_train) + len(msg_test))
# let's put all our preprocessing and modeling steps into a single pipeline
from sklearn.pipeline import Pipeline
# let's define the steps of our pipeline
pipeline = Pipeline([('bow', CountVectorizer(analyzer=text_process)),
('tfidf',TfidfTransformer()),
('classifier', MultinomialNB())])
# let's fit the pipeline on the training data
pipeline.fit(msg_train, label_train)
# now let's make predictions on our test data
predictions = pipeline.predict(msg_test)
print(classification_report(label_test, predictions))
From the predictions we achieved about 98% precision, 97% recall, and a 97% F1-score. We can still seek to improve on this. 🙂
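As one possible next step (a sketch, not part of the original walkthrough), we could swap a different classifier into the same pipeline and compare the reports, for example a random forest:
from sklearn.ensemble import RandomForestClassifier
pipeline_rf = Pipeline([('bow', CountVectorizer(analyzer=text_process)),
                        ('tfidf', TfidfTransformer()),
                        ('classifier', RandomForestClassifier())])
pipeline_rf.fit(msg_train, label_train)
print(classification_report(label_test, pipeline_rf.predict(msg_test)))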