
Write "the the the the the the"

Result: "No duplicate words found"



Seems reasonable that those types of words are eliminated for the purpose of the tool. Either by having a letter limit or by whitelisting them. Of course there will be duplicates of "the" and "and".


Unfortunately, it's just a letter limit; lower it and the "the"s get highlighted. Support for stop/common words would definitely be a good upgrade.
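For anyone curious what that upgrade might look like, here is a minimal sketch, not the tool's actual code. The function name `find_duplicates`, the `min_length` and `stop_words` parameters, and the tiny `STOP_WORDS` set are all made up for illustration; only the default of 4 comes from this thread.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list for illustration only;
# a real tool would ship a much fuller one.
STOP_WORDS = frozenset({"the", "and", "a", "an", "of", "to", "in", "is", "it"})


def find_duplicates(text, min_length=4, stop_words=frozenset()):
    """Return {word: count} for words that appear more than once.

    Words shorter than min_length, or listed in stop_words, are ignored.
    Matching is case-insensitive.
    """
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(
        w for w in words
        if len(w) >= min_length and w not in stop_words
    )
    return {w: n for w, n in counts.items() if n > 1}


if __name__ == "__main__":
    sample = "the the the the the the"
    print(find_duplicates(sample))                    # length limit hides "the"
    print(find_duplicates(sample, min_length=3))      # lowered limit reports it
    print(find_duplicates(sample, min_length=3, stop_words=STOP_WORDS))
```

With a stop-word list the length limit can be lowered without drowning the output in "the" and "and", which is the point of keeping the two filters separate.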


They're called "stop words".


Perhaps a better model might be to look at word frequency analysis, and highlight words that are used substantially more frequently than in typical English text.
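A rough sketch of that idea, under loud assumptions: the `BASELINE_FREQ` numbers below are placeholders, not real corpus statistics, and `overused_words`, `ratio`, `min_count`, and `DEFAULT_FREQ` are names invented for this example. A real version would load frequencies from an actual word-frequency list.

```python
import re
from collections import Counter

# Placeholder relative frequencies, NOT real corpus statistics.
BASELINE_FREQ = {"the": 0.05, "and": 0.03, "word": 0.0005}
# Assumed frequency for any word missing from the baseline table.
DEFAULT_FREQ = 0.0001


def overused_words(text, baseline=BASELINE_FREQ, ratio=5.0, min_count=2):
    """Return {word: observed/expected} for words used far more than expected.

    A word is flagged when it occurs at least min_count times and its
    relative frequency in the text is at least `ratio` times the baseline.
    """
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return {}
    total = len(words)
    flagged = {}
    for word, count in Counter(words).items():
        observed = count / total
        expected = baseline.get(word, DEFAULT_FREQ)
        if count >= min_count and observed / expected >= ratio:
            flagged[word] = observed / expected
    return flagged
```

One caveat with this approach: on very short inputs almost any repeated word looks overused, so it probably only makes sense beyond a few hundred words, or alongside the simple length limit.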


Tried “Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo” and it highlighted every word. Tried “James while John had had had had had had had had had had had a better effect on the teacher” and it had no issues with it.


You can use the first slider on the toolbar to set the minimum word length; the default is 4. On a phone or smaller screen, use the third button on the toolbar instead.


You might have to lower the minimum word length to 3.



