Twitter Search, Assange and Impure Posted on December 12, 2010 by Anthony 1 Assange Twitter Search Text Julian Assange and WikiLeaks are still in the news this week. After what is arguably the most public security breach in history, Assange has become something of a pariah, with companies such as Amazon, Paypal, EveryDNS (not EasyDNS), the Australian Government and credit cards cutting services. Issues like the politics of freedom of discourse, the internet and who gets to decide where journalism stops and espionage begins have been hotly debated with varying degrees of sophistication all over the internet. Discussion of the discussion has not been as widespread. Visualising Twitter Search Using Twitter search data, visualisations and natural language tools for categorising words by frequency in language and occurrences in the text, it is possible to take a snapshot of discussion topics and sentiment. Impure makes it easy to generate these visualisations quickly and easily for any other trending topic that may develop around a brand. The main limitation is in the amount of data that Impure inports from Twitter. Collating the information elsewhere and importing it as a table is one way of dealing with this restriction. Assange the Outlier The first issue with the data set was the words ‘Julian’ and ‘Assange’. As the Twitter search was on the term ‘Assange’ and both words were rather remarkable, occurred often and were uncommon natural language, they skewed the graph heavily. As they can be assumed, the terms were removed from the text string, and using a logarithmic scale made the visualisation more useable. Occurrences in text by frequency in language Twitter Timeline & User Activity ObjectsOnTime makes it possible to display the tweets sequentially. Adding repetitions by user as a weighting to the visualisation shows which tweets were made by those discussing the subject often, those who were not and what was actually said. Unsuprisingly the most active user in this Twitter search dataset was a bot. Bot Activity The data used by the workspace will change each time it is accessed. As only a small sample is created each time the work space is accessed there can be a lot of variance in activity. This is an interesting demonstration of natural language tools and timelines to see who is saying what and demonstrating novel behaviours online. Explore the Workspace One response to “Twitter Search, Assange and Impure” What Gets Clicked, What Gets Read | Contoleon.com says: July 3, 2011 at 9:55 am […] Twitter Search, Assange and Impure […] Reply Leave a Reply Cancel reply Your email address will not be published. Required fields are marked *Comment Name * Email * Website