Texifter’s most recent historical Twitter prize winners include three from the United States, one from Great Britain, and one from France. Winners receive Enterprise access to DiscoverText for six months, and Sifter credit for up to three historical Twitter days and 200,000 tweets. The following is a snapshot of the most recent winners and their proposed research projects. Diana Ascher PhD student in the Department of Information Studies at UCLA @dianaascher “Helping Companies Streamline Information” Ascher proposes exploring cultural time orientation by analyzing the Twitter feeds from three news organizations to better understand how “information agents’ cultural backgrounds affect corporate information practice,” and specifically how organizations decide what information to share and when. Ascher hopes the research will help businesses streamline their information activity and routines, and help managers understand “how employees decide what’s important and what’s not.” Stephen Barnard Assistant Professor in the Sociology Department at St. Lawrence University @socsavvy “Better Understanding Journalism via Boston Marathon Bombing Twitter Data” Barnard plans to use Sifter to collect and analyze Twitter data about the 2013 Boston Marathon bombings. He will use Twitter’s PowerTrack filters to conduct a detailed search of Tweets that reported on the bombing, and compare the results to the responses from professional and citizen journalists. “I hope to gain a better understanding of the reporting processes and outcomes emerging from both groups,” Barnard writes, adding that he will use the findings to “highlight the structural relations of the emerging journalistic field.” Oliver Haimson PhD Student in the Informatics Department at University of California, Irvine @oliverhaimson “Analyzing Hashtags” Haimson’s plans to use the prize to analyze the hashtags #nymwars and #mynameis, which were used in 2011 and 2014 to critique Google’s and Facebook’s “real name” policies. He plans to evaluate the Twitter data from these two hashtags “using computational linguistics, qualitative coding, and social network analysis.” Omar Jaafor PhD Student in the Department of Operational Research, Applied Statistics and Simulation at University of Technology of Troyes @lmhasher “Developing Algorithms for Social Networks” Jaafor and fellow researchers will use the prize to continue to develop “clustering and anomaly detection algorithms for social networks in a big data environment.” Wasim Ahmed PhD Student in the Health Informatics Research Group at the University of Sheffield’s Information Department @was3210 ” Responding to Infectious Disease Outbreaks” Ahmed will use his prize to “study how users respond to outbreaks on infectious diseases on social media platforms, such as Twitter.” He plans to use his data towards his PhD “Pandemics and epidemics: User reactions on social media and Web 2.0 platforms.” For more information on the Texifter’s social data offer and text analytics tools, please send us an email email@example.com. Better yet, sign up for a free 30-day trial and start collecting your own social data today.
We need beta testers to close out our first year of new product development. Our tool Sifter is used for searching the complete “undeleted” history of Twitter. It allows the user to get a free estimate of the size and cost of a data license. Sifter supports keyword and metadata filters as well as complex Boolean formulations, all powered by the Gnip PowerTrack. We are offering a prize drawing for anyone (not limited to academics) who gets onto Sifter between today December 2, 2014 and the Winter Solstice on December 21, 2014 and completes the following three steps:
- Create a valid Sifter estimate for no more than 3 Twitter days resulting in no more than 100,000 results. Depending on the keyword or hashtags, this may take some trial and error and possibly use of the sampling filter. We’d very much like you to iterate through various free estimate attempts. Once you are logged in, it takes between 30 seconds and 2 minutes to create a valid estimate, depending on the complexity of your rules.
- Tweet about the experience including @texifter in the body of the Tweet.
- Send us a brief email about how Sifter could be improved (firstname.lastname@example.org) and please include a link to your Tweet.
The prize drawing will take place December 22nd. The drawing is limited to one entry per Twitter account. An individual or organization with multiple Twitter accounts can enter multiple times, however, an individual or organization can win only one prize. We will award five equal prizes in December. The winners will be able to get new estimates for up to 3 Twitter days and no more than 200,000 tweets. Texifter will pay the license fees and the winners will have access to the data in gratis DiscoverText Enterprise accounts for 90 days. For more information, please send us an email email@example.com. To get started, register for Sifter or login to your existing account.
This is the first update on Texifter’s effort to support the crowd source review of the FCC’s Open Internet (a.k.a. “Net Neutrality”) public comments. Help review and report on what the public has said about the future of the Internet.
If you are getting ready to code some public comments, please take a look at this introduction to the first experimental coding task.
“We recognize that not everyone may have the requisite technical skills to build visualizations and analyze raw XML data. (Members of the public will, of course, still have the option of reviewing and searching the record via ECFS). However, we’re hoping that those who do have the technical know-how will develop and share these tools for the public to use.”
Texifter has the right tools to allow anyone not versed in raw XML data extraction to search and code this data, then export the results as a CSV file including the relevant metadata. We have loaded the data and started a project using DiscoverText, which was built specifically for crowd source public comment review by US federal agencies.
We invite you to join our collaborative, web-based effort to find substantive comments and visualize what the public said about Net Neutrality. You can work directly with me and others to crowd source the review of the non-duplicate comments, or you can conduct your own parallel project with the same data.
To get involved, sign up for the free trial DiscoverText account and please note in the comment box that you want to work with the FCC data.
You might be interested in these preliminary stats based on what we downloaded yesterday:
- 446,667 items posted to the FCC web site
- 300,172 items after de-duplication
- The largest group of exact duplicates is 105,320 identical items that say:
“Net neutrality is the First Amendment of the Internet, the principle that Internet service providers (ISPs) treat all data equally. As an Internet user, net neutrality is vitally important to me. The FCC should use its Title II authority to protect it. Most Americans have only one choice for truly high speed Internet: their local cable company. This is a political failure, and it is an embarrassment. America deserves competition and choice. Without net neutrality, a bad situation gets even worse. These ISPs will now be able to manipulate our Internet experience by speeding up some services and slowing down others. That kills choice, diversity, and quality. It also causes tremendous economic harm. If ISPs can speed up favored services and slow others, new businesses will no longer be able to rely on a level playing field. When ISPs can slow your site and destroy your business at will, how can any startup attract investors? My friends, family, and I use the Internet for conversation and fun, but also for work and business. When you let ISPs mess with our Internet experience, you are attacking our social lives, our entertainment, and our economic well being. We won’tstand [sic] for it. ISPs are opposing Title II so that they can destroy the FCC’s net neutrality rules in court. This is the same trick they pulled last time. Please, let’s not be fooled again. Title II is the strong, legally sound way to enforce net neutrality. Use it.”
We are making continual improvements to the sifter beta. Our goal is to develop the best possible user interface for Gnip’s PowerTrack filters when searching for historical Twitter data. Version 2 of the historical Twitter filtering system reflects a lot of great input from our early adopters. The work is far from done. This video introduces v2. What we need is your input. How can we make this tool for searching every undeleted tweet in history easier to use?