Texifter - People

Mark J. Hoy - Technical Advisor


Mr. Hoy brings over 22 years of professional IT experience as well as over 30 years of programming hailing from a wide swath of areas including academic research, R&D for the defense industry and the Director of Programming for a graphic design and multimedia firm, and specializing in Information Retrieval, Natural Language Processing and Decision Systems. Mark also was the lead developer for the Coding Analysis Toolkit (CAT) which lead to Texifter's first product, the Public Comment Analysis Toolkit (PCAT) and further development into DiscoverText.

In previous positions, Mark was the co-creator of the ClueWeb09 dataset which is a collection of over 1 billion web pages gathered in January and February of 2009 in support of research on information retrieval and related language technologies. ClueWeb09 was most recently used as the featured dataset for NIST’s Text REtrieval Conference for 2009 and 2010.

Mark also worked as a Senior Programmer for the Lemur Toolkit for the Language Technologies Institute at Carnegie Mellon University. Prior to working on the Lemur Project, Mark was a Software Engineer for Northrop Grumman in the Information and Cyberwarfare Technology Center where he worked as a principal investigator, researcher and developer of tools for distributed mobile agent networks utilizing distributed fusion processing with Dempster-Shafer belief networks. Also while at Northrop Grumman, Mark was the co-inventor and developer of a disparate data query and 3-D visualization tool which lead to a set of analyst tools to profile and extract knowledge from large corpora of text documents.

Mark graduated from Carnegie Mellon University in 2007 with a Masters in Information Technology, Software Design and Management as well as in 2000 with a B.S. in Information and Decision Systems.

http://www.linkedin.com/in/markjhoy