Determination of the defining features of texts written in isolation with a Naive Bayesian Classifier

Emily J. Becker, Dominic Burkart, Judith Mildner, Diana Tamir

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This study seeks to identify differences between textual samples written in isolation and controls. Isolation is the state of deprivation of one's typical level of social interaction and falls into three categories: Prison, seclusion, and isolation. We coded a Naive Bayesian Classifier using the Python package NLTK and ran it with different training to test set ratios and a Leave One Out with authors. The results yielded that accuracy is proportional to training set size. Currently we are analyzing the key features the classifier used to sort the texts and calculating a chance value for the classifier. This is a highly relevant area of study because we hope to elucidate key differences in the thoughts and cognitive states of isolated people, which could predict behavior for socially isolated people.

Original languageEnglish (US)
Title of host publicationISEC 2018 - Proceedings of the 8th IEEE Integrated STEM Education Conference
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages209-210
Number of pages2
ISBN (Electronic)9781538633090
DOIs
StatePublished - Apr 17 2018
Externally publishedYes
Event8th IEEE Integrated STEM Education Conference, ISEC 2018 - Princeton, United States
Duration: Mar 10 2018 → …

Publication series

NameISEC 2018 - Proceedings of the 8th IEEE Integrated STEM Education Conference
Volume2018-January

Other

Other8th IEEE Integrated STEM Education Conference, ISEC 2018
Country/TerritoryUnited States
CityPrinceton
Period3/10/18 → …

All Science Journal Classification (ASJC) codes

  • Biochemistry, Genetics and Molecular Biology (miscellaneous)
  • Chemistry (miscellaneous)
  • Engineering (miscellaneous)
  • Education
  • Computer Science (miscellaneous)

Fingerprint

Dive into the research topics of 'Determination of the defining features of texts written in isolation with a Naive Bayesian Classifier'. Together they form a unique fingerprint.

Cite this