Stylometric Fingerprints and Privacy Behavior in Textual Data

Author : Aylin Caliskan-Islam
Total Pages : 310
Release : 2015
OCLC : 914236972

Book Synopsis: Stylometric Fingerprints and Privacy Behavior in Textual Data by Aylin Caliskan-Islam

Stylometric Fingerprints and Privacy Behavior in Textual Data was written by Aylin Caliskan-Islam and released in 2015, with 310 pages. Book excerpt:

Machine learning and natural language processing can be used to characterize and quantify aspects of human behavior expressed in language. Linguistic features exhibited in any kind of text can be used to study individuals' behavior as well as to identify an author among thousands of authors. Studying aspects of human behavior can be automated by combining machine learning techniques with well-engineered features that represent the behavior of interest. Human behavior analysis can be used to enhance security by detecting malware programmers, malicious users, or abusive multiple account holders in online networks. At the same time, such automated analysis is a serious threat to privacy, especially to the privacy of persons who would like to remain anonymous. Nevertheless, privacy enhancing technologies can be built by first understanding privacy infringing methods in depth and then creating countermeasures.

Authorship attribution through stylometry, the study of writing style, yields accuracy in translated or unconventional text that is as high as the state of the art for authorship attribution in English prose. Applying stylometry to the more structured domain of programming languages is also possible through a robust and principled method introduced in this thesis. Code stylometry is able to de-anonymize thousands of programmers with high accuracy while providing insight into software engineering. Programmer de-anonymization can aid in forensic analysis, resolving plagiarism cases, or copyright investigations. On the other hand, de-anonymizing programmers constitutes a privacy threat for anonymous contributors to open source repositories.

Bridging the gap between natural language processing and machine learning is a powerful step towards designing feature sets that represent aspects of human behavior. Features obtained through natural language processing methods can be used to study the privacy behavior of users in large social networks. Aggregate privacy analysis shows that people with similar privacy behavior appear in clusters. This knowledge can be used to design privacy nudges and effective privacy preserving technologies. Machine learning can be applied to any kind of textual data to automate human behavior extraction at large scale.


Related Books

Stylometric Fingerprints and Privacy Behavior in Textual Data
Language: en
Pages: 310
Authors: Aylin Caliskan-Islam
Categories: Authorship
Type: BOOK - Published: 2015 - Publisher:

Integrating Social Media into Information Systems
Language: en
Pages: 203
Authors: Douglas Yeung, Douglas
Categories: Political Science
Type: BOOK - Published: 2018-01-01 - Publisher: Rand Corporation

This report examines the technical challenges associated with incorporating bulk, automated analysis of social media information into procedures for vetting peo
Authorship Attribution
Language: en
Pages: 116
Authors: Patrick Juola
Categories: Authorship, Disputed
Type: BOOK - Published: 2008 - Publisher: Now Publishers Inc

Authorship Attribution surveys the history and present state of the discipline, presenting some comparative results where available. It also provides a theoreti
Big Data Analytics
Language: en
Pages: 350
Authors: Ladjel Bellatreche
Categories: Computers
Type: BOOK - Published: 2021-01-02 - Publisher: Springer Nature

This book constitutes the proceedings of the 8th International Conference on Big Data Analytics, BDA 2020, which took place during December 15-18, 2020, in Sone
International Conference on Mobile Computing and Sustainable Informatics
Language: en
Pages: 845
Authors: Jennifer S. Raj
Categories: Technology & Engineering
Type: BOOK - Published: 2020-11-30 - Publisher: Springer Nature

Sustainability and mobile computing embrace a wide range of Information and Communication Technologies [ICT] in recent times. This book focuses more on the rec