Skip to main content

SOCI 5260/6500: Text Analysis

Summer II 2014

M,T,W,Th, 12-1:50pm, Wooten Hal 116, July 7-August 8, 2014

Professor Gabe Ignatow
gignatow@gmail.com

Although this course has a room assigned, it is online-only. We will communicate by email, supplemented by several in-person meetings.

Course description: Social media sites generate massive volumes of natural language data that are available for social science research, and social scientists have developed a number of new technologies for analyzing this data. Researchers are scaling up traditional research techniques to take advantage of new sources of textual data, as well as developing new methods along with new theoretical and metatheoretical frameworks and approaches to research ethics. This course provides a practical guide to contemporary text mining and analysis for the social sciences, covering both qualitative and quantitative text analytic research methods. Our focus in this course is mainly on sociological text analysis methods, including computer-assisted qualitative methods, semantic text analysis methods, and topic models.

Requirements:
1) Completion of weekly assignments (see below)
2) Completion of 10-page final paper

Final paper requirements: 

The final paper can be a proposal for a text mining and analysis project, a completed text mining and analysis project, or somewhere in between. For all final papers, students must collect their own data and explain and justify their sampling strategy. For CAQDAS projects, students must develop a coding scheme and apply it to a sub-sample of the larger text sample. For projects using more highly automated methods, students must review relevant text analysis methods and propose a strategy that can yield results relevant to the research question.

10 pages inclusive of full references, 12-pt font, double-spaced


WEEK 1: INTRODUCTION AND TEXT MINING




Assignments: send by email to gignatow@gmail.com by 12pm Friday July 11
1) Propose one or more research questions that could be approached with text analysis methods
2) Identify 3 or more possible data sources, including newspaper archives, historical archives, social media platforms, websites, or research databases.

(15 points)

WEEK 2: TEXT MINING AND CAQDAS

1. Text Mining


Text mining packages (free) (check YouTube for tutorials)


2. CAQDAS




Free trials of CAQDAS packages (check YouTube for tutorials)


Assignments: send by email to gignatow@gmail.com by 12pm Friday July 18
1) Scrape or otherwise create a text sample of at least 5000 words. Describe the sample and how you collected it.
2) Write a 1-2-page memo describing possible coding schemes you will use on your data.

(15 points)

WEEK 3: SEQUENCE ANALYSIS METHODS
Franzosi 1987 From Words to Numbers
Franzosi 1998 Narrative Analysis

Assignments: send by email to gignatow@gmail.com by 12pm Friday July25
1) Write 1-2-page reviews of two of this week's articles
2) Write a 1-page update of your progress on your final paper

(10 points)

WEEK 4: SEMANTIC AND SENTIMENT ANALYSIS

Bail 2012 The Fringe Effect

Assignments: send by email to gignatow@gmail.com by 12pm Friday Aug 1
1) Write 1-2-page reviews of two of this week's articles
2) Write a 1-page update of your progress on your final paper

(10 points)

WEEK 5: TOPIC MODELS

August 4 Mohr and Bogdanov 2013 Topic Models--What They Are and Why They Matter
August 5-6 Mohr, Wagner-Pacifici, Breiger and Bogdanov Graphing the Grammar of Motives in National Security Strategy

Assignments:
Email presentations to gignatow@gmail.com and ignatow@unt.edu by 5pm August 7 (10 points)
Final paper due by email by 12pm Friday August 8 (40 points)

Popular posts from this blog

Jurgen Habermas "The Uncoupling of System and Lifeworld"

The Uncoupling of System and Lifeworld Jiirgen Habermas The provisional concept of society proposed here is radically different in one respectfromthe Parsonianconcept:thematureParsons rein terpretedthestruc¬tural components of the lifcworld -culture, society, perso nality -as action systems constituting environments for one another. Without much ado, he subsumed the concept of the lifeworld gained from an action-theoretical perspective under systems -theoretical concepts. As we shall see below, the structuralcomponentsofthe lifeworldbecomesubsystems of ageneralsystem of action, to 'which the physical substratum of the lifeworld is reckoned along with the "behavior system." The p roposal That I am advancing here, by contrast, attempts to take into account the methodological differences between the internalist and the externalist viewpoints connected with the two conceptual strategies . From the participant p erspective of members of a Iifeworld it looks as if sociologywith

Intro Theory Make-up Exam

Students wishing to take the make-up exam for midterm 2 will meet at my office, Chilton 397 in the sociology department, at 3:30pm this Thursday, November 29. The exam will be short-essay format, and will be based on the same review sheet used for the regular midterm 2. This will be the only chance for a make-up.

SOCI 6203/5200 Text Mining (8-week online course, summer 2020)

SOCI 6203/5200 Text Mining Semester year Professor Gabe Ignatow ignatow@unt.edu  Start date-end date Overview: This is a graduate seminar on contemporary text mining and text analysis methods for the social sciences. We will cover principles of research design and research ethics as they apply to text-based social science research, and will review the major methodologies within social science text mining, including topic models and opinion mining. Course Objectives: Our goals for the course are to survey major contemporary approaches to social science text mining and for students to develop a preliminary text mining research project of their own. Prerequisites: None. However, experience with social science research methods and research design is preferred. Minimum Technology Requirements and Skills to Function in the Course: Basic reading, writing and computer skills, including the ability to access and search research databases and to download and lear