CS 79995-006-12569 & CS 69995-006-12554

ST: Probabilistic Data Management

Fall 2016

 

Instructor: Xiang Lian

Office Location: Mathematics and Computer Science Building, Room 264

Office Phone Number: (330) 672-9063

Web: http://www.cs.kent.edu/~xlian/index.html

Email: xlian@kent.edu

Course: ST: Probabilistic Data Management

Prerequisites: Permission of the instructor

Time: 11:00am - 12:15pm, MW

Classroom Location: White Hall (WTH) 00107

Course Webpage: http://www.cs.kent.edu/~xlian/2016Spring_CS79995/ST_Probabilistic_Data_Management.html

 

Instructor's Office Hours: TBA; or by appointment

 

Graduate Assistant: N/A

Office: N/A

E-mail: N/A

Phone: N/A

TA's Office Hours: N/A


Enrollment/Official Registration of this Class

The official registration deadline for this course is 09/04/2016. University policy requires all students to be officially registered in each class they are attending. Students who are not officially registered for a course by published deadlines should not be attending classes and will not receive credit or a grade for the course. Each student must confirm enrollment by checking his/her class schedule (using Student Tools in FlashLine) prior to the deadline indicated. Registration errors must be corrected prior to the deadline.

 

For registration deadlines, enter the requested information for a Detailed Class Search from the Schedule of Classes Search found at:

https://keys.kent.edu:44220/ePROD/bwlkffcs.P_AdvUnsecureCrseSearch?term_in=201680

 

After locating your course/section, click on the Registration Deadlines link on the far right side of the listing.

 

Last day to withdraw: 11/06/2016

 


Resources of Reading Materials

Research papers/surveys from database conferences/journals (SIGMOD, PVLDB, ICDE, TODS, VLDBJ, and TKDE).

 

o   A reading list will appear here later J

§  Indexing on Probabilistic Data

§  Probabilistic Query Types

§  Probabilistic Graphs

§  Big Uncertain Data (parallel processing)

§  Parallel Probabilistic Graph Processing

§  Probabilistic Data Management in Real Applications (e.g., sensor networks, road networks, social networks, bioinformatics, Semantic Web, etc.)

 


 

Catalog Description

The purpose of this course is to learn the fundamental concepts and techniques for probabilistic data management in the area of databases. Probabilistic data are pervasive in many real-world applications, such as sensor networks, GPS system, location-based services, mobile computing, multimedia databases, data extraction/integration, trajectory data analysis, Semantic Web, privacy preserving, and so on. It is rather challenging how to efficiently and effectively manage these large-scale probabilistic data. In this class, we will cover major research topics such as probabilistic/uncertain data model, probabilistic queries, probabilistic query answering techniques, data quality issues in databases, and so on. Students are expected to do a survey on a selected research direction for papers from recent database journals/conferences, and write research papers or reports with new problems or solutions. Students will also give presentations to the class to demonstrate their outcomes. It is also expected that the resulting surveys/papers can be extended to database conference/journal papers.

Learning Outcomes

At the end of this course, the students should be able to:

1.      Explain real applications of probabilistic and uncertain data management in databases.

2.      Know the classifications of data uncertainties according to different criteria.

3.      Explain the causes and importance of studying probabilistic data management.

4.      Describe data uncertainty models, possible worlds semantics, correlations in probabilistic data, and probabilistic graph models.

5.      Know various types of probabilistic queries in probabilistic/uncertain databases.

6.      Describe the models, problem definitions, and the proposed techniques for each probabilistic query type in the literature.

7.      Learn to read/write research papers, and understand the general trend of the research in probabilistic data management.

8.      Summarize and analyze the pros and cons of existing works in probabilistic/uncertain databases.

9.      Identify one or two future directions in probabilistic databases, which have not been studied before, or not been extensively studied before, to work on.

10.  Write a survey on related works of probabilistic data management.

11.  Propose new solutions to existing problems or novel solutions to new problems in probabilistic and uncertain data management.

12.  Write a research report or research project/paper on the proposed problems or solutions.

13.  Do experiments on the proposed ideas in probabilistic data management.

14.  Give two presentations on the survey and report, respectively, to show off the outcome of the research project/paper.

15.  Work in a team (each with at most 3 members) to collaboratively write the survey and research papers.

 


 

Tentative Schedule

Week

Topic

Notes1

Week 1 (Aug. 29)

Introduction

 

An Overview of Probabilistic Data Management

Please form study groups, each with at most 3 members, and send your IDs, names, and emails to me (xlian@kent.edu); Due on Sept. 14

Week 1 (Aug. 31)

Data Uncertainty Model

 

Week 2 (Sept. 5)

--

Labor Day; No classes

Week 2 (Sept. 7)

Probabilistic Query Answering Over Probabilistic/Uncertain Databases

 

Week 3 (Sept. 12)

 

 

Week 3 (Sept. 14)

Probabilistic Graph Databases

 

Week 4 (Sept. 19)

 

 

Week 4 (Sept. 21)

Data Quality in Probabilistic Databases

 

Week 5 (Sept. 26)

 

Presentations for Selected Research Papers

Week 5 (Sept. 28)

 

 

Week 6 (Oct. 3)

 

 

Week 6 (Oct. 5)

 

 

Week 7 (Oct. 10)

 

 

Week 7 (Oct. 12)

 

Deadline for submitting the survey (Oct. 12)

Week 8 (Oct. 17)

 

 

Week 8 (Oct. 19)

 

 

Week 9 (Oct. 24)

 

 

Week 9 (Oct. 26)

 

 

Week 10 (Oct. 31)

 

 

Week 10 (Nov. 2)

 

Presentations for Project Report

 

Last Day to Withdraw: 11/06/2016

Week 11 (Nov. 7)

 

 

Week 11 (Nov. 9)

 

 

Week 12 (Nov. 14)

 

 

Week 12 (Nov. 16)

 

 

Week 13 (Nov. 21)

 

 

Week 13 (Nov. 23)

--

Nov. 23-27, Thanksgiving Recess; No classes

Week 14 (Nov. 28)

 

 

Week 14 (Nov. 30)

 

 

Week 15 (Dec. 5)

 

 

Week 15 (Dec. 7)

 

 

Week 16 (Dec. 12-18)

Final Exam (& Presentation)

Deadline for submitting the research paper (Dec. 12)

 

Academic calendar: https://www.kent.edu/sites/default/files/academic-calendar-2014-2018_0.pdf

Final exam schedule: http://www.kent.edu/registrar/fall-final-exam-schedule

NOTE: Presentation dates and deadlines are tentative. Exact dates will be announced in class!!!


Scoring and Grading

5% - Attendance & Questions

60% - Research Projects/Papers

o   Survey on papers for the selected research topics in recent database conferences/journals (25%)

o   Code and report for the research project in paper format (including introduction, related works, problem definition, solutions, experiments, and conclusions; 35%)

40% - Presentations

o   Presentation for 1-2 related works in the selected research direction (20%)

o   Presentation and demonstration for the proposed research project (20%)

5% - Bonus Points, rated by other team members

A = 90 or higher

B = 80 - 89

C = 70 - 79

D = 60 - 69

F = <60

 


 

Guidelines for Surveys/Papers/Projects

 

All surveys/papers/projects will be submitted electronically only. Instructions are given separately.

 

Ø  Assignments must be submitted to Blackboard by the due date.

Ø  A survey or paper report turned in within two weeks after the due date will be considered late and will lose 30% of its grade (10% for the first week, and 20% more for the second week).

Ø  No assignment will be accepted for grading after two weeks late.

Ø  The late submission needs prior consent of the instructor.


Lecture Attendance Policy

Attendance in the lecture is mandatory. Students are expected to attend lectures, study the text, and contribute to discussions. You need to write your name on attendance sheets throughout the course, so please attend every lecture.

Students are expected to attend all scheduled classes and may be dropped from the course for excessive absences. Legitimate reasons for an "excused" absence include, but are not limited to, illness and injury, disability-related concerns, military service, death in the immediate family, religious observance, academic field trips, and participation in an approved concert or athletic event, and direct participation in university disciplinary hearings.

Even though any absence can potentially interfere with the planned development of a course, and the student bears the responsibility for fulfilling all course requirements in a timely and responsible manner, instructors will, without prejudice, provide students returning to class after a legitimate absence with appropriate assistance and counsel about completing missed assignments and class material. Neither academic departments nor individual faculty members are required to waive essential or fundamental academic requirements of a course to accommodate student absences. However, each circumstance will be reviewed on a case-by-case basis.

For more details, please refer to University policy 3-01.2: http://www.kent.edu/policyreg/administrative-policy-regarding-class-attendance-and-class-absence.


Make-up Presentation Policy

No make-up presentation will be given except for university sanctioned excused absences. If you miss a presentation (for a good reason), it is your responsibility to contact me before the presentation, or soon after the presentation as possible.


Academic Dishonesty Policy

The University expects a student to maintain a high standard of individual honor in his/her scholastic work. Unless otherwise required, each student is expected to complete his or her assignment individually and independently (even in the team, workload should be distributed to team members to accomplish individually). Although it is encouraged to study together, the work handed in for grading by each student is expected to be his or her own. Any form of academic dishonesty will be strictly forbidden and will be punished to the maximum extent. Copying an assignment from another student (team) in this class or obtaining a solution from some other source will lead to an automatic failure for this course and to a disciplinary action. Allowing another student to copy one's work will be treated as an act of academic dishonesty, leading to the same penalty as copying.

University policy 3-01.8 deals with the problem of academic dishonesty, cheating, and plagiarism. None of these will be tolerated in this class. The sanctions provided in this policy will be used to deal with any violations. If you have any questions, please read the policy at http://www.kent.edu/policyreg/administrative-policy-regarding-student-cheating-and-plagiarism and/or ask.


Students with Disabilities

University policy 3-01.3 requires that students with disabilities be provided reasonable accommodations to ensure their equal access to course content.  If you have a documented disability and require accommodations, please contact the instructor at the beginning of the semester to make arrangements for necessary classroom adjustments. Please note, you must first verify your eligibility for these through Student Accessibility Services (contact 330-672-3391 or visit www.kent.edu/sas for more information on registration procedures).


Statements for the Course

This course may be used to satisfy the University Diversity requirement. Diversity courses provide opportunities for students to learn about such matters as the history, culture, values and notable achievements of people other than those of their own national origin, ethnicity, religion, sexual orientation, age, gender, physical and mental ability, and social class. Diversity courses also provide opportunities to examine problems and issues that may arise from differences, and opportunities to learn how to deal constructively with them.

 

This course may be used to satisfy the Writing Intensive Course (WIC) requirement. The purpose of a writing-intensive course is to assist students in becoming effective writers within their major discipline. A WIC requires a substantial amount of writing, provides opportunities for guided revision, and focuses on writing forms and standards used in the professional life of the discipline.

 

This course may be used to fulfill the university's Experiential Learning Requirement (ELR) which provides students with the opportunity to initiate lifelong learning through the development and application of academic knowledge and skills in new or different settings. Experiential learning can occur through civic engagement, creative and artistic activities, practical experiences, research, and study abroad/away.

 


 

Disclaimer

The instructor reserves the right to alter this syllabus as necessary.