University of Konstanz
Graduiertenkolleg / PhD Program
Computer and Information Science

Nafees Ur Rehman

Doctoral Student in the PhD program since 01.11.2010.


1. Prof. Dr. Marc Scholl

organisational data

Room: E221
Tel.: +49 (0)7531 / 88-2188
E-mail: nafees.rehman"at"
Other Resources: Personal Page

project description

The explosion of social network activity in recent years has led to the generation of massive volumes of user-related data, such as status updates, messages, blog and forum entries, recommendations, and connection requests and suggestions. It has also given birth to novel analysis areas, such as social media analysis and social network analysis. This phenomenon can be viewed as part of the Big Data challenge, which is to cope with the rising flood of digital data from many sources, including mobile phones, the internet, videos, e-mails, and social network communication. The generated content is heterogeneous and encompasses textual, numeric, and multimedia data. Companies and institutions worldwide expect to gain valuable insights from big data and hope to improve their marketing, customer services and public relations with the help of the acquired knowledge.

The established data warehousing technology with On-Line Analytical Processing (OLAP) and data mining (DM) functionality is known for its universality and high performance, but also for its rigidity and its limitations when it comes to semi-structured, unstructured or complex data. Various solutions have been proposed in theory and practice for warehousing and analyzing heterogeneous data. One class of solutions focuses on extending the capabilities of the predominant technologies, i.e., relational and multidimensional databases. Our approach is based on (1) discovering facts, dimensions and hierarchies from semi-structured and unstructured data, (2) enriching the outcome by exploiting text mining techniques (e.g., entity detection, language detection, sentiment analysis), and (3) extending the obtained structures via content-driven discovery of additional data characteristics. The benefit of obtaining a properly structured and consolidated dataset lies in the ability to use standard tools for data analysis, visualization, and mining to perform a variety of analysis tasks. This work enables OLAP for social media analysis and adds a new social dimension to the existing business data in the warehouse, which allows new and potentially useful insights into the data.
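
The three steps above can be illustrated with a minimal sketch. It is not the project's implementation: the post structure, the field names, the tiny sentiment lexicon and the helper names (toy_sentiment, to_fact, roll_up) are all assumptions made for illustration. Each semi-structured post is flattened into a fact row with dimension values, enriched with a toy sentiment score, and then aggregated along chosen dimensions in the manner of an OLAP roll-up.

```python
from collections import defaultdict

# Hypothetical semi-structured posts; all field names are illustrative assumptions.
POSTS = [
    {"id": 1, "user": {"name": "alice", "country": "DE"},
     "created": "2012-03-01", "text": "great phone", "lang": "en"},
    {"id": 2, "user": {"name": "bob", "country": "DE"},
     "created": "2012-03-01", "text": "bad service"},  # optional "lang" field is missing
    {"id": 3, "user": {"name": "alice", "country": "DE"},
     "created": "2012-04-15", "text": "good camera", "lang": "en"},
]

# Toy lexicon standing in for a real sentiment analysis component.
POSITIVE = {"great", "good"}
NEGATIVE = {"bad"}


def toy_sentiment(text):
    """Score a text as (positive word hits) minus (negative word hits)."""
    words = text.lower().split()
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)


def to_fact(post):
    """Flatten one post into a fact row: dimension values plus measures."""
    year, month, _day = post["created"].split("-")
    return {
        # dimensions (the time dimension has the hierarchy month -> year)
        "user": post["user"]["name"],
        "country": post["user"].get("country", "unknown"),
        "year": year,
        "month": f"{year}-{month}",
        "lang": post.get("lang", "unknown"),
        # measures
        "post_count": 1,
        "sentiment": toy_sentiment(post["text"]),
    }


def roll_up(facts, dims, measure):
    """Sum a measure over the given dimension levels."""
    totals = defaultdict(int)
    for fact in facts:
        totals[tuple(fact[d] for d in dims)] += fact[measure]
    return dict(totals)


FACTS = [to_fact(p) for p in POSTS]

if __name__ == "__main__":
    print(roll_up(FACTS, ["month"], "sentiment"))
    print(roll_up(FACTS, ["country", "year"], "post_count"))
```

Once the posts are in this shape, the same rows could be loaded into a relational fact table and queried with standard OLAP tools. A real pipeline would replace toy_sentiment with an actual text mining component.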


The following list of publications covers only those that were published during participation in the Graduiertenkolleg / PhD program.


curriculum vitae

Since 03/2010 Research member, Database and Information System, University of Konstanz.
09/2007 - 02/2010 HiWi (student research assistant), Database and Information System, University of Konstanz.
06/2007 - 06/2009 Lecturer of Computer Science at Institute of Management Sciences.
01/2005 - 06/2006 Master of Science in IT, Institute of Management Sciences.
01/2001 - 12/2002 Masters in Information Technology, CECOS University.
01/1998 - 12/2000 Bachelors in Computer Science (BCS), Al-Khair University.