Projects
Title: Towards Trustworthy Database Systems
Abstract: Answers to database queries often form the basis for
critical decision-making. To improve efficiency and reliability, answers to
these queries can be provided by distributed servers close to the querying
clients. However, because of the servers' ubiquity, the logistics associated
with fully securing them may be prohibitive; moreover, when the servers are run
by third parties, the clients may not trust them as much as they trust the
original data owners. Thus, the authenticity of the answers provided by servers
in response to clients' queries must be verifiable by the clients. More
generally, database responses are more useful if they contain the evidence of
their own correctness. For example, this enables a consumer to provide her own
credit report to a creditor without having the creditor request it from the
reporting agency to establish the validity of the report. This project is
developing methods for authenticating the validity and authenticity of a
variety of database queries, including general relational, data cube, and
spatio-temporal queries. Furthermore, techniques that use powerful
cryptographic primitives are being developed for providing authentication and
confidentiality. This research will enable utilization of this infrastructure
in applications where users must rely on the authenticity of the answer, such
as in financial systems, network monitoring, traffic control, or applications
yet to be imagined. The results of this project will be disseminated through
publications in journals and conferences. Furthermore, source code of these
methods, in the form of libraries, will be made available over the web.
Funding: Funded by NSF Cyber Trust Program under the project
"CT-ISG:
Collaborative Research: Towards Trustworthy Database Systems ", PI,
Feifei Li, 10/01/08-09/30/11, $150,620.
Project Website: please follow this link.
Title: Efficiency Ranking and Aggregate Query Processing for Probabilistic Data
Abstract: When dealing with massive quantities of data, ranking and aggregate queries are powerful techniques for focusing attention on the most important answers. Many applications that produce such massive quantities of data inherently introduce uncertainty in the same time, for example, probabilistic match in data integration, imprecise measurements from sensors, fuzzy duplicates in data cleaning, inconsistency in scientific data. Hence, the importance of these queries is even greater in probabilistic data, where a relation can encode exponentially many possible worlds. Uncertainty opens the gate to many possible definitions for ranking and aggregate queries. With the wide presence of probabilistic data, processing ranking and aggregate queries efficiently with the right semantics is of key importance for the successful deployment of probabilistic databases.
Funding: Funded by NSF IIS Program under the project " III:Small:Efficient Ranking and Aggregate Query Processing for Probablistic Data", sole PI, Feifei Li, 09/01/09-08/31/12, $328,831.Project Website: Coming soon!
Title: Geometric-Aware Query Processing Techniques in Spatial and Multimedia Applications
Abstract: Sspatial databases have offered a large number of applications in last decade that shape the horizon of computing services from people’s daily life to scientific research. For example, people rely on online map services to plan their trips; the deployment and query processing in large sensor networks often require the design of location-aware algorithms. Driven by these increasing number of applications, efficient processing of important and novel query types in spatial databases has always been a focal point. We are interested at designing geometric-aware query processing algorithms for spatial and multimedia databases. More specifically, by taking into account the particular geometric properties pertained to the application at hand, our goal is to deliver practical, efficient and effective algorithms for real applications on large scale data sets.
Project Website: Coming soon!