ACM Computing Surveys 28A(4), December 1996, http://www.acm.org/surveys/1996/Formatting/. Copyright © 1996 by the Association for Computing Machinery, Inc. See the permissions statement below.

A Role for Database Research in the Database Industry

David Lomet
Microsoft Research , Database Group
One Microsoft Way, Redmond, WA 98052
lomet@microsoft.com , http://www.research.microsoft.com/research/db/lomet
(206)703-1853


Abstract: We argue that research in the database industry is not primarily a matter of discovering new problems. Rather, the main role of database research is to formulate the basic abstractions, provide the elegant and general algorithms, and characterize performance.

Categories and Subject Descriptors:

General Terms:

Additional Key Words and Phrases: database research


Publication Information

Citation
Lomet, D., 1996. A Role for Database Research in the Database Industry ACM
Submission date
November 15, 1996
Acceptance date
November 15, 1996

Research in the Database Industry

A database industry would be alive and well in the US and elsewhere, even if researchers had never entered the database arena. This area, first and foremost, copes with the business data processing problem. Businesses were and are willing to spend money on this. Hence, the existence of the industry is no accident, and certainly did not require researchers to identify the problem.

In addition, a good bit of the database technology would have evolved without research input. Something close to the notion of a transaction existed in IMS around 1970. Data models, both hierarchical and network (looking very much like extended relational and OO navigation models) already existed by the early 70's without research input. Tree indexing and hashing were in use in a similar time frame. We need to understand how research contributed to the evolution of database technology so that we can understand the role it might play in the future.

The research contribution, in my view, consisted of providing two fundamental abstractions, transactions and the relational model. Working with these abstractions, one could enormously expand the scope of the algorithm solution space, hence improving functionality, performance, indeed many desirable attributes of database systems. These abstractions gave database users models with which they could cope. And researchers leapt into the database technology enterprise by exploiting these abstractions with technical solutions that were both general and elegant. With transactions, it was concurrency control, recovery, and availability techniques that resulted. With the relational model, it was normalization, query processing, optimization, data independence, indexing and storage organizations.

The point here is that industry identified the problems and provided the early impetus. Researchers came along later and provided the clean abstractions and the elegant solutions. These aree what enables database technology to be readily transmitted to new practitioners and to become solid engineering, not just arcane craft. This has served our field well, giving researchers important problems to ponder, and has returned to industry elegant abstractions, algorithms, and understanding. This is likely to continue to be the model, and I believe it should be.

Areas of Current Industry Interest

So what are the areas on which industry has just begun to focus? I think it is these areas that cry out, or should cry out to researchers as the golden opportunities. I am unconvinced that the research community is likely to anticipate the next area with a huge impact. So the areas that I cite below (only some of the possible areas) have largely had some preliminary industrial exploration already. But fundamental understandings are in short supply, as are elegant and generalizable algorithms.

Industry is moving fast in all these areas, but research should be able to play its customary role of understanding and generalizing, and hence providing the foundation upon which superior technolgy can and will be built.


Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept, ACM Inc., fax +1 (212) 869-0481, or permissions@acm.org.

>