E-books in Russian and English
|
| Principles of Distributed Database Systems, Third Editiondate: 8 марта 2011 / author: izograv / категория: Sql/Database / views: 3281 / comments: 0 Principles of Distributed Database Systems, Third Edition by M. Tamer Özsu and Patrick Valduriez This third edition of a classic textbook can be used to teach at the senior undergraduate and graduate levels. The material concentrates on fundamental theories as well as techniques and algorithms. The advent of the Internet and the World Wide Web, and, more recently, the emergence of cloud computing and streaming data applications, has forced a renewal of interest in distributed and parallel data management, while, at the same time, requiring a rethinking of some of the traditional techniques. This book covers the breadth and depth of this re-emerging field. The coverage consists of two parts. The first part discusses the fundamental principles of distributed data management and includes distribution design, data integration, distributed query processing and optimization, distributed transaction management, and replication. The second part focuses on more advanced topics and includes discussion of parallel database systems, distributed object management, peer-to-peer data management, web data management, data stream systems, and cloud computing. New in this Edition: • New chapters, covering database replication, database integration, multidatabase query processing, peer-to-peer data management, and web data management. • Coverage of emerging topics such as data streams and cloud computing • Extensive revisions and updates based on years of class testing and feedback Ancillary teaching materials are available. PREFACE TO THE SECOND EDITION Many things have changed since the publication of the first edition of this book in 1991. At the time, we reported projections that, by 1998, centralized database managers (DBMSs) would be an \antique curiosity" and most organizations would move towards distributed database managers. Distribution was slowly starting and "client/server" had just started to enter our daily jargon. These systems were generally multiple client/single server systems in which the distribution was mostly in terms of functionality, not data. If multiple servers were used, clients were responsible for managing the connections to these servers. Thus, transparency of access was not widely supported, and each client had to "know" the location of the required data. The distribution of data among multiple servers was very primitive; systems did not support fragmentation or replication of data. Systems of the time were \homogeneous" in that each system could manage only data that were stored in its own database, with no linkage to other repositories. Things have changed dramatically since then. Many vendors are much closer to achieving true distribution in their development cycle. Client/server systems remain the preferred solution in many cases, but they are much more sophisticated. For example, today's client/server systems provide signicant transparency in accessing data from multiple servers, support distributed transactions to facilitate transparency, and execute queries over (horizontally) fragmented data. Further, new systems implement both synchronous and asynchronous replication protocols, and many vendors have introduced gateways to access other databases. In addition, significant achievements have taken place in the development and deployment of parallel database servers. Object database managers have entered the marketplace and have found a niche market in some classes of applications which are inherently distributed. In parallel with these developments in the database system front, there have been phenomenal changes in the computer networking infrastructure that supports these systems. The relatively slow (10Mbit/sec) Ethernet has been replaced as the de facto local area network standard by much faster networks (FDDI or switched Ethernet) operating at around 100Mbit/sec, and broadband networks (particularly the ATM technology) have been deployed for both local area and wide area networking. These networks, coupled with very low overhead networking protocols, such as SCI, reduce the differences between local area and wide area networks (other than latency considerations) and potentially eliminate the network as the major performance bottleneck. This, in turn, requires us to review our system development assumptions and performance tuning criteria. Use of the Internet which is basically a heterogeneous network with links of varying capacities and capabilities has exploded. There is clearly a technology push/application pull in effect with respect to distributed DBMS development: new applications are requiring changes in DBMS capabilities, and new technological developments are making these changes possible. With these developments, it was time to prepare a revised second edition of the book. In the process, we have retained the fundamental characteristics and key features of the book as outlined in the Preface to the first edition. However, the material has been heavily edited. Every chapter has been revised some in fundamental ways, others more superficially. The major changes are the following: 1. The query processing/optimization chapters (Chapters 7{9) have been revised to focus on the techniques employed in commercial systems. New algorithms, such as randomized search strategies, are now included. 2. The transaction management chapters (Chapters 10{12) now include material on advanced transaction models and work flows. 3. Chapter 13, which focused on the relationship of distributed DBMSs and distributed operating systems, has been dropped and some of the material is incorporated into the relevant chapters. 4. The first edition contained a chapter (Chapter 15) which discussed current issues at the time parallel DBMSs, distributed knowledge-base systems (mainly deductive DBMSs), and distributed object DBMSs. In the intervening years, two of these topics have matured and become major forces in their own rights, while the third (deductive databases) has not achieved the same prominence. In this edition, we devote full chapters to parallel DBMSs (Chapter 13) and distributed object DBMSs (Chapter 14), and have dropped deductive DBMSs. 5. Following the same approach, we introduce a new chapter devoted to current issues (Chapter 16). This chapter now includes sections on data warehousing (from a distributed data management perspective), World Wide Web and databases, push-based technologies, and mobile DBMSs. 6. The chapter on multidatabase systems (Chapter 15 in the current edition) has been revised to include a discussion of general interoperability issues and distributed object platforms such as OMA/CORBA and DCOM/OLE. We are quite satisfied with the result, which represents a compromise between our desire to address new and emerging issues, and maintain the main characteristic of the book in addressing the principles of distributed data management. Certain chapters, in particular Chapters 15 and 16, require further depth, but those will be topics of future editions. The guide to reading the book, introduced in the Preface to the first edition, is still valid in general terms. However, we now discuss, in Chapter 3, the relationship between distributed DBMSs and the new networking technologies. Thus, this chapter no longer serves simply as background and should be read (at least the relevant sections) following Chapter 1. We have set up a Web site to communicate with our readers. The site is at cs.ualberta.ca/~database/distdb.html. This site contains presentation slides that accompany the book as well as other information regarding the book's use as a textbook. Many colleagues have helped with the revisions. Maggie Dunham and Nandid Soparkar provided detailed and early comments on the overall structure and content of the book. Maggie also provided input for the mobile database management section (Section 16.4). Ioannis Nikolaidis helped immensely with the revisions to Chapter 3he made us rewrite that chapter three times. Jari Veijaleinen provided many exercises which have been incorporated into this edition. Esther Pacitti provided input for replication protocols. Peter Trianta_llou provided material on this topic as well. Alexander Thomasian's input for performance evaluation work was invaluable, as was Elliot Moss's critical review of the nested transaction discussion in the transaction processing chapters. Mukesh Singhal advised us of the new advances in distributed deadlock management. Luc Bouganim contributed significantly to Chapter 13 on parallel DBMSs. Ken Barker and Kamalakar Karlapalem provided the material that formed the basis of distributed object database design in Chapter 14. Kaladhar Voruganti wrote the first draft of the architectural and system issues sections of Chapter 14. Randal Peters read Chapter 14 and forced us to revise many parts of it. The distributed garbage collection section of that chapter is based on a draft provided by Laurent Amsaleg and Michael Franklin. Amit Sheth provided input on the revised outline for Chapter 15. Asuman Dogac read the complete chapter and provided feedback. Mokrane Bouzeghoub and Eric Simon helped on the data warehouse section of Chapter 16. Dana Florescu, Alon Levy, Ioana Manolescu and Anthony Tomasic provided input for research prototypes in the section on Web and databases in Chapter 16. The material in push-based technologies section was reviewed (a number of times) by Stan Zdonik and Mike Franklin. Both of them also provided significant feedback about the characterization of data delivery alternatives. We are indebted to all of them, as well as to those who helped with the original edition of the book and whom we cite in the Preface to the First Edition. Many other colleagues have asked questions and provided suggestions over the years; unfortunately, we have not kept their names. Our thanks to everyone who has provided input. We look forward to receiving more suggestions on the second edition. We have had very good luck with our editors at Prentice Hall. Our current editor, Alan Apt, and our development editor, Sondra Chavez, have been tremendously helpful in both pushing us forward and providing the necessary institutional support. Our production editors, Ed DeFelippis and Irwin Zucker, have managed the production process so that the production of earlier chapters could proceed in parallel with our writing of the later chapters. This allowed the revised edition to be ready within one year. Stephen Lee, as our copy editor, made the entire text significantly more readable. Anne Nield helped us in many ways editing chapters, correcting the text and keeping us organized. Paul Iglinski wrote a number of scripts that helped immensely with cleaning up the b
Comments: 0
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Design/Web/Support/Anti-Leech by izograv @ yandex.ru
Optimized for Firefox | Anti-Leech tested on IE, Firefox, Reget