COP 6731 Theory and Implementation of Database Systems

Data Models and DBMS Architecture

Stonebraker, Michael, and Joey Hellerstein. "What goes around comes around." Readings in Database Systems 4 (2005).

Joseph M. Hellerstein and Michael Stonebraker. "Anatomy of a Database System."

Query Processing

Selinger, P. Griffiths, et al. "Access path selection in a relational database management system." Proceedings of the 1979 ACM SIGMOD international conference on Management of data. ACM, 1979.

Shapiro, Leonard D. "Join processing in database systems with large main memories." ACM Transactions on Database Systems (TODS) 11.3 (1986): 239-264.

DeWitt, David, and Jim Gray. "Parallel database systems: the future of high performance database systems." Communications of the ACM 35.6 (1992): 85-98.

Graefe, Goetz. "Encapsulation of parallelism in the Volcano query processing system." ACM SIGMOD Record. Vol. 19. No. 2. ACM, 1990.

Nyberg, Chris, et al. "AlphaSort: A RISC machine sort." ACM SIGMOD Record. Vol. 23. No. 2. ACM, 1994.

Mackert, Lothar F. "R* optimizer validation and performance evaluation for distributed queries." Readings in database systems (1988): 219-229.

Stonebraker, Michael, et al. "Mariposa: a wide-area distributed database system." The VLDB Journal 5.1 (1996): 48-63.

Papadias, Dimitris, et al. "Query processing in spatial network databases." Proceedings of the 29th international conference on Very large data bases-Volume 29. VLDB Endowment, 2003.

Bruno, Nicolas, Surajit Chaudhuri, and Luis Gravano. "Top-k selection queries over relational databases: Mapping strategies and performance evaluation." ACM Transactions on Database Systems (TODS) 27.2 (2002): 153-187.

Vitter, Jeffrey Scott. "External memory algorithms and data structures: Dealing with massive data." ACM Computing Surveys (CSUR) 33.2 (2001): 209-271.

Chaudhuri, Surajit. "An overview of query optimization in relational systems." Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems. ACM, 1998.

Graefe, Goetz. "Query evaluation techniques for large databases." ACM Computing Surveys (CSUR) 25.2 (1993): 73-169.

Kossmann, Donald. "The state of the art in distributed query processing." ACM Computing Surveys (CSUR) 32.4 (2000): 422-469.

O'Neil, Patrick, and Goetz Graefe. "Multi-table joins through bitmapped join indices." ACM SIGMOD Record 24.3 (1995): 8-11.

Data Storage and Access Methods

Stonebraker, Michael. "Operating system support for database management." Communications of the ACM 24.7 (1981): 412-418.

Gray, Jim, and Goetz Graefe. "The five-minute rule ten years later, and other computer storage rules of thumb." ACM SIGMOD Record 26.4 (1997): 63-68.

Patterson, David A., Garth Gibson, and Randy H. Katz. "A case for redundant arrays of inexpensive disks (RAID)." ACM SIGMOD Record. Vol. 17. No. 3. ACM, 1988.

Shekhar, Shashi, and Duen-Ren Liu. "CCAM: A connectivity-clustered access method for networks and network computations." Knowledge and Data Engineering, IEEE Transactions on 9.1 (1997): 102-119.

Stonebraker, Mike, et al. "C-store: a column-oriented DBMS." Proceedings of the 31st international conference on Very large data bases. VLDB Endowment, 2005.

Pavlo, Andrew, et al. "A comparison of approaches to large-scale data analysis." Proceedings of the 2009 ACM SIGMOD International Conference on Management of data. ACM, 2009.

Zaharia, Matei, et al. "Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing." Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation. USENIX Association, 2012.

Abadi, Daniel J., Samuel R. Madden, and Nabil Hachem. "Column-stores vs. row-stores: How different are they really?." Proceedings of the 2008 ACM SIGMOD international conference on Management of data. ACM, 2008.

Afrati, Foto N., and Jeffrey D. Ullman. "Optimizing multiway joins in a map-reduce environment." IEEE Transactions on Knowledge and Data Engineering 23.9 (2011): 1282-1298.

Fagin, Ronald, et al. "Extendible hashing—a fast access method for dynamic files." ACM Transactions on Database Systems (TODS) 4.3 (1979): 315-344.

Litwin, Witold. "Linear hashing: a new tool for file and table addressing." VLDB. Vol. 80. 1980.

Nievergelt, Jürg, Hans Hinterberger, and Kenneth C. Sevcik. "The grid file: An adaptable, symmetric multikey file structure." ACM Transactions on Database Systems (TODS) 9.1 (1984): 38-71.

Beckmann, Norbert, et al. "The R*-tree: an efficient and robust access method for points and rectangles." ACM SIGMOD Record. Vol. 19. No. 2. ACM, 1990.

Robinson, John T. "The KDB-tree: a search structure for large multidimensional dynamic indexes." Proceedings of the 1981 ACM SIGMOD international conference on Management of data. ACM, 1981.

Gaede, Volker, and Oliver Günther. "Multidimensional access methods." ACM Computing Surveys (CSUR) 30.2 (1998): 170-231.

Dean, Jeffrey, and Sanjay Ghemawat. "MapReduce: simplified data processing on large clusters." Communications of the ACM 51.1 (2008): 107-113.

Transaction Management

Gray, Jim N., et al. "Granularity of locks and degrees of consistency in a shared data base." IFIP Working Conference on Modelling in Data Base Management Systems. 1976.

Kung, Hsiang-Tsung, and John T. Robinson. "On optimistic methods for concurrency control." ACM Transactions on Database Systems (TODS) 6.2 (1981): 213-226.

Agrawal, Rakesh, Michael J. Carey, and Miron Livny. "Concurrency control performance modeling: alternatives and implications." ACM Transactions on Database Systems (TODS) 12.4 (1987): 609-654.

Lehman, Philip L. "Efficient locking for concurrent operations on B-trees." ACM Transactions on Database Systems (TODS) 6.4 (1981): 650-670.

Mohan, C., et al. "ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging." ACM Transactions on Database Systems (TODS) 17.1 (1992): 94-162.

Mohan, C., Bruce Lindsay, and Ron Obermarck. "Transaction management in the R* distributed database management system." ACM Transactions on Database Systems (TODS) 11.4 (1986): 378-396.

Gray, Jim, et al. "The dangers of replication and a solution." ACM SIGMOD Record 25.2 (1996): 173-182.

Thomasian, Alexander. "Concurrency control: methods, performance, and analysis." ACM Computing Surveys (CSUR) 30.1 (1998): 70-119.

Barghouti, Naser S., and Gail E. Kaiser. "Concurrency control in advanced database applications." ACM Computing Surveys (CSUR) 23.3 (1991): 269-317.

Gray, Jim. "The transaction concept: Virtues and limitations." VLDB. Vol. 81. 1981.

Extensibility

Stonebraker, Michael. "Inclusion of new types in relational data base systems." (1986): 262-269.

Hellerstein, Joseph M., Jeffrey F. Naughton, and Avi Pfeffer. Generalized search trees for database systems. September, 1995.

Lohman, Guy M. "Grammar-like functional rules for representing query optimization alternatives." ACM SIGMOD Record. Vol. 17. No. 3. ACM, 1988.

Database Evolution

Chaudhuri, Surajit, and Vivek Narasayya. "AutoAdmin “what-if” index analysis utility." ACM SIGMOD Record 27.2 (1998): 367-378.

Bernstein, Philip A. "Applying Model Management to Classical Meta Data Problems." CIDR. Vol. 2003. 2003.

Mohan, C., and Inderpal Narang. "Algorithms for creating indexes for very large tables without quiescing updates." ACM SIGMOD Record. Vol. 21. No. 2. ACM, 1992.

Data Warehousing

Chaudhuri, Surajit, and Umeshwar Dayal. "An overview of data warehousing and OLAP technology." ACM SIGMOD Record 26.1 (1997): 65-74.

O'Neil, Patrick, and Dallan Quass. "Improved query performance with variant indexes." ACM SIGMOD Record. Vol. 26. No. 2. ACM, 1997.

Gray, Jim, et al. "Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals." Data Mining and Knowledge Discovery 1.1 (1997): 29-53.

Zhao, Yihong, Prasad M. Deshpande, and Jeffrey F. Naughton. "An array-based algorithm for simultaneous multidimensional aggregates." ACM SIGMOD Record. Vol. 26. No. 2. ACM, 1997.

Ceri, Stefano, and Jennifer Widom. "Deriving production rules for incremental view maintenance." (1991).

Hellerstein, Joseph M., Ron Avnur, and Vijayshankar Raman. "Informix under control: Online query processing." Data Mining and Knowledge Discovery 4.4 (2000): 281-314.

Kotidis, Yannis, and Nick Roussopoulos. "DynaMat: a dynamic view management system for data warehouses." ACM SIGMOD Record. Vol. 28. No. 2. ACM, 1999.

Data Mining

Zhang, Tian, Raghu Ramakrishnan, and Miron Livny. "BIRCH: an efficient data clustering method for very large databases." ACM SIGMOD Record. Vol. 25. No. 2. ACM, 1996.

Shafer, John, Rakesh Agrawal, and Manish Mehta. "SPRINT: A scalable parallel classifier for data mining." Proc. 1996 Int. Conf. Very Large Data Bases. 1996.

Agrawal, Rakesh, and Ramakrishnan Srikant. "Fast algorithms for mining association rules." Proc. 20th int. conf. very large data bases, VLDB. Vol. 1215. 1994.

Park, Jong Soo, Ming-Syan Chen, and Philip S. Yu. "An effective hash-based algorithm for mining association rules." ACM SIGMOD Record. Vol. 24. No. 2. ACM, 1995.

Kuramochi, Michihiro, and George Karypis. "An efficient algorithm for discovering frequent subgraphs." Knowledge and Data Engineering, IEEE Transactions on 16.9 (2004): 1038-1051.

Bastide, Y., Taouil, R., Pasquier, N., Stumme, G., and Lakhal, L., "Mining frequent patterns with counting inference", ACM SIGKDD Explorations Newsletter, 2 (2), pp. 66–75 (2000).

Bayardo Jr, R. J., "Efficiently mining long patterns from databases", Proceedings of the ACM SIGMOD International Conference on Management of Data, ACM, pp. 85–93 (1998).

Calders, T. and Goethals, B., "Non-derivable itemset mining", Data Mining and Knowledge Discovery, 14 (1), pp. 171–206 (2007).

Gouda, K. and Zaki, M. J., "Genmax: An efficient algorithm for mining maximal frequent itemsets", Data Mining and Knowledge Discovery, 11 (3), pp. 223–242 (2005).

Zaki, M. J. and Hsiao, C.-J., "Efficient algorithms for mining closed itemsets and their lattice structure", IEEE Transactions on Knowledge and Data Engineering, 17 (4), pp. 462–478 (2005).

Pei, J., Han, J., Mortazavi-Asl, B., Wang, J., Pinto, H., Chen, Q., Dayal, U., and Hsu, M.-C., "Mining sequential patterns by pattern-growth: The prefixspan approach", IEEE Transactions on Knowledge and Data Engineering, 16 (11), pp. 1424–1440 (2004).

Srikant, R. and Agrawal, R., "Mining sequential patterns: Generalizations and performance improvements", Proceedings of the 5th International Conference on Extending Database Technology, Springer (1996).

Zaki, M. J., "SPADE: An efficient algorithm for mining frequent sequences", Machine learning, 42 (1-2), pp. 31–60 (2001).

Al Hasan, M. and Zaki, M. J., "Output space sampling for graph patterns", Proceedings of the VLDB Endowment, 2 (1), pp. 730–741 (2009).

Huan, J., Wang, W., and Prins, J., "Efficient mining of frequent subgraphs in the presence of isomorphism", Proceedings of the IEEE International Conference on Data Mining, IEEE, pp. 549–552 (2003).

Inokuchi, A., Washio, T., and Motoda, H. (2000), "An apriori-based algorithm for mining frequent substructures from graph data", in: Proceedings of the European Conference on Principles of Data Mining and Knowledge Discovery, Springer, pp. 13–23.

Kuramochi, M. and Karypis, G., "Frequent subgraph discovery", Proceedings of the IEEE International Conference on Data Mining, IEEE, pp. 313–320 (2001).

Yan, X. and Han, J., "gSpan: Graph-based substructure pattern mining", Proceedings of the IEEE International Conference on Data Mining, IEEE, pp. 721–724 (2002).

Zaki, M. J., "Efficiently mining frequent trees in a forest", Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp. 71–80 (2002).

Geng, L. and Hamilton, H. J., "Interestingness measures for data mining: A survey", ACM Computing Surveys, 38 (3), p. 9 (2006).

Gionis, A., Mannila, H., Mielikäinen, T., and Tsaparas, P., "Assessing data mining results via swap randomization", ACM Transactions on Knowledge Discovery from Data, 1 (3), p. 14 (2007).

Lallich, S., Teytaud, O., and Prudhomme, E., "Association rule interestingness: measure and statistical validation", in: Quality measures in data mining, Springer, pp. 251–275 (2007).

Megiddo, N. and Srikant, R., "Discovering predictive association rules", Proceedings of the 4th International Conference on Knowledge Discovery in Databases and Data Mining, pp. 274–278 (1998).

Tan, P.-N., Kumar, V., and Srivastava, J. (2002), "Selecting the right interestingness measure for association patterns", Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp. 32–41.

Webb, G. I., "Discovering significant rules", Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp. 434–443 (2006).

Cheng, Y., "Mean shift, mode seeking, and clustering", Pattern Analysis and Machine Intelligence, IEEE Transactions on, 17 (8), pp. 790–799 (1995).

Ester, M., Kriegel, H.-P., Sander, J., and Xu, X., "A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise.", Proceedings of the 2nd ACM SIGKDD, ed. by E. Simoudis, J. Han, and U. M. Fayyad, AAAI Press, pp. 226–231 (1996).

Hinneburg, A. and Gabriel, H.-H., "Denclue 2.0: Fast clustering based on kernel density estimation", Proceedings of the 7th International Symposium on Intelligent Data Analysis, Springer, pp. 70–80 (2007).

Ye, Wei, et al. "Incorporating user's preference into attributed graph clustering." IEEE Transactions on Knowledge and Data Engineering 33.12 (2020): 3716-3728.

Wang, Wei, Jiong Yang, and Richard Muntz. "STING: A statistical information grid approach to spatial data mining." VLDB. Vol. 97. 1997.

Xu, Xiaowei, et al. "Scan: a structural clustering algorithm for networks." Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. 2007.

Fang, Yixiang, et al. "Effective community search over large spatial graphs." Proceedings of the VLDB Endowment (PVLDB) (2017).

Wen, Dong, et al. "Efficient structural graph clustering: an index-based approach." Proceedings of the VLDB Endowment 11.3 (2017): 243-255.

Cui, Wanyun, et al. "Local search of communities in large graphs." Proceedings of the 2014 ACM SIGMOD international conference on Management of data. 2014.

Budak, Ceren, et al. "Geoscope: Online detection of geo-correlated information trends in social networks." Proceedings of the VLDB Endowment 7.4 (2013): 229-240.

Shi, Jieming, et al. "Density-based place clustering in geo-social networks." Proceedings of the 2014 ACM SIGMOD international conference on Management of data. 2014.

Whang, Joyce Jiyoung, David F. Gleich, and Inderjit S. Dhillon. "Overlapping community detection using neighborhood-inflated seed expansion." IEEE Transactions on Knowledge and Data Engineering 28.5 (2016): 1272-1284.

Murthy, Sreerama K. "Automatic construction of decision trees from data: A multi-disciplinary survey." Data Mining and Knowledge Discovery 2 (1998): 345-389.

Zheng, Zijian, Ron Kohavi, and Llew Mason. "Real world performance of association rule algorithms." Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining. 2001.

Web Services and Data Bases

Eric A. Brewer. "Combining Systems and Databases: A Search Engine Retrospective"

Sizov, Sergej, et al. "The BINGO! System for Information Portal Generation and Expert Web Search." CIDR. 2003.

Jacobs, Dean. "Data management in application servers." Datenbank-Spektrum 8 (2004): 5-11.

Abiteboul, Serge. Querying semi-structured data. Springer Berlin Heidelberg, 1997.

Goldman, Roy, and Jennifer Widom. "Dataguides: Enabling query formulation and optimization in semistructured databases." (1997).

Chen, Jianjun, et al. "NiagaraCQ: A scalable continuous query system for internet databases." ACM SIGMOD Record. Vol. 29. No. 2. ACM, 2000.

Stream-Based Data Management

Hanson, Eric N., et al. "Scalable trigger processing." Data Engineering, 1999. Proceedings., 15th International Conference on. IEEE, 1999.

Seshadri, Praveen, Miron Livny, and Raghu Ramakrishnan. "The Design and Implementation of a Sequence Database System."

Avnur, Ron, and Joseph M. Hellerstein. "Eddies: Continuously adaptive query processing." ACM SIGMOD Record. Vol. 29. No. 2. ACM, 2000.

Balakrishnan, Hari, et al. "Retrospective on aurora." The VLDB Journal 13.4 (2004): 370-383.

Babcock, Brian, et al. "Models and issues in data stream systems." Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. ACM, 2002.

Datar, Mayur, et al. "Maintaining stream statistics over sliding windows." SIAM journal on computing 31.6 (2002): 1794-1813.

Haveliwala, Taher H. "Topic-sensitive pagerank." Proceedings of the 11th international conference on World Wide Web. ACM, 2002.


Make everything as simple as possible, but not simpler.

Albert Einstein


FAU • DEPARTMENT OF COMPUTER & ELECTRICAL ENGINEERING AND COMPUTER SCIENCE

© 2015 Florida Atlantic University. An Equal Opportunity/Equal Access Institution.