A Blog Post about Query Execution Engines
Posted: September 4, 2016 Filed under: Research | Tags: analytical database, database, execution engine, query processing, sql-on-hadoop Leave a commentRecently, I joined a team blog for sharing knowledge and experiences with nice guys. In the blog, I wrote a blog post about query execution engines at
A Survey of Query Execution Engines (from Volcano to Vectorized Processing). Enjoy!
Java GC 관련 링크 정리
Posted: February 29, 2016 Filed under: Code, Uncategorized | Tags: GC, jvm Leave a comment- Java Garbage Collection, Naver D2 Hello World (in Korean)
- Garbage Collection Optimization for High-Throughput and Low-Latency Java Applications, LinkedIn Engineering
- JVM GC Settings and HBase Performance
- How Garbage Collection differs in the three big JVMs
- Java Garbage Collection Distilled
- Avoiding Full GCs in HBase with MemStore-Local Allocation Buffers: (Part 1, Part 2)
Links about Array DBMSs
Posted: December 1, 2015 Filed under: Research, Tokamak Project | Tags: array dbms, scientific computing, tokamak 1 CommentThis article just lists resources available in Internet and papers about array DBMSs and scientific databases.
General
- Array DBMS in Wikipedia
- Rasdaman, an Array DBMS production
- Array Databases: The Next Big Thing in Data Analytics?, Datanami
Applications of Array Data Model
- Geo-spatial data
- scientific data
- financial feeds
- sensor data
- sequencing data
From Academia
General
- Overview of SciDB, ACM SIGMOD 2010 (PDF)
- A presentation material made by other guy
- Paper List of Brown Univ’s Data Management Research Group
- Paper List of UW’s SciDB Branch
- SciDB DBMS Research at M.I.T.
- The Architecture of SciDB
Query Language or Interface
- SciQL: A Query Language for Science Applications, Workshop Array Databases 2011 (PDF)
Query Processing
- Efficient Iterative Processing in the SciDB Parallel Array Engine, SSDBM 2015. (PDF)
- Squeezing a Big Orange into Little Boxes: The AscotDB System for Parallel Processing of Data on a Sphere, IEEE Data Engineering Bulletin, 2013.
- ArrayStore: A Storage Manager for Complex Parallel Array Processing, ACM SIGMOD 2011.
- Hybrid Merge/Overlap Execution Technique for Parallel Array Processing, Workshop Array Databases 2011 (PDF)
- An Array Library for MS SQL Server, Workshop Array Databases 2011 (PDF)
- SAGA: Array Storage as a DB with Support for Structural Aggregations, SSDBM 2014
- Hybrid Merge/Overlap Execution Technique for Parallel Array Processing, AD 2011
- Time Travel in a Scientific Array Database, ICDE 2013
Applications
- Accelerating Computationally Intensive Queries on Massive Earth Science Data, Workshop Array Databases 2011 (PDF)
- Sample uses of HDF 2006 (PDF)
- A Survey of Scientific Applications using SciDB
- Paradigm4 White Papers
Data Format
- An Overview of the HDF5 Technology Suite and its Applications, Workshop Array Databases 2011 (PDF)
SciDB From Paradigm4
- Why an Array Database?, Paradigm4
- MAC™—the key to fast range selects and joins
- Analytics for Massive Data Sets, Paradigm4 (slide)