Online Analytical Processing on Hadoop using Apache Kylin

Supriya Vitthal Ranawade; Shivani Navale; Akshay Dhamal; Kuldeep Deshpande; Chandrashekhar Ghuge

Call for Paper

May Edition

IJAIS solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 28 April 2026

Submit your paper

Know more

The week's pick

Optimized Decision Tree Classifier for Data Aggregation in Wireless Sensor Networks Using IoT Sensor Data

Jagan Kurma Raghuvaran Kendyala Varun Bitkuri Avinash Attipalli Jaya Vardhani Mamidala Sunil Jacob Enokkaren

Random Articles

Hybrid Approach for Query Expansion using Query Log

July

2014

Survey of Intrusion Detection and Prevention System in MANETs based on Data Gathering Techniques

February

2012

Evaluating the Effectiveness of Web Search Metrics

December

2012

Kohonen Self Organizing Map with Modified K-means clustering For High Dimensional Data Set

May

2012

Reseach Article

Online Analytical Processing on Hadoop using Apache Kylin

by Supriya Vitthal Ranawade, Shivani Navale, Akshay Dhamal, Kuldeep Deshpande, Chandrashekhar Ghuge

International Journal of Applied Information Systems

Foundation of Computer Science (FCS), NY, USA

Volume 12 - Number 2

Year of Publication: 2017

Authors: Supriya Vitthal Ranawade, Shivani Navale, Akshay Dhamal, Kuldeep Deshpande, Chandrashekhar Ghuge

10.5120/ijais2017451682

Supriya Vitthal Ranawade, Shivani Navale, Akshay Dhamal, Kuldeep Deshpande, Chandrashekhar Ghuge . Online Analytical Processing on Hadoop using Apache Kylin. International Journal of Applied Information Systems. 12, 2 ( May 2017), 1-5. DOI=10.5120/ijais2017451682

@article{ 10.5120/ijais2017451682,

author = { Supriya Vitthal Ranawade, Shivani Navale, Akshay Dhamal, Kuldeep Deshpande, Chandrashekhar Ghuge },

title = { Online Analytical Processing on Hadoop using Apache Kylin },

journal = { International Journal of Applied Information Systems },

issue_date = { May 2017 },

volume = { 12 },

number = { 2 },

month = { May },

year = { 2017 },

issn = { 2249-0868 },

pages = { 1-5 },

numpages = {9},

url = { https://www.ijais.org/archives/volume12/number2/983-2017451682/ },

doi = { 10.5120/ijais2017451682 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2023-07-05T19:07:55.092166+05:30

%A Supriya Vitthal Ranawade

%A Shivani Navale

%A Akshay Dhamal

%A Kuldeep Deshpande

%A Chandrashekhar Ghuge

%T Online Analytical Processing on Hadoop using Apache Kylin

%J International Journal of Applied Information Systems

%@ 2249-0868

%V 12

%N 2

%P 1-5

%D 2017

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In the Big Data age, it is necessary to remodel the traditional data warehousing and Online Analytical Processing (OLAP) system. Many challenges are posed to the traditional platforms due to the ever increasing data. In this paper, we have proposed a system which overcomes the challenges and is also beneficial to the business organizations. The proposed system mainly focuses on OLAP on Hadoop platform using Apache Kylin. Kylin is an OLAP engine which builds OLAP cubes from the data present in hive and stores the cubes in HBase for further analysis. The cubes stored in HBase are analyzed by firing SQL-like analytics queries. Also, reports and dashboards are further generated on the underlying cubes that provides powerful insights of the company data. This helps the business users to take decisions that are profitable to the organization. The proposed system is a boon to small as well as large scale business organizations. The aim of the paper is to present a system which builds OLAP cubes on Hadoop and generate insightful reports for business users.

References

Kuldeep Deshpande, and Dr. Bhimappa Desai “Limitations of datawarehouse platforms and Assessment of Hadoop as an alternative”, IJITMIS, Volume 5, Issue 2, pp. 51-58, 2014.
Shruti Tekadpande, Leena Deshpande “Analysis and Design of ETL process using Hadoop”, IJEIT, Volume 4, Issue 12, pp. 171-174 2015.
T.K.Das and Arati Mohapatro, “A Study on Big Data Integration with Data Warehouse”, International Journal of Computer Trends and Technology (IJCTT) – volume 9, number 4, Mar 2014.
Merv Adrian and Colin White, “Analytic Platforms: Beyond the Traditional Data Warehouse”, BeyeNETWORK Custom Research Report, 2010.
Clark Bradley, Ralph Hollingshead, Scott Kraus, Jason Lefler, Roshan Taheri, “Data Modeling Considerations in Hadoop and Hive”, Technical paper, SAS, 2013.
Bo Wang, Hao Gui, Mark Roantree, Martin F. O’Connor, “Data Cube Computational Model with Hadoop MapReduce”, WEBIST, 2014.
Marissa Rae Hollingsworth, Hadoop and Hive as Scalable Alternatives to RDBMS: A Case Study, Boise State University, 2012.
Jie Song, Chaopeng Guo, Zhi Wang, Yichan Zhang, Ge Yu, et al.. “HaoLap: a Hadoop based OLAP system for big data”, Journal of Systems and Software, Elsevier, 2015, vol. 102, pp.167-181.
Cohen, J., Dolan, B., Dunlap, M., Hellerstein, J.M., and Welton, C. MAD Skills: New Analysis Practices for Big Data. PVLDB 2(2), 2009.
Yongqiang He et all “RCFile: A Fast and Space-efficient Data Placement Structure in MapReduce-based Warehouse Systems”, ICDE 2011.
Philip Russom, Next generation Datawarehouse platforms, The Datawarehousing Institute, 2009
Ashish Thusoo, Joydeep Sen Sarma, Amit Jain, Zheng Shao, Prasad Chakka, Ning Zhang, Suresh Antony, Hao Liu and Raghotham Murthy, Hive – A Petabyte Scale Data Warehouse Using Hadoop, IEEE, 2010.
Alfredo Cuzzocrea, Il-Yeol Song, Ladjel Bellatreche, “Data Warehousing and OLAP over Big Data: Current challenges and future research directions”, DOLAP '13 Proceedings of the sixteenth international workshop on Data warehousing and OLAP, Pages 67-70.
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., Pellow, F., and Pirahesh, H. Data Cube: A Relational Aggregation Operator Generalizing Group-by, Cross-Tab, and Sub Totals. Data Mining and Knowledge Discovery 1(1), 1997.
Tian, X., 2008. Large-scale SMS messages mining based on map-reduce, IEEE International Symposium on Computational Intelligence and Design. Piscataway, NJ, USA, October 17–18, 2008, pp. 7–12.
http://kylin.apache.org/

Index Terms

Computer Science

Information Sciences

Keywords

Analytics OLAP Hadoop Kylin Hive Datawarehouse