Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Performance testing in lexical analysis on latest Twitter trends for enterprise network using PIG

Performance testing in lexical analysis on latest Twitter trends for enterprise network using PIG Twitter usage is outgrowing each day and so is the ever changing trend. The data flow keeps on increasing each day through tweets and thus analysis on the latest trends are done by using most widely used framework for processing large datasets-Apache Hadoop. This paper presents an overview of Twitter Trend analysis using Apache PIG in Hadoop framework and its performance will be tested using lexical analysis. Hadoop is preferred over other frameworks due to its robustness, scalability, cost effectiveness and high speed which makes parallel processing over distributed network easy. The paper also shows results obtained through lexical analysis done on the Hadoop cluster to show the efficiency of the program. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png International Journal of Enterprise Network Management Inderscience Publishers

Performance testing in lexical analysis on latest Twitter trends for enterprise network using PIG

Loading next page...
 
/lp/inderscience-publishers/performance-testing-in-lexical-analysis-on-latest-twitter-trends-for-Xjwkz6wwmq
Publisher
Inderscience Publishers
Copyright
Copyright © Inderscience Enterprises Ltd
ISSN
1748-1252
eISSN
1748-1260
DOI
10.1504/ijenm.2022.122405
Publisher site
See Article on Publisher Site

Abstract

Twitter usage is outgrowing each day and so is the ever changing trend. The data flow keeps on increasing each day through tweets and thus analysis on the latest trends are done by using most widely used framework for processing large datasets-Apache Hadoop. This paper presents an overview of Twitter Trend analysis using Apache PIG in Hadoop framework and its performance will be tested using lexical analysis. Hadoop is preferred over other frameworks due to its robustness, scalability, cost effectiveness and high speed which makes parallel processing over distributed network easy. The paper also shows results obtained through lexical analysis done on the Hadoop cluster to show the efficiency of the program.

Journal

International Journal of Enterprise Network ManagementInderscience Publishers

Published: Jan 1, 2022

There are no references for this article.