从Mysql导入数据到Hadoop-qkshan-ChinaUnix博客

工程技术学习（此博客停用）duanli.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

qkshan

博客访问： 106073
博文数量： 19
博客积分： 840
博客等级：准尉
技术积分： 235
用户组：普通用户
注册时间： 2009-10-02 21:25

文章分类

全部博文（19）

电脑故障（0）
算法与数据结构（1）
计划赶不上变化（1）
Hadoop与云计算（2）
WEB（1）
GUI设计（1）
操作系统（6）
置顶（1）
PCB（0）
机器人（0）
ARM（1）
linux（0）
skyeye模拟开发（1）
minigui（0）
fedora/RHEL（3）
未分配的博文（1）

文章存档

2011年（1）

2010年（5）

2009年（13）

我的朋友

相关博文

从Mysql导入数据到Hadoop

分类：系统运维

2010-06-26 23:00:47

《待翻译》from: http://everythingmysql.ning.com/profiles/blogs/hadoop-for-mysql-people

There's a lot of buzz lately about Hadoop. If you're completely new to Hadoop, I recommend the free videos from Cloudera (). If you have a vague idea and want to play around, it's easy!

First, download Cloudera's training VM which has a small Hadoop cluster already installed and running:

http://www.cloudera.com/developers/downloads/virtual-machine/

Second, you need to put some data into Hadoop. Fortunately for database folks, there's a tool to import data into Hadoop from MySQL called "Sqoop". It's already installed on the VM and there are instructions for using Sqoop to import some MySQL tables into Hadoop (see Desktop/instructions/exercises/SqoopExercise.html inside the VM). FYI, it's not uncommon to "Sqoop" data into Hadoop, do analysis and transformations, and then use Sqoop to export the data back to MySQL.

Now you're ready to do analysis of your data using Hadoop's powerful MapReduce. Except that MapReduce requires coding (Java, Python, PHP, etc) and an understanding of the functional programming model that is MapReduce. For an easier entry into Hadoop, try Hive. Hive is a data warehousing system for Hadoop. It offers a language (HiveQL) that feels just like SQL. Examples:

$ hive

hive> SHOW TABLES;

hive> SELECT * FROM LIMIT 10;

Hive supports most of the SQL queries you are used to. For example JOIN, LEFT OUTER JOIN, RIGHT OUTER JOIN, GROUP BY, ORDER BY, aggregate functions, etc. The best part is that Hive can scale to analyze petabytes of data!

阅读(2244) | 评论(0) | 转发(0) |

上一篇：HDFS的JAVA接口API操作实例

下一篇：利用JTBC(1.0)PHP搭建实验室网站记录

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6