Loading Data Files into Hive Tables - hxl - ChinaUnix Blog

Loading Data Files into Hive Tables

Category: HADOOP

2014-10-28 17:36:13

Loading a local file into a Hive table
1. Create the table in the hxl database
hive> create table tb_emp_info
    > (id int,
    > name string,
    > age int,
    > tel string)
    > ROW FORMAT DELIMITED
    > FIELDS TERMINATED BY '|'
    > STORED AS TEXTFILE;
OK
Time taken: 0.296 seconds
hive> show tables in hxl;
OK
tb_emp_info
Time taken: 0.073 seconds

2. Prepare the data to load
[hadoop1@node1 hive]$ more tb_emp_info.txt
1|name1|25|13188888888888
2|name2|30|13888888888888
3|name3|3|147896221
4|name4|56|899314121
5|name5|12|899314121
6|name6|9|899314121
7|name7|32|899314121
8|name8|42|158964
9|name9|86|899314121
10|name10|45|789541
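
LOAD DATA only copies the file into the table's directory and does not check it against the table definition, so a line with missing fields only shows up later as NULL columns in query results. The sketch below is one possible pre-load check, not part of the original procedure: the helper name check_fields and the two-line sample are made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: report every line whose number of '|'-separated
# fields differs from the expected count.
#   $1 = data file, $2 = expected number of fields
# Returns non-zero if at least one line is malformed.
check_fields() {
    awk -F'|' -v want="$2" '
        NF != want { printf "line %d: %d fields\n", NR, NF; bad = 1 }
        END { exit bad + 0 }' "$1"
}

# Demo on a throwaway file: the second line lacks the tel column.
sample=$(mktemp)
printf '%s\n' '1|name1|25|13188888888888' '3|name3|3' > "$sample"
if check_fields "$sample" 4; then
    echo "all lines OK"
else
    echo "malformed lines found"
fi
rm -f "$sample"
```

For the real file the call would be check_fields tb_emp_info.txt 4, run before starting the Hive CLI.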

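3. Load the file from the local file system
Change to the directory that contains tb_emp_info.txt, then run hive to start the Hive CLI. The relative path given to load data local inpath is resolved against that working directory.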
[hadoop1@node1 hive]$ hive
hive> use hxl;
OK
Time taken: 0.103 seconds
hive> load data local inpath 'tb_emp_info.txt' into table tb_emp_info;
Copying data from file:/home/hadoop1/file/hive/tb_emp_info.txt
Copying file: file:/home/hadoop1/file/hive/tb_emp_info.txt
Loading data to table hxl.tb_emp_info
OK
Time taken: 0.694 seconds

For a partitioned table, the target partition must be specified, for example:

hive> load data local inpath 'login.txt' into table tb_sso_ver_login_day partition(statedate=20141201);
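
When several daily files each belong in their own partition, the statement above can be generated once per date. This is a minimal sketch under assumptions: the function name gen_partition_loads and the file naming scheme login_<date>.txt are hypothetical, and only the table and partition column come from the example above.

```shell
#!/bin/sh
# Print one LOAD DATA statement per date passed as an argument.
# Assumption: each day's file is named login_<date>.txt (hypothetical naming).
gen_partition_loads() {
    for d in "$@"; do
        printf "load data local inpath 'login_%s.txt' into table tb_sso_ver_login_day partition(statedate=%s);\n" "$d" "$d"
    done
}

# Show the statements for two example dates.
gen_partition_loads 20141201 20141202

# To execute them, the output could be saved to a script and run with hive -f:
#   gen_partition_loads 20141201 20141202 > load_partitions.hql
#   hive -f load_partitions.hql
```

The generated script does not select a database, so a use statement for the right database would have to be added at the top before running it.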

4. Check the loaded data
hive> select * from tb_emp_info;
OK
1       name1   25      13188888888888
2       name2   30      13888888888888
3       name3   3       147896221
4       name4   56      899314121
5       name5   12      899314121
6       name6   9       899314121
7       name7   32      899314121
8       name8   42      158964
9       name9   86      899314121
10      name10  45      789541

5. The file backing the table can be inspected in its HDFS directory
hive> dfs -ls /user/hive/warehouse/hxl.db/tb_emp_info;
Found 1 items
-rw-r--r--   3 hadoop1 supergroup        214 2014-10-28 17:31 /user/hive/warehouse/hxl.db/tb_emp_info/tb_emp_info.txt
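
The listing reports a size of 214 bytes, which gives a quick check that the file was copied unchanged: the local file should have the same byte count. The sketch below recreates the ten sample lines from step 2 in a temporary file (assuming each line ends with a single newline) and counts the bytes. On the real system, wc -c tb_emp_info.txt in the source directory serves the same purpose.

```shell
#!/bin/sh
# Recreate the sample data from step 2 in a temporary file.
tmpfile=$(mktemp)
cat > "$tmpfile" <<'EOF'
1|name1|25|13188888888888
2|name2|30|13888888888888
3|name3|3|147896221
4|name4|56|899314121
5|name5|12|899314121
6|name6|9|899314121
7|name7|32|899314121
8|name8|42|158964
9|name9|86|899314121
10|name10|45|789541
EOF

# Count bytes locally; compare this with the size column of dfs -ls.
local_bytes=$(wc -c < "$tmpfile" | tr -d ' ')
echo "local size: $local_bytes bytes"
rm -f "$tmpfile"
```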

Loading an HDFS file into a Hive table

1. Inspect the file on HDFS
$ hadoop fs -cat /user/hadoop1/myfile/tb_class.txt
Partial output:
0|班级0|2014-10-29 14:10:17|2014-10-29 14:10:17
1|班级1|2014-10-29 14:10:17|2014-10-29 14:10:17
2|班级2|2014-10-29 14:10:17|2014-10-29 14:10:17
3|班级3|2014-10-29 14:10:17|2014-10-29 14:10:17
4|班级4|2014-10-29 14:10:17|2014-10-29 14:10:17
5|班级5|2014-10-29 14:10:17|2014-10-29 14:10:17
6|班级6|2014-10-29 14:10:17|2014-10-29 14:10:17
7|班级7|2014-10-29 14:10:17|2014-10-29 14:10:17
8|班级8|2014-10-29 14:10:17|2014-10-29 14:10:17

2. Create the table
create table tb_class_info
(id int,
class_name string,
createtime timestamp ,
modifytime timestamp)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY '|'
STORED AS TEXTFILE;
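
The two timestamp columns rely on the text values having the form yyyy-MM-dd HH:mm:ss, optionally followed by fractional seconds. Values in another format are read back as NULL. As a hedged sketch, the file could be checked before loading along the following lines. The helper name check_timestamps is hypothetical, and the field positions 3 and 4 are taken from the table definition above.

```shell
#!/bin/sh
# Hypothetical helper: flag values in fields 3 and 4 of a '|'-delimited file
# that do not look like "yyyy-MM-dd HH:mm:ss" with optional fractional seconds.
#   $1 = data file
# Returns non-zero if any value fails the pattern check.
check_timestamps() {
    awk -F'|' '
        {
            for (i = 3; i <= 4; i++)
                if ($i !~ /^[0-9][0-9][0-9][0-9]-[0-9][0-9]-[0-9][0-9] [0-9][0-9]:[0-9][0-9]:[0-9][0-9](\.[0-9]+)?$/) {
                    printf "line %d field %d: %s\n", NR, i, $i
                    bad = 1
                }
        }
        END { exit bad + 0 }' "$1"
}

# Demo: the second line uses slashes in its first timestamp.
sample=$(mktemp)
printf '%s\n' \
    '0|class0|2014-10-29 14:10:17|2014-10-29 14:10:17' \
    '1|class1|2014/10/29 14:10:17|2014-10-29 14:10:17' > "$sample"
check_timestamps "$sample" || echo "timestamp format problems found"
rm -f "$sample"
```

The check only looks at the shape of the value. It does not reject impossible dates such as month 13.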

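3. Load the file into the table
Without the local keyword the path refers to HDFS, and the file is moved (not copied) into the table's warehouse directory, so it no longer exists at its original location afterwards.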
load data inpath '/user/hadoop1/myfile/tb_class.txt' into table tb_class_info;

-- The End --
