_覓 | 覦覈襦 | 豕蠏手 | 殊螳 | 譯殊碁 |
FrontPage › HivePython
|
|
伎 襦蠏碁 讌.
import sys import string from pykospacing import spacing for line in sys.stdin: line = line.strip() pc_id, msg = line.split("\t") print ("\t".join([pc_id, spacing(msg)])) hdfs k.
hadoop fs -put -f /home/hdfs/py/chat_spacing.py /user/hive/udfs transform 襯 .
set hive.execution.engine=mr; add file hdfs:///user/hive/udfs/chat_spacing.py; select transform(msg) using 'python3.4 chat_spacing.py' as (pc_id bigint, msg string) from ( select concat(cast(id as string), "\t", sss) msg from sample ) t; [edit]
pyhive #from pyhive import hive conn = hive.Connection(host="192.168.0.100", port=10000, username="hdfs") cursor = conn.cursor() cursor.execute("SHOW TABLES") for table in cursor.fetchall(): print(table) export http_proxy='http://xxx.com' export https_proxy='https://xxx.com' pip3.4 install pip3.4 install tensorflow --trusted-host pypi.python.org --index-url=https://pypi.python.org/simple/
鏤
|
語 ろ企. ろ 襷讌襦 轟 譬 . (襾語) |