Data refers to raw, unprocessed facts, statistics, or information collected for reference, analysis, and processing. It comes in different formats: structured data: organized in a...
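Data is unprocessed facts and information, and it is classified as structured, unstructured, or semi-structured. Big data is characterized by large volume, fast generation, and a wide variety of types, which makes it hard for traditional databases to handle. A data warehouse stores data centrally to support long-term analysis. Hadoop is an open-source framework that uses MapReduce to process big data in parallel.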
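To make the MapReduce idea mentioned above more concrete, here is a minimal sketch of a word count written in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical and chosen only for illustration. It runs sequentially in a single process, whereas a real framework would spread the map and reduce calls across many machines.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def map_phase(line: str) -> List[Tuple[str, int]]:
    # Map: emit a (word, 1) pair for every word in one input line.
    # Each line is handled independently, so these calls could run in parallel.
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    # Shuffle: group all emitted values by key, so that every value for a
    # given word ends up in the same place before reduction.
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    # Reduce: collapse the list of values for one key into a single total.
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    # Hypothetical driver: a sequential stand-in for what a framework
    # would schedule across a cluster.
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "Big Hadoop data"]))
```

The split into three functions mirrors the usual description of the model: map calls share no state, the shuffle step brings matching keys together, and each reduce call sees only the values for its own key. That independence is what allows the work to be distributed.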