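Lately many readers have been asking me the same question: "We just ran a campaign, but I don't know how to write the summary or how to run the review." In an era when everyone talks about big data, everyone says: "Let the data speak, and run operations at a fine-grained level!" As a result, many data analysts fall into the same trap: they spend every day buried in post-campaign data reviews, yet after two or three years their skills are still mediocre and they have produced no analysis that actually guides the next campaign. Frustrated, they ask: "It feels like I do odd jobs all day. Is there a future in being a data analyst?" This happens because they skip one important step: asking, once a campaign ends, what the purpose of the campaign review is. In a broad sense, one value of a campaign review is to help the planning department design one exciting campaign after another and win attention, traffic, orders, and revenue from them. How to handle pre-campaign forecasting, in-campaign monitoring, and post-campaign review of an e-commerce campaign from a data perspective therefore becomes a question every data practitioner has to face, including but not limited to: 1. How do you break a major promotion's goals down into detailed data metrics? 2. How do you identify the key metrics for different kinds of campaigns? 3. How do you extract lessons that the next campaign can reuse? So how should a campaign review actually be done? This is a typical data analysis business question. It tests not which specific metrics you analyze but the analytical framework and reasoning you bring to a problem, which is the most basic skill of a good data analyst. We asked experienced data analysis practitioners to share what they have learned from their own work.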


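Data mining, also called data extraction or data excavation, is an advanced process that extracts latent, valid, and human-understandable patterns from massive data sets in line with defined business goals. At a shallow level, it combines the query, retrieval, and reporting functions of existing database management systems with multidimensional and statistical analysis to perform online analytical processing (OLAP), producing statistics that can inform decisions. At a deeper level, it discovers previously unknown, implicit information in the database. As data volumes explode, we need effective tools for data mining so that we can more easily find relationships, clusters, patterns, and classifications in huge data sets. Below, Xiaomai has collected five useful data mining tools on the market for your reference. 1. RapidMiner: RapidMiner, formerly YALE (Yet Another Learning Environment), is an environment for machine learning and data mining experiments, used for both research and practical data mining tasks. It is one of the world's leading open-source data mining systems. The tool is written in Java and provides advanced analytics through a template-based framework. Experiments can be built from a large number of arbitrarily nestable operators, which are described in XML files and assembled through RapidMiner's graphical user interface. Best of all, users do not need to write code. It already ships with many templates and other tools that make it easy to analyze data. 2. KNIME: KNIME (Konstanz Information Miner) is a user-friendly, accessible, and comprehensive open-source platform for data integration, processing, analysis, and exploration. It has a graphical user interface that lets users connect nodes to process data.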


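Big data refers to data sets that cannot be captured, managed, and processed with conventional software tools within a given time frame. The term is broad, and the data are large and complex, so processing and analyzing them requires purpose-built hardware and software. Below are several common, useful data analysis tools for reference. Smartbi: Smartbi is a leading BI vendor in China. Its product is positioned as a one-stop big data service platform that connects to business databases, data warehouses, and big data platforms for processing, analysis and mining, and visualization. It covers a range of data analysis needs, such as big data analysis, self-service exploratory analysis, map visualization, mobile management dashboards, command-center large screens, enterprise reporting platforms, and data mining. Smartbi's feature set is comprehensive: an organization installs and deploys it once and can then use complex Chinese-style reports, self-service BI, and data mining. Its performance, ease of use, and security are good, and it is widely used in finance, government, telecommunications, and other enterprises and institutions. Storm: Storm is free, open-source software, a distributed, fault-tolerant real-time computation system. Storm reliably processes unbounded streams of data, doing for real-time processing what Hadoop does for batch processing. Storm is simple, supports many programming languages, and is fun to use. Apache Drill: To help enterprise users find more effective ways to speed up queries over Hadoop data, the Apache Software Foundation recently launched an open-source project named "Drill". Apache Drill is an open-source implementation of Google's Dremel. According to the Hadoop vendor MapR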

Read Excel Cells and Copy content to txt file

I am working with RapidMiner at the moment and am trying to copy my RapidMiner results, which are in xlsx files, to txt files in order to do some further processing with Python. I have plain text in column A (A1-A1500) and the corresponding filename in column C (C1-C1500). Now my question: is there any way (I am thinking of the xlrd module) to read the content of every cell in column A and write it to a newly created txt file whose filename is given in the corresponding cell of column C?
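One way the task described above could be approached is sketched here. It is not a confirmed answer from the thread. Since xlrd 2.0 and later read only the legacy .xls format, the sketch assumes the third-party openpyxl package for the xlsx input. The function names `read_text_and_names` and `write_cells_to_files`, and the choice to append `.txt` when column C has no such extension, are illustrative assumptions.

```python
from pathlib import Path


def read_text_and_names(xlsx_path):
    """Yield (text, filename) pairs from columns A and C of the first sheet."""
    # Assumption: openpyxl is installed (pip install openpyxl). It is imported
    # here so the writing step below can be used and tested without it.
    from openpyxl import load_workbook

    workbook = load_workbook(xlsx_path, read_only=True)
    sheet = workbook.worksheets[0]
    # values_only=True yields plain cell values instead of Cell objects.
    for text, _, name in sheet.iter_rows(min_col=1, max_col=3, values_only=True):
        yield text, name
    workbook.close()


def write_cells_to_files(pairs, out_dir):
    """Write each text to out_dir/<filename>; skip rows missing either value."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for text, name in pairs:
        if text is None or name is None:
            continue
        # Keep only the last path component so a cell value cannot point
        # outside out_dir.
        filename = Path(str(name).strip()).name
        if not filename:
            continue
        if not filename.lower().endswith(".txt"):
            filename += ".txt"
        target = out_dir / filename
        target.write_text(str(text), encoding="utf-8")
        written.append(target)
    return written
```

With a hypothetical input file `results.xlsx`, the two steps would be combined as `write_cells_to_files(read_text_and_names("results.xlsx"), "txt_out")`. Keeping the writing step separate from the Excel reader means it also works on pairs from any other source, such as a CSV export.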

Can I export RapidMiner model to integrate with python?

I have trained a classifier model using RapidMiner after trying a lot of algorithms and evaluating them on my dataset. I also exported the model from RapidMiner as an XML file and a pkl file, but I can't read it in my Python program (scikit-learn). Is there any way to import a RapidMiner classifier/model into a Python program and use it to predict or classify new data in my end application?

Practically, I would say no - just train your model in sklearn from the beginning if that's where you want it.

How to write output from RapidMiner to a txt file?

I am using RapidMiner 5.3. I took a small document containing about three English sentences, tokenized it, and filtered the tokens by word length. I want to write the output to a different word document. I tried the Write Document utility, but it is not working: it simply writes the original document to the new one. However, when I write the output to the console, I get the expected result. Something seems to be wrong with the Write Document utility.
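For comparison, the same three steps (tokenize, filter by length, write the result to a text file) can be sketched in plain Python. This is a stand-in outside RapidMiner, not a fix for the Write Document behaviour described above. Splitting on non-letter characters, the length bounds `min_chars=4` and `max_chars=25`, and all function names are assumptions chosen for illustration.

```python
import re
from pathlib import Path


def tokenize(text):
    """Split text into runs of letters, dropping digits and punctuation."""
    # [^\W\d_]+ matches one or more Unicode letters.
    return re.findall(r"[^\W\d_]+", text)


def filter_by_length(tokens, min_chars=4, max_chars=25):
    """Keep tokens whose length lies in [min_chars, max_chars]."""
    return [t for t in tokens if min_chars <= len(t) <= max_chars]


def write_tokens(tokens, path):
    """Write the filtered tokens, space-separated, to a UTF-8 text file."""
    Path(path).write_text(" ".join(tokens), encoding="utf-8")


if __name__ == "__main__":
    sample = "The quick brown fox, aged 3, jumps."
    kept = filter_by_length(tokenize(sample))
    write_tokens(kept, "filtered.txt")  # hypothetical output path
```

Writing the filtered token list, not the original text, is the step that the question reports as going wrong, so the sketch keeps that hand-off explicit.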

How can I set the rows to be the attributes and the columns the samples in RapidMiner?

I have an Excel file with data where the columns are the samples and the rows are the attributes. I can't transpose the data, because the number of rows exceeds the maximum number of columns. When I load this data into RapidMiner, it automatically sets the columns as attributes and the rows as samples. How can I set the columns as samples and the rows as attributes?

Assuming that you mean that you can't transpose the data in Excel because of the row limitation, the shortest answer is to start your

Is there a way to import a RapidMiner MLP-ANN in OpenCV?

I trained and validated an MLP model in RapidMiner Studio. My input values are already normalized to [-1, 1]. As far as I understand, the MLP is fully defined by its weights. Now I'm trying to import it into OpenCV, as I don't want to retrain the whole model. I got all weights per node plus the bias from RapidMiner. OpenCV offers the function CvANN_MLP::load(), which can load an XML or YML file.
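Before converting the weights to another library's file format, it can help to check that the exported weights and biases alone reproduce the model's predictions. The sketch below is a minimal forward pass in plain Python. It assumes a logistic sigmoid on every layer, which must be checked against the activation the trained model actually uses. The `(weights, biases)` layer layout and the function names are hypothetical and do not correspond to any RapidMiner or OpenCV format.

```python
import math


def sigmoid(x):
    """Logistic sigmoid (assumed activation; verify against the trained model)."""
    return 1.0 / (1.0 + math.exp(-x))


def layer_forward(inputs, weights, biases):
    """Compute one dense layer: activation(W @ inputs + b).

    weights holds one row per node, each row holding one weight per input.
    """
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def mlp_forward(inputs, layers):
    """Feed inputs through a list of (weights, biases) layers in order."""
    values = list(inputs)
    for weights, biases in layers:
        values = layer_forward(values, weights, biases)
    return values
```

If the outputs for a few known samples match what RapidMiner reports, the weight ordering and activation assumptions are probably right, and the same numbers can then be arranged into whatever layout the target library expects.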