Use for questions about the different software and hardware used to assist in, and solve, data science problems
Questions tagged [tools]
68 questions
9
votes
3 answers
Google prediction API: What training/prediction methods Google Prediction API employs?
The details of the Google Prediction API are on this page, but I am not able to find any details about the prediction algorithms running behind the API.
So far I have gathered that they let you provide your preprocessing steps in PMML format.

Tahir Akhtar
- 315
- 2
- 9
7
votes
1 answer
Lightweight data provenance tool
One of the problems I often encounter is that of poor data provenance.
When I do research I continuously make modifications to my code and rerun experiments. Each time I'm faced with a number of questions, such as: do I save the old results…

Benjamin B.
- 245
- 1
- 6
3
votes
3 answers
What's your ideal work environment?
I'm a founder in a data science heavy startup, and I'm currently functioning as the entire dev team. Before I know it we'll have people working together on a project I, currently, work on completely alone. So:
What are some must have things data…

Warlax56
- 430
- 2
- 7
2
votes
1 answer
Reusable parameter scans wrapper
In most of my projects, I come up with models and want to visualize how some property $x$ varies as a function of a subset of parameters $p_1$,$p_2$, .. etc.
So I'll often end up with figures of the "parameter scan" which look like this
Those are…

jgyou
- 121
- 3
2
votes
0 answers
Making Factual drake work on Windows 7 64-bit
I have installed Drake on Windows 7 64-bit.
I am using JDK 1.7.0_51.
I tried both using the pre-compiled jar file and
compiling from the Clojure source using leiningen.
The resulting Drake version is 0.1.6, the current development version.
When…

Yuval F
- 761
- 1
- 6
- 7
1
vote
1 answer
Any clear winner for Data Science in Scala?
I have found a number of libraries and tools for data science in Scala, I would like to know about which one has more adoption and which one is gaining adoption at a faster pace and to what extent this is the case. Basically, which one should I bet…

Trylks
- 178
- 8
1
vote
1 answer
How to extract most occuring words based on month & what tool to use?
Hi guys I'm very new to data science,
I have intermediate background on programming and have used Pentaho Data Integration tool once for DB migration & data cleansing.
Let's say I have this kind of data:
item_details, timestamp
Wooden chairs,…

Raizerde
- 13
- 3
1
vote
1 answer
What tools are out there to collect participants' browsing and/or search data as part of an experiment?
I'm running an experiment where I need to collect and analyse participants' browsing and search histories. The design of the experiment is similar to an "instrumented user panel", described…

Elise
- 111
- 2