Questions tagged [tools]

Use for questions about the different software and hardware used to assist in, and solve, data science problems

68 questions
9
votes
3 answers

Google prediction API: What training/prediction methods Google Prediction API employs?

The details of the Google Prediction API are on this page, but I am not able to find any details about the prediction algorithms running behind the API. So far I have gathered that they let you provide your preprocessing steps in PMML format.
Tahir Akhtar
  • 315
  • 2
  • 9
7
votes
1 answer

Lightweight data provenance tool

One of the problems I often encounter is that of poor data provenance. When I do research I continuously make modifications to my code and rerun experiments. Each time I'm faced with a number of questions, such as: do I save the old results…
Benjamin B.
  • 245
  • 1
  • 6
3
votes
3 answers

What's your ideal work environment?

I'm a founder in a data science heavy startup, and I'm currently functioning as the entire dev team. Before I know it we'll have people working together on a project I, currently, work on completely alone. So: What are some must have things data…
Warlax56
  • 430
  • 2
  • 7
2
votes
1 answer

Reusable parameter scans wrapper

In most of my projects, I come up with models and want to visualize how some property $x$ varies as a function of a subset of parameters $p_1$,$p_2$, .. etc. So I'll often end up with figures of the "parameter scan" which look like this Those are…
jgyou
  • 121
  • 3
2
votes
0 answers

Making Factual drake work on Windows 7 64-bit

I have installed Drake on Windows 7 64-bit. I am using JDK 1.7.0_51. I tried both using the pre-compiled jar file and compiling from the Clojure source using leiningen. The resulting Drake version is 0.1.6, the current development version. When…
Yuval F
  • 761
  • 1
  • 6
  • 7
1
vote
1 answer

Any clear winner for Data Science in Scala?

I have found a number of libraries and tools for data science in Scala, I would like to know about which one has more adoption and which one is gaining adoption at a faster pace and to what extent this is the case. Basically, which one should I bet…
Trylks
  • 178
  • 8
1
vote
1 answer

How to extract most occuring words based on month & what tool to use?

Hi guys I'm very new to data science, I have intermediate background on programming and have used Pentaho Data Integration tool once for DB migration & data cleansing. Let's say I have this kind of data: item_details, timestamp Wooden chairs,…
Raizerde
  • 13
  • 3
1
vote
1 answer

What tools are out there to collect participants' browsing and/or search data as part of an experiment?

I'm running an experiment where I need to collect and analyse participants' browsing and search histories. The design of the experiment is similar to an "instrumented user panel", described…
Elise
  • 111
  • 2