Most Popular

1500 questions
5 votes · 1 answer

Are most things generally discovered because they work empirically and later justified mathematically, or vice-versa?

In the original GloVe paper, the authors discuss group theory when coming up with equation (4). Is it possible that the authors came up with this model, figured out it was good, and then later found various group-theory justifications that…
Damien · 169 · 1
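For context on the step this question refers to: as I recall the paper (the equation numbering may differ), the group-theoretic discussion is about requiring F to be a homomorphism between the groups (R, +) and (R>0, ×), roughly:

```latex
% Sketch of the GloVe step the question refers to (from memory, not quoted):
% F is required to be a homomorphism between (R, +) and (R_{>0}, \times), i.e.
F\!\big((w_i - w_j)^{\top}\tilde{w}_k\big)
  \;=\; \frac{F\big(w_i^{\top}\tilde{w}_k\big)}{F\big(w_j^{\top}\tilde{w}_k\big)},
% which F = \exp satisfies, leading to w_i^{\top}\tilde{w}_k \approx \log P_{ik}
% up to bias terms.
```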
5 votes · 3 answers

How to make convnets aware of what the image actually is, not what is depicted in it?

I uploaded a picture of graffiti on a wall to Wolfram's ImageIdentify, but it recognized it as 'monocle'. Secondary guesses were 'primate', 'hominid', and 'person', so not even close to 'graffiti' or 'painting'. Is this by design, or are there…
kenorb · 10,483 · 3 · 44 · 94
5 votes · 1 answer

Could AI kill the joy of competitive sports and games?

Lee Sedol, a former world champion and legendary Go player, today announced his retirement with the quote "Even if I become the No. 1, there is an entity that cannot be defeated". Is it possible that AIs could kill the joy of competitive games (Go,…
Vildemort · 117 · 6
5 votes · 2 answers

Can CNNs be applied to non-image data, given that the convolution and pooling operations are mainly applied to imagery?

When using CNNs for non-image (time series) data prediction, what are some constraints or things to look out for as compared to image data? To be more precise, I notice there are different types of layers in a CNN model, as described below, which…
nilsinelabore · 241 · 2 · 12
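Since the excerpt above is truncated, here is a minimal sketch (my own, assuming TensorFlow/Keras; the window length, layer sizes, and one-step-ahead target are illustrative assumptions) of how convolution and pooling layers map onto a time-series window:

```python
# Minimal sketch of a 1D CNN for a univariate time series (illustrative only).
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

window = 64          # length of each input window
n_features = 1       # univariate series

model = keras.Sequential([
    keras.Input(shape=(window, n_features)),
    layers.Conv1D(filters=32, kernel_size=5, activation="relu"),  # convolution along time
    layers.MaxPooling1D(pool_size=2),                             # pooling along time
    layers.Conv1D(filters=64, kernel_size=3, activation="relu"),
    layers.GlobalAveragePooling1D(),
    layers.Dense(1),                                              # one-step-ahead prediction
])
model.compile(optimizer="adam", loss="mse")

# Dummy data just to show the expected shape: (samples, timesteps, features)
x = np.random.randn(128, window, n_features).astype("float32")
y = np.random.randn(128, 1).astype("float32")
model.fit(x, y, epochs=1, verbose=0)
```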
5 votes · 1 answer

How does Wit.ai convert sentences into structured data?

Wit.ai is a Siri-like voice interface which can parse messages and predict the actions to perform. Here is the demo site powered by Wit.ai. How does it understand spoken sentences and convert them into structured actionable data?…
kenorb · 10,483 · 3 · 44 · 94
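As a rough illustration only: the following is a generic intent/entity parse I made up, not Wit.ai's actual response schema, to show what "structured actionable data" for an utterance typically looks like:

```python
# Hypothetical NLU parse of "Set an alarm for 7am tomorrow" -- a generic
# intent/entity structure, NOT Wit.ai's real response format.
parse = {
    "text": "Set an alarm for 7am tomorrow",
    "intent": {"name": "set_alarm", "confidence": 0.97},
    "entities": {
        "datetime": {"value": "2024-01-02T07:00:00", "confidence": 0.93},
    },
}

# An application can then dispatch on the intent and fill in arguments from
# the entities, e.g. schedule an alarm at parse["entities"]["datetime"]["value"].
print(parse["intent"]["name"], parse["entities"]["datetime"]["value"])
```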
5 votes · 1 answer

Should I train different models for detecting subsets of objects?

Suppose we have $1000$ products that we want to detect. For each of these products, we have $500$ training images/annotations. Thus we have $500,000$ training images/associated annotations. If we want to train a good object detection algorithm to…
5 votes · 1 answer

Concrete example of latent variables and observables plugged into Bayes' rule

In the context of the variational auto-encoder, can someone give me a concrete example of the application of Bayes' rule $$p_{\theta}(z|x)=\frac{p_{\theta}(x|z)p(z)}{p(x)}$$ for a given latent variable and observable? I understand with VAEs…
user8714896 · 797 · 1 · 6 · 24
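A concrete instantiation (my own illustrative choices of prior and decoder, not quoted from the question): take z as the latent code with a standard Gaussian prior and x as an observed image with a Gaussian decoder; Bayes' rule then reads:

```latex
% Illustrative VAE instantiation of Bayes' rule:
p(z) = \mathcal{N}(z;\, 0, I), \qquad
p_{\theta}(x \mid z) = \mathcal{N}\!\big(x;\, \mu_{\theta}(z),\, \sigma^{2} I\big),
\qquad
p_{\theta}(z \mid x) = \frac{p_{\theta}(x \mid z)\, p(z)}{\int p_{\theta}(x \mid z')\, p(z')\, \mathrm{d}z'}
% The denominator p(x) is the intractable integral that motivates the
% variational approximation q_{\phi}(z \mid x).
```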
5 votes · 1 answer

Does MMD-VAE solve the problem of blurred images of vanilla VAEs?

I understand that with vanilla VAEs, there are a few reasons justifying the production of blurred out images. The InfoVAE paper describes the case when the decoder is flexible enough to ignore the latent attributes and generate an averaged out image…
Ananda · 148 · 9
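For reference, the regulariser that MMD-VAE/InfoVAE-style models add, restated here from memory for a positive-definite kernel k, measures the discrepancy between the aggregated posterior q(z) and the prior p(z):

```latex
% Squared maximum mean discrepancy between q(z) and p(z) for a kernel k:
\mathrm{MMD}^{2}(q, p) =
  \mathbb{E}_{z, z' \sim q}\!\left[k(z, z')\right]
  - 2\,\mathbb{E}_{z \sim q,\; z' \sim p}\!\left[k(z, z')\right]
  + \mathbb{E}_{z, z' \sim p}\!\left[k(z, z')\right]
```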
5 votes · 1 answer

How could decision tree learning algorithms cope with imbalanced classes?

Decision trees and random forests may or may not be well suited to supervised learning problems with imbalanced labels (or classes) in datasets. For example, see the article Using Random Forest to Learn Imbalanced Data, this Stats SE question, and…
jennifer ruurs · 579 · 2 · 9
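As a hedged illustration of one common mitigation discussed in that literature, class weighting, here is a minimal scikit-learn sketch; the synthetic data and the 1:9 imbalance are my own assumptions, not from the question:

```python
# Minimal sketch of class weighting for an imbalanced classification problem.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# class_weight="balanced" reweights the splitting criterion inversely to class
# frequency, so minority-class errors cost more during tree growth.
clf = RandomForestClassifier(n_estimators=200, class_weight="balanced", random_state=0)
clf.fit(X_tr, y_tr)
print(classification_report(y_te, clf.predict(X_te)))
```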
5 votes · 1 answer

What is the "semantic level"?

I am reading the paper Hierarchical Attention-Based Recurrent Highway Networks for Time Series Prediction (2018) by Yunzhe Tao et al. In this paper, they use the expression "semantic levels" several times. Some examples: HRHN can adaptively select…
MikelBa · 53 · 1 · 7
5 votes · 1 answer

What is "conditioning" on a feature?

On page 98 of Jet Substructure at the Large Hadron Collider: A Review of Recent Advances in Theory and Machine Learning, the author writes: Redacted phase space: Studying the distribution of inputs and the network performance after conditioning on…
Clumsy cat · 153 · 5
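A toy illustration of what "conditioning on a feature" usually means in this setting (my own sketch with random data, not from the review): evaluate the network's outputs only on the subset of events where the chosen feature falls in a given range:

```python
# Toy example: accuracy overall vs. accuracy conditioned on a feature window.
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
df = pd.DataFrame({
    "mass":  rng.uniform(50, 250, 10_000),   # the feature we condition on
    "label": rng.integers(0, 2, 10_000),     # true class
    "score": rng.uniform(0, 1, 10_000),      # network output
})

pred = (df["score"] > 0.5).astype(int)
overall = (pred == df["label"]).mean()

in_window = df["mass"].between(150, 200)     # the conditioning cut
conditioned = (pred[in_window] == df["label"][in_window]).mean()
print(f"overall={overall:.3f}, conditioned on 150<mass<200: {conditioned:.3f}")
```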
5 votes · 2 answers

Is the Mask Needed for Masked Self-Attention During Inference with GPT-2?

My understanding is that masked self-attention is necessary during training of GPT-2, as otherwise it would be able to directly see the correct next output at each iteration. My question is whether the attention mask is necessary, or even possible,…
D_s · 51 · 3
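For readers skimming the listing, here is a framework-agnostic NumPy sketch of the causal mask in question (my own illustration, not GPT-2 code): position i may only attend to positions up to i, which is why, when generating one token at a time, masking the single query row changes nothing:

```python
# Sketch of causally masked self-attention in plain NumPy.
import numpy as np

def causal_attention(q, k, v):
    """q, k, v: (seq_len, d). Returns masked softmax(q k^T / sqrt(d)) @ v."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    mask = np.triu(np.ones_like(scores, dtype=bool), k=1)  # True strictly above diagonal
    scores = np.where(mask, -np.inf, scores)               # forbid attending to the future
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

x = np.random.randn(5, 8)
out = causal_attention(x, x, x)  # row i depends only on rows 0..i
```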
5 votes · 1 answer

Why would the application of boosting prevent underfitting?

"Why would the application of boosting prevent underfitting?" I read in some paper that applying boosting would prevent you from underfitting. Why is that? Source: https://www.cs.cornell.edu/courses/cs4780/2015fa/web/lecturenotes/lecturenote13.html
jennifer ruurs · 579 · 2 · 9
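A small sketch of the intuition (my own, using scikit-learn's AdaBoost, whose default base learner is a depth-1 decision stump): boosting keeps adding weak learners fit to the current ensemble's mistakes, so training error, and hence bias, keeps shrinking even though each base learner underfits on its own:

```python
# Sketch: training accuracy of AdaBoost as more boosting rounds are added.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier

X, y = make_classification(n_samples=1000, n_informative=10, random_state=0)

for rounds in (1, 10, 100):
    # Default base estimator is a depth-1 decision stump (a high-bias learner).
    clf = AdaBoostClassifier(n_estimators=rounds, random_state=0).fit(X, y)
    print(rounds, "rounds -> training accuracy:", clf.score(X, y))
```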
5 votes · 1 answer

Context-based gap-fill face posture-mapper GAN

These images are handmade, not auto-generated like they will be in production. Apologies for inaccuracies in the graph overlay. I am trying to build an AI like that displayed in the diagram: when given a training set of images with their…
5 votes · 1 answer

Isn't deep fake detection bound to fail?

Deep fakes are a growing concern: the ability to credibly alter a video may have great (negative) impacts on our society. It is so much of a concern, that the biggest tech companies launched a specific challenge:…
Lucas Morin · 252 · 2 · 13