Most Popular

1500 questions
20
votes
1 answer

What is a fully convolution network?

I was surveying some literature related to Fully Convolutional Networks and came across the following phrase, A fully convolutional network is achieved by replacing the parameter-rich fully connected layers in standard CNN architectures by…
r4bb1t
  • 335
  • 1
  • 2
  • 8
20
votes
1 answer

What is the intuition behind the dot product attention?

I am watching the video Attention Is All You Need by Yannic Kilcher. My question is: what is the intuition behind the dot product attention? $$A(q,K, V) = \sum_i\frac{e^{q.k_i}}{\sum_j e^{q.k_j}} v_i$$ becomes: $$A(Q,K, V) = \text{softmax}(QK^T)V$$
DRV
  • 1,673
  • 3
  • 13
  • 19
20
votes
2 answers

What are the main differences between skip-gram and continuous bag of words?

The skip-gram and continuous bag of words (CBOW) are two different types of word2vec models. What are the main differences between them? What are the pros and cons of both methods?
DRV
  • 1,673
  • 3
  • 13
  • 19
20
votes
1 answer

Would Google's self-driving-car stop when it sees somebody with a T-shirt with a stop sign printed on it?

In Hidden Obstacles for Google’s Self-Driving Cars article we can read that: Google’s cars can detect and respond to stop signs that aren’t on its map, a feature that was introduced to deal with temporary signs used at construction sites. Google…
kenorb
  • 10,483
  • 3
  • 44
  • 94
20
votes
3 answers

Are Asimov's Laws flawed by design, or are they feasible in practice?

Isaac Asimov's famous Three Laws of Robotics originated in the context of Asimov's science fiction stories. In those stories, the three laws serve as a safety measure, in order to avoid untimely or manipulated situations from exploding in…
3442
  • 768
  • 4
  • 11
20
votes
2 answers

Problems that only humans will ever be able to solve

With the increasing complexity of reCAPTCHA, I wondered about the existence of some problem, that only a human will ever be able to solve (or that AI won't be able to solve as long as it doesn't reproduce exactly the human brain). For instance, the…
Marc Perlade
  • 303
  • 1
  • 6
19
votes
2 answers

Are Modular Neural Networks more effective than large, monolithic networks at any tasks?

Modular/Multiple Neural networks (MNNs) revolve around training smaller, independent networks that can feed into each other or another higher network. In principle, the hierarchical organization could allow us to make sense of more complex problem…
Harsh Sikka
  • 191
  • 2
19
votes
4 answers

Where can I find the original paper that introduced RNNs?

I was able to find the original paper on LSTM, but I was not able to find the paper that introduced "vanilla" RNNs. Where can I find it?
Ahsan Tarique
  • 291
  • 1
  • 2
  • 5
19
votes
4 answers

What is the difference between actor-critic and advantage actor-critic?

I'm struggling to understand the difference between actor-critic and advantage actor-critic. At least, I know they are different from asynchronous advantage actor-critic (A3C), as A3C adds an asynchronous mechanism that uses multiple worker agents…
Blaszard
  • 1,037
  • 3
  • 11
  • 25
19
votes
3 answers

What are the mathematical prerequisites for an AI researcher?

What are the mathematical prerequisites for understanding the core part of various algorithms involved in artificial intelligence and developing one's own algorithms? Please, refer to some specific books.
Surya Bhusal
  • 371
  • 1
  • 3
  • 12
19
votes
3 answers

How do I choose the best algorithm for a board game like checkers?

How do I choose the best algorithm for a board game like checkers? So far, I have considered only three algorithms, namely, minimax, alpha-beta pruning, and Monte Carlo tree search (MCTS). Apparently, both the alpha-beta pruning and MCTS are…
JoeyB
  • 477
  • 1
  • 5
  • 15
19
votes
1 answer

What is the difference between tree search and graph search?

I have read various answers to this question at different places, but I am still missing something. What I have understood is that a graph search holds a closed list, with all expanded nodes, so they don't get explored again. However, if you apply…
xava
  • 423
  • 1
  • 4
  • 10
19
votes
4 answers

What activation function does the human brain use?

Does the human brain use a specific activation function? I've tried doing some research, and as it's a threshold for whether the signal is sent through a neuron or not, it sounds a lot like ReLU. However, I can't find a single article confirming…
mlman
  • 301
  • 2
  • 5
19
votes
2 answers

What is the difference between an observation and a state in reinforcement learning?

I'm studying reinforcement learning. It seems that "state" and "observation" mean exactly the same thing. They both capture the current state of the game. Is there a difference between the two terms? Is the observation maybe the state after the…
echo
  • 693
  • 1
  • 6
  • 12
19
votes
2 answers

How to implement an "unknown" class in multi-class classification with neural networks?

For example, I need to detect classes for MNIST data. But I want to have not 10 classes for digits, but also I want to have 11th class "not a digit", so that any letter, any other type of image, or random noise would be classified as "not a digit".…
Sergey Kravchenko
  • 311
  • 1
  • 2
  • 5