Most Popular

1500 questions
5
votes
2 answers

Are both the training and inference systems required in the same application?

From what I understand, there are 2 stages for deep learning: the first is training and the second is inference. The first is often done on GPUs because of their massive parallelism capabilities, among other things. The second, inference, while it…
5
votes
2 answers

Is it possible to classify data using a genetic algorithm?

Is it possible to classify data using a genetic algorithm? For example, would it be possible to sort this database? Any example in Matlab?
5
votes
1 answer

How do I use a taxonomy and the support vector machine for question classification?

I am going to develop an open-domain natural language question-answering (NLQA) system, and will use the support vector machine (SVM) as the machine learning (ML) model for question classification. The data that I have is from a cube, containing…
lilienfa
  • 319
  • 1
  • 9
5
votes
1 answer

How to handle cycles in minimax algorithm

For example, I am implementing AI for turn based game and have enough computational resources for build full game tree. My problem is the game can be infinite if both players will repeat moves and my minimax implementation stucks because game tree…
5
votes
3 answers

In logistic regression, why is the binary cross-entropy loss function convex?

I am studying logistic regression for binary classification. The loss function used is cross-entropy. For a given input $x$, if our model outputs $\hat{y}$ instead of $y$, the loss is given by $$\text{L}_{\text{CE}}(y,\hat{y}) = -[y \log \hat{y} +…
hanugm
  • 3,820
  • 3
  • 24
  • 56
5
votes
1 answer

In the field of Deep Learning research, what considerations do researchers take into account when inventing new neural network models?

I am not a researcher, but I am curious to know what considerations are relevant to take into account during research for the invention of a new neural network model, and what relevant knowledge researchers typically possess in the area. And an…
5
votes
2 answers

Is an embedding a representation of a word or its meaning?

What does the term "embedding" actually mean? An embedding is a vector, but is that vector a representation of a word or its meaning? Literature loosely uses the word for both purposes. Which one is actually correct? Or is there anything like: A…
hanugm
  • 3,820
  • 3
  • 24
  • 56
5
votes
2 answers

What makes a transformer a transformer?

Transformers are modified heavily in recent research. But what exactly makes a transformer a transformer? What is the core part of a transformer? Is it the self-attention, the parallelism, or something else?
5
votes
2 answers

What does Brooks mean by "representation"?

For a class, I'm reading Brooks' "Intelligence without representation". The introduction is dedicated to slating representation as a focus for AI development. I've read that representation is the problem of representing information symbolically, in…
Jansky
  • 153
  • 2
5
votes
4 answers

What are the typical sizes of practical/commercial artificial neural networks?

I'm interested in artificial neural networks (ANN) and I wonder how big ANNs in practical use are, for example, Tesla Autopilot, Google Translate, and others. The only thing I found about Tesla is this one: "A full build of Autopilot neural…
Mitarano
  • 53
  • 1
  • 6
5
votes
2 answers

Why can't cognitive architectures achieve general intelligence?

Newbie here. I recently read about cognitive architectures (see: https://en.wikipedia.org/wiki/Cognitive_architecture). They are supposed to be modeled after the human mind and represent a promising approach towards artificial general intelligence…
Anonymous
  • 69
  • 1
5
votes
1 answer

Are neurons instantly feed forward when input arrives?

Let's say I have a neural network with 5 layers, including the input and output layer. Each layer has 5 nodes. Assume the layers are fully connected, but the 3rd node in the 2nd layer is connected to the 5th node in the 4th layer. All these numbers…
Miemels
  • 389
  • 2
  • 11
5
votes
2 answers

What are the major differences between multi-armed bandits and the other well-known algorithms (DQN, A3C, PPO, etc)?

I have studied in the past different algorithms, i.e. DQN, DDQN, REINFORCE, A3C, PPO, TRPO, so on. I am doing an internship this summer where I have to use a multi-armed bandit (MAB). I am a bit confused between MAB and the other above…
5
votes
1 answer

Why object detection algorithms are poor in optical character recognition?

OCR is still a very hard problem. We don't have universal powerful solutions. We use the CTC loss function An Intuitive Explanation of Connectionist Temporal Classification | Towards Data Science Sequence Modeling With CTC | Distill which is very…
user40943
5
votes
1 answer

What does "statistical efficiency" mean in this context?

Consider the following statement(s) from Deep Learning book (p. 333, chapter 9: Convolutional Networks) by Ian Goodfellow et al. Convolution is thus dramatically more efficient than dense matrix multiplication in terms of the memory requirements…
hanugm
  • 3,820
  • 3
  • 24
  • 56