Highest Voted Questions - Artificial Intelligence Stack Exchange

5

votes

3 answers

Is it possible to tell the Reinforcement Learning agent some rules directly without any constraints?

I try to apply RL for a control problem, and I intend to either use Deep Q-Learning or SARSA. I have two heating storage systems with one heating device, and the RL agent is only allowed to heat up 1 for every time slot. How can I do that? I have…

reinforcement-learning

asked Oct 13 '21 at 11:59

PeterBe

256
2
12

5

votes

1 answer

How to decide a train-test split?

In almost every ML model, a train-test (or train-test-val split) is critical to assess the model's performance. However, I have always wondered what the rationale is to decide a particular train-test split. I've seen that some people like an 80-20…

asked Oct 12 '21 at 14:13

user48670

5

votes

1 answer

Why is the change in cost wrt bias in neural network equal to error in the neuron?

While reading the book on neural networks by Michael Nielson, I had a problem understanding equation (BP3), which is $$ \frac{\partial C}{\partial b_{j}^{l}}=\delta_{j}^{l} \tag{BP3}\label{BP3}, $$ which can be translated to plain English as…

asked Apr 21 '17 at 10:51

Madhusoodan P

151
1
4

5

votes

1 answer

Is there a widely accepted definition of the width of a neural network?

The depth of a neural network is equal to the total number of layers in the neural network (excluding the input layer by convention). A neural network with "many layers" is called a deep neural network. On the other hand, the width is the name of a…

asked Sep 22 '21 at 01:07

hanugm

3,820
3
24
56

5

votes

2 answers

Should I repeat lengthy deep learning experiments to average results ? How to decide how many times to repeat?

I am doing my MSc thesis on deep learning. My model takes many hours to train. Part of what I do is trying different parameters and settings hoping that they will achieve different results. But I often notice that the result differences are too…

asked Sep 18 '21 at 18:25

Manveru

221
1
3

5

votes

2 answers

What is the difference between edge computing and federated learning?

I recently read about federated learning introduced by Google, but it seems to be like edge computing. What is the difference between edge computing and federated learning?

asked Apr 17 '17 at 10:06

Karan Nadagoudar

285
1
5

5

votes

3 answers

Can I always use "encoding" and "embedding" interchangeably?

This question is restricted to the text domain only. The meaning of the word "encode" is Convert (information or instruction) into a particular form. One which performs encoding is called an encoder. In deep learning, an encoder can also be the…

asked Sep 10 '21 at 02:06

hanugm

3,820
3
24
56

5

votes

2 answers

Can an AGI convince another AGI to modify its code?

Let's suppose there are two AGIs, $A$ and $B$. Assume that $B$ has the ability to modify $A$, but this action of modifying is considered bad by $B$. Can $A$ ever convince $B$ to modify $A$?

asked Apr 11 '17 at 14:46

Logan Young

61
4

5

votes

1 answer

What is expectation-maximization in machine learning?

While studying machine learning algorithms, I often see the term "expectation-maximisation" (or EM), and how it is used to estimate parameters, where the model depends on unobserved latent variables. The way I see it, it is like a…

asked Mar 28 '17 at 16:31

Guest2000

305
1
4

5

votes

1 answer

How do recommendation systems work?

How do recommendation systems (e.g. on Youtube) work? Apparently, every user gets different recommendations depending on his location, his past liked videos, etc. So it would seem like a training model is applied to every single user, but that can't…

asked Mar 28 '17 at 15:37

Guest2000

305
1
4

5

votes

1 answer

When would you use Evolutionary Strategies over Step-Based Reinforcement Learning

In Salimans et al, 2016, the authors argue that ES should be considered a competitive alternative to MDP-based RL algorithms like Q-Learning, TRPO. However, in practice, I notice that more often than not ES takes far more episodes to converge than…

asked Sep 01 '21 at 07:27

ganto

151
1

5

votes

1 answer

What's the difference between a "perceptron" and a GLM?

In a comment to this question user nbro comments: As a side note, "perceptrons" and "neural networks" may not be the same thing. People usually use the term perceptron to refer to a very simple neural network that has no hidden layer. Maybe you…

perceptron

asked Aug 07 '21 at 13:47

R.M.

308
1
6

5

votes

2 answers

Are calculus and differential geometry required for building neural networks?

I've been studying geometry and linear algebra for months with the goal to build neural networks. But now I'm reading that perceptrons require fitting curves, and curves are not expressed as linear functions. So, I might need to study differential…

asked Aug 06 '21 at 05:13

user456280

171
5

5

votes

2 answers

How to check whether my loss function is convex or not?

Loss functions are useful in calculating loss and then we can update the weights of a neural network. The loss function is thus useful in training neural networks. Consider the following excerpt from this answer In principle, differentiability is…

asked Aug 02 '21 at 02:21

hanugm

3,820
3
24
56

5

votes

3 answers

What is Lipschitz constraint and why it is enforced on discriminator?

The following is the abstract for the research paper titled Improved Training of Wasserstein GANs Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN…

asked Jul 29 '21 at 10:23

hanugm

3,820
3
24
56

Most Popular