Most Popular

1500 questions
5
votes
3 answers

Is it possible to tell the Reinforcement Learning agent some rules directly without any constraints?

I try to apply RL for a control problem, and I intend to either use Deep Q-Learning or SARSA. I have two heating storage systems with one heating device, and the RL agent is only allowed to heat up 1 for every time slot. How can I do that? I have…
PeterBe
  • 256
  • 2
  • 12
5
votes
1 answer

How to decide a train-test split?

In almost every ML model, a train-test (or train-test-val split) is critical to assess the model's performance. However, I have always wondered what the rationale is to decide a particular train-test split. I've seen that some people like an 80-20…
user48670
5
votes
1 answer

Why is the change in cost wrt bias in neural network equal to error in the neuron?

While reading the book on neural networks by Michael Nielson, I had a problem understanding equation (BP3), which is $$ \frac{\partial C}{\partial b_{j}^{l}}=\delta_{j}^{l} \tag{BP3}\label{BP3}, $$ which can be translated to plain English as…
Madhusoodan P
  • 151
  • 1
  • 4
5
votes
1 answer

Is there a widely accepted definition of the width of a neural network?

The depth of a neural network is equal to the total number of layers in the neural network (excluding the input layer by convention). A neural network with "many layers" is called a deep neural network. On the other hand, the width is the name of a…
hanugm
  • 3,820
  • 3
  • 24
  • 56
5
votes
2 answers

Should I repeat lengthy deep learning experiments to average results ? How to decide how many times to repeat?

I am doing my MSc thesis on deep learning. My model takes many hours to train. Part of what I do is trying different parameters and settings hoping that they will achieve different results. But I often notice that the result differences are too…
Manveru
  • 221
  • 1
  • 3
5
votes
2 answers

What is the difference between edge computing and federated learning?

I recently read about federated learning introduced by Google, but it seems to be like edge computing. What is the difference between edge computing and federated learning?
5
votes
3 answers

Can I always use "encoding" and "embedding" interchangeably?

This question is restricted to the text domain only. The meaning of the word "encode" is Convert (information or instruction) into a particular form. One which performs encoding is called an encoder. In deep learning, an encoder can also be the…
hanugm
  • 3,820
  • 3
  • 24
  • 56
5
votes
2 answers

Can an AGI convince another AGI to modify its code?

Let's suppose there are two AGIs, $A$ and $B$. Assume that $B$ has the ability to modify $A$, but this action of modifying is considered bad by $B$. Can $A$ ever convince $B$ to modify $A$?
5
votes
1 answer

What is expectation-maximization in machine learning?

While studying machine learning algorithms, I often see the term "expectation-maximisation" (or EM), and how it is used to estimate parameters, where the model depends on unobserved latent variables. The way I see it, it is like a…
Guest2000
  • 305
  • 1
  • 4
5
votes
1 answer

How do recommendation systems work?

How do recommendation systems (e.g. on Youtube) work? Apparently, every user gets different recommendations depending on his location, his past liked videos, etc. So it would seem like a training model is applied to every single user, but that can't…
Guest2000
  • 305
  • 1
  • 4
5
votes
1 answer

When would you use Evolutionary Strategies over Step-Based Reinforcement Learning

In Salimans et al, 2016, the authors argue that ES should be considered a competitive alternative to MDP-based RL algorithms like Q-Learning, TRPO. However, in practice, I notice that more often than not ES takes far more episodes to converge than…
ganto
  • 151
  • 1
5
votes
1 answer

What's the difference between a "perceptron" and a GLM?

In a comment to this question user nbro comments: As a side note, "perceptrons" and "neural networks" may not be the same thing. People usually use the term perceptron to refer to a very simple neural network that has no hidden layer. Maybe you…
R.M.
  • 308
  • 1
  • 6
5
votes
2 answers

Are calculus and differential geometry required for building neural networks?

I've been studying geometry and linear algebra for months with the goal to build neural networks. But now I'm reading that perceptrons require fitting curves, and curves are not expressed as linear functions. So, I might need to study differential…
user456280
  • 171
  • 5
5
votes
2 answers

How to check whether my loss function is convex or not?

Loss functions are useful in calculating loss and then we can update the weights of a neural network. The loss function is thus useful in training neural networks. Consider the following excerpt from this answer In principle, differentiability is…
hanugm
  • 3,820
  • 3
  • 24
  • 56
5
votes
3 answers

What is Lipschitz constraint and why it is enforced on discriminator?

The following is the abstract for the research paper titled Improved Training of Wasserstein GANs Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN…
hanugm
  • 3,820
  • 3
  • 24
  • 56