# Bayesian statistics

Continuation of https://matchmaticians.com/questions/qse0ka .
I'm still struggling to understand Bayesian statistics, so I would like a comparison of MLE and the Bayesian method (or whatever it is called), please.
The thing that confuses me the most is the notion of a parameter.
In MLE I have $p_{data}$: some unknown distribution which generated the samples.
Then there is $p_{model}$ with parameters θ: the model which I believe best describes the samples.
So my goal in MLE is to find the parameter values θ for my model (distribution) that maximize the probability of observing the given samples under the assumed distribution.
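
In symbols, this is how I understand the MLE goal. The samples $x_1, \dots, x_n$ and the assumption that they are independent are my own notation, not something taken from the earlier answer:

$$\hat{\theta}_{MLE} = \arg\max_{\theta} \prod_{i=1}^{n} p_{model}(x_i; \theta)$$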

Let's consider an example which @Mathe showed for Bayesian:

Suppose we are interested in knowing the efficacy of a treatment to heal a person, that is, we are interested in the probability p of a person being cured after following the treatment. We know that the treatment was administered to 100 patients, and 60 were cured.
Now, we want to follow a Bayesian approach, so we treat the unknown probability p as a random quantity. Based on previous studies, we believe the efficacy of the treatment could be modeled by a beta distribution with parameters α=3,β=5. We want to see how the recent sample of treated patients can update our beliefs about the treatment.

So we start by describing our prior knowledge (from previous observations), and we believe it can be described by a beta distribution with parameters α=3, β=5, right?

The prior then takes the form of $p_{prior}(p) \propto p^{3-1} (1-p)^{5-1}$ and the likelihood function is $p_{data}(x|p) \propto p^{60}(1-p)^{40}$.
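
To check that I follow the arithmetic, here is a minimal sketch of the update as I understand it, assuming the posterior is proportional to prior times likelihood (which gives another beta distribution). The variable names are mine and purely illustrative.

```python
# Sketch of the Beta prior + binomial data update for the example above.
# All variable names are illustrative, not from the original answer.

alpha_prior, beta_prior = 3, 5   # Beta prior parameters from the example
cured, treated = 60, 100         # observed data from the example

# MLE: the single value of p that maximizes p^60 * (1 - p)^40
p_mle = cured / treated

# Posterior is proportional to prior * likelihood:
#   p^(3-1) (1-p)^(5-1) * p^60 (1-p)^40 = p^(63-1) (1-p)^(45-1)
# which is the kernel of a Beta(63, 45) distribution.
alpha_post = alpha_prior + cured
beta_post = beta_prior + (treated - cured)

# The mean of a Beta(a, b) distribution is a / (a + b)
posterior_mean = alpha_post / (alpha_post + beta_post)

print("MLE of p:", p_mle)
print("Posterior: Beta(%d, %d)" % (alpha_post, beta_post))
print("Posterior mean of p:", posterior_mean)
```

So MLE gives one number for p, while the Bayesian update gives a whole distribution over p, whose mean sits between the prior mean and the MLE.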

Here is what confuses me: why is the probability $p$ a parameter? I understand a parameter as part of some parametric probability distribution.
Also, @Mathe mentioned at the start of the answer that a parameter is part of $p_{model}$. What is $p_{model}$ here?

The other calculations are clear to me. I mean, I can carry out the Bayesian computation, but I still don't understand how it works.

