Articles | Volume 25, issue 16
https://doi.org/10.5194/acp-25-9357-2025
Technical note | 27 Aug 2025

Technical note: General formulation for the distribution problem – prognostic assumed probability density function (PDF) approach based on the maximum-entropy principle and the Liouville equation

Jun-Ichi Yano, Vincent E. Larson, and Vaughan T. J. Phillips
Abstract

A general formulation for the distribution problem is presented, which is applicable to frequency distributions of subgrid-scale variables and hydrometeor size distributions, as well as to probability distributions characterizing data uncertainties. The general formulation is presented based upon two well-known basic principles: the maximum-entropy principle and the Liouville equation. The maximum-entropy principle defines the most likely general distribution form if necessary constraints are specified. This paper proposes to specify these constraints as the output variables to be used in a host model. Once a general distribution form is defined, the problem of the temporal evolution of the distribution reduces to that of predicting a small number of parameters characterizing it. This paper derives prognostic equations for these parameters from the Liouville equation. The developed formulation, which is applicable to a wide range of atmospheric modeling problems, is specifically applied to the condensation growth of cloud droplets as a demonstration.

1 Introduction

The present paper considers the distribution problem in a general manner. Regarding the problems of distributions, at least three examples are identified in atmospheric sciences.

The first and perhaps most obvious example is the problem of determining the distribution of a variable over a domain as a distribution density function (DDF). Typically, the domain corresponds to a grid box in a numerical simulation, and the obtained distribution is used for calculating subgrid-scale characteristics that are required by a host model. This problem may be called the subgrid-scale distribution problem. A specific application of this problem is the determination of the fractional area occupied by clouds within a grid box (Sommeria and Deardorff, 1977; Mellor, 1977; Bougeault, 1981; Le Treut and Li, 1991; Bechtold et al., 1992, 1995; Richard and Royer, 1993; Bony and Emanuel, 2001; Golaz et al., 2002; Tompkins, 2002). Arguably all subgrid-scale processes may be represented under this subgrid-scale distribution framework (cf. Yano, 2016).

The second example is the size distribution of hydrometeor particles (condensed water, ice species, etc.). Information on particle size distributions (PSDs) is crucial for predicting various conversion processes from one hydrometeor type to another as well as for evaluating fall-out rates of those hydrometeors (cf. Khain et al., 2015; Khain and Pinsky, 2018). The rates of all these processes depend sensitively on the hydrometeor particle size.

The third and perhaps most abstract example of the distribution is the probability. The notion of probability appears in many aspects of the atmospheric sciences (e.g., Yano and Manzato, 2022). Here, we especially keep in mind applications to data assimilation, in which data uncertainties are measured by probability distributions (cf. Wikle and Berliner, 2007).

The present paper considers all three of these problems under a single framework. It is possible to consider these three qualitatively different problems together because the time evolution of all these distributions can, in principle, be predicted by similar equations. The time evolution of both the subgrid-scale distribution (DDF) and the probability density function (PDF) is predicted by the Liouville equation (cf. Sect. 3.5). Here, it is hard to overemphasize the clear difference between them; to put it simply, distribution and probability are distinctly different concepts. Unfortunately, in the literature, both are often referred to as PDFs. We follow this custom to some extent, but we will add “DDF” in parentheses whenever it is not cumbersome. Thus, though these two problems deal with different types of distributions, both are governed by the same equation. The time evolution of PSDs is, fundamentally, described by the stochastic-collection equation, with additional tendency terms added to it (cf. Sect. 3.6). Though its form is not identical to the Liouville equation, it can still be considered in an analogous manner. Thus, it becomes possible to deal with these three different problems under a single framework.

All three of these problems also suffer from the same difficulty: direct use of these fundamental equations (Liouville and stochastic-collection) turns out to be very expensive computationally. In data assimilation, an ensemble-forecast method is adopted as an alternative, but the difficulty of generating a statistically large enough ensemble remains. Thus a numerically more efficient method must be sought in order to make them practically useful. This difficulty has been, so far, addressed separately in these three problems. The originality of the present work is to simply point out that all three of these computational problems can be considered under a common framework.

In the subgrid-scale distribution problem, a main strategy is to assume a simple distribution form that is specified by a small number of parameters, sometimes termed “PDF parameters”. The PDF parameter values, and hence the distribution itself, evolve with time as conditions change from, say, overcast cloud to cumulus cloud. Hence the time evolution of the PDF parameters must be predicted. Typically, this is done by first predicting a set of moments of the distribution and then inverting the set to solve for the PDF parameters. This general approach can be called an assumed-PDF method. One of the earliest applications of the idea to turbulence is by Lockwood and Naguib (1975). See also an early review by Pope (1979).

A main strategy in cloud microphysics is to integrate the full information over all possible particle sizes. Thus, the distribution problem becomes implicit. This approach is called bulk. However, to perform integrals over particle size, we need to assume a certain distribution form, which is typically exponential or the gamma distribution. In this manner, we see a clear link between the “bulk” microphysical approaches and the “assumed-PDF” approach.

In data assimilation, typically, a decision is made to focus only on the mean, variance, and some covariances of a probability distribution. As a result, a full probability is not explicitly considered. Here, typically, an “assumed PDF” is Gaussian.

For overviews on how these three problems are constructed using the distributions, we refer to Machulskaya (2015) for the subgrid-scale distribution problem; Seifert and Beheng (2001, 2006), Khain et al. (2015), and Khain and Pinsky (2018) for microphysics; and Bannister (2017), Jazwinski (1970), and Carrassi et al. (2018) for data assimilation. Among these three problems, the role of probability may be least obvious in data assimilation, especially for those who only consult the final formulation of the standard variational principle. Here, we specifically refer to Sect. 2 of Carrassi et al. (2018), in which a more formal formulation in terms of the probability is presented.

The purpose of this study is to present a more coherent and self-contained formulation for the distribution problems under the framework of those assumed-PDF approaches in a general sense. The assumed-PDF approaches solve only about half of the whole problem. Two major issues remain to be resolved: (i) the choice of an assumed-PDF form and (ii) methodologies for efficiently evaluating the time evolution of the introduced PDF parameters. The present study proposes solutions to these two major issues. Currently, there is no clear guiding principle for the first issue (cf. Sect. 3.2.2). The second issue is currently dealt with by relating the PDF parameters and moments to each other, but the conversion from moments to parameters is not guaranteed to be simple or analytic (cf. Sect. 3.2.1). By addressing these two major issues, the present study generalizes the formulation of the existing assumed-PDF approaches, developed for the subgrid-scale distribution problem, to more general distribution problems. In doing so, the present study links the subgrid-scale distribution problem to the distribution problem more generally, as found in, for example, cloud microphysics and data assimilation. Conversely, the generalized formulation of the present study reduces to the existing assumed-PDF approaches upon introducing additional approximations and assumptions (cf. Sect. 5.4).

These two questions are addressed separately in Sects. 4 and 5, respectively. Thus, the two methodologies can also be adopted independently if desired. Yet, the present study suggests that the most coherent formulation for the assumed-PDF approaches is obtained by adopting both methodologies together.

In this manner, the present work also constitutes an effort to develop a unified and coherent formulation for subgrid-scale representations (Yano, 2016; Yano et al., 2014, 2018). Readers are encouraged to refer to those papers for the authors' general perspectives on subgrid-scale representations; see also Yano (2015a, b). Refer especially to Yano (2016) for general discussions of the subgrid-scale distribution problem. Keep in mind that the subgrid-scale representation problem is considered exclusively from the perspective of the DDF in the present study, though without excluding the possibilities of alternative approaches as discussed in Yano (2016). More generally, fundamental research, as pursued in the present study, is crucial for the improvement of subgrid-scale parameterizations (Yano et al., 2014). Even the current operational numerical forecasts, which are based on a premise of high-resolution cloud modeling, may break down without such fundamental research to support them (Yano et al., 2018).

The paper begins in the next section by introducing a basic governing equation system that is adopted throughout the paper to construct the general formulation for the distribution problems. As seen therein, the adopted governing equation is general enough that the formulation of the present study can be applied to more or less any problems that can be expected in atmospheric sciences. Section 3 reviews our basic knowledge about the distribution problem. The moment concept is first introduced in Sect. 3.1 because it is so central in the current approaches. The basic idea of the assumed-PDF approach is outlined in Sect. 3.2, where its basic problems are also pointed out. The maximum-entropy principle, which defines the form for the most likely general distribution under given constraints, is introduced in Sect. 3.3, and its applications are further discussed in Sect. 3.4. The Liouville equation is introduced in Sect. 3.5, and the stochastic-collection equation is separately introduced in Sect. 3.6.

Both the maximum-entropy principle and the Liouville equation play key roles in the present study, albeit in different manners, in resolving the two aforementioned major issues. The first issue is addressed by taking the maximum entropy as a guiding principle. A key open question here is the precise conditions to be posed under this principle in order to define a PDF (DDF). A standard procedure is to take what we already know as a specified condition of a system (e.g., the total kinetic energy of an ideal-gas system). An original argument of the present paper is to propose to constrain the form of the PDF (DDF), instead, by the quantities that are required, for example, for the purpose of prediction (e.g., cloud fraction, precipitation rate) in modeling. This argument is developed in Sect. 4 by discussing the contrasts between standard statistical problems and those typically addressed in atmospheric sciences. The second issue is addressed in Sect. 5 by deriving, from the Liouville equation, a general form of prognostic equations for predicting the time evolution of the PDF parameters introduced under an assumed PDF. The derived general formulation is applied to the illustrative example of the condensation growth of cloud droplets in Sect. 6. The paper is concluded in Sect. 7.

It is emphasized that this work presents a new formulation by addressing the two aforementioned major problems in the assumed-PDF approaches, rather than solving any specific problem. For this reason, the basic style is to present basic principles first in a straightforward and concise manner. We choose this style for ease of referring to the formulations, especially given that the presented formulation can be applied to almost any distribution problem in atmospheric sciences. Specific examples are gradually introduced so that a more concrete picture of the methodology gradually emerges. Readers who would like to see a concrete example first are advised to go to Sect. 6 directly and work backwards. Also for this reason, the governing equation system to be considered in the present study is introduced in a standalone manner separately in the next section so that the range of applicability of the present study to various atmospheric problems becomes immediately clear. In line with the generality intended in the present study, all the equations are presented in nondimensional form, setting the various physical constants to unity for convenience, throughout the paper.

2 Governing equation system

Many physical variables, say ϕ, of the atmospheric flows are advective; thus they are governed by an equation of the following form:

(1) ∂ϕ/∂t = −v · ∇ϕ + F.

Here, v is the velocity, and F designates all the tendencies (sources) contributing to the variable, ϕ, apart from advection, which is represented by the first term on the right-hand side. The source, F, generally depends on the variable, ϕ, and possibly also on time, t, and space. For instance, in a cloud macrophysical application, ϕ is the liquid water content or the number density of liquid droplets with a particular radius. Equations (2.9), (2.10), and (2.11) of Machulskaya (2015) are examples of equations of the form of Eq. (1) considered in the context of the DDF problem. Yano et al. (2005) and Yano (2016) show that the basic formulations for the subgrid-scale parameterizations can be reproduced by simply examining this general form (Eq. 1). More specifically, Yano (2014) shows that all the essential, basic standard formulas for the mass-flux convection parameterization can be reproduced from a general governing equation, Eq. (1).

With an ultimate application to the systems described by governing equations of the form of Eq. (1) in mind, for ease of the deductions in the following, the present study focuses on a case with no spatial dependence in the above:

(2) dϕ/dt = F.

Yet, in spite of this restriction, and without arguing for any general physical relevance, it is worth emphasizing that the source term, F, in Eq. (2) can include any type of physical process that is locally defined. Most microphysical processes, for example, fall into this category. More importantly, this restriction does not have any serious consequences, because the final general formulation for predicting the assumed-PDF parameters in Sect. 5.1 is easily generalized to cases with spatial dependence (cf. Sect. 5.3).

To maintain the generality of the formulation, the term, F, is left unspecified in considering the time evolution of various types of distributions in Sect. 5. As a result, the formulations presented in the following, especially our key result given by Eqs. (50a) and (50b), are applicable to any types of physics. All we have to do is to specify the form, F, as required in applications. As a specific example, the source term is set to F=1/r in Sect. 6 (cf. Eq. 76a) in considering the condensation growth of a droplet with a radius, r. We also examine the behavior of systems with mathematically simple forms for F in Sect. 5.5 as well as in Appendix C. Keep in mind that in Eq. (1), the source term, F, can be space dependent, say, involving spatial derivatives. However, in considering Eq. (2) in the following, this possibility will be excluded for ease of analysis. Furthermore, in the present study, the source term, F, is assumed to be deterministic, except for the case of the Brownian motion considered in Sect. 4.4.
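As a concrete illustration of Eq. (2), the condensation-growth source F = 1/r admits the exact solution r(t) = (r0² + 2t)^(1/2), which a direct numerical integration should reproduce. The following sketch (the step size and initial radius are illustrative choices, not values from the paper) checks this in nondimensional units:

```python
import math

# Direct integration of Eq. (2) with the condensation-growth source F = 1/r.
# The exact solution of dr/dt = 1/r is r(t) = sqrt(r0**2 + 2*t), used as a check.
# Step size dt and initial radius r0 are illustrative, nondimensional values.

def grow(r0, t_end, dt=1e-4):
    """Forward-Euler integration of dr/dt = 1/r."""
    r, t = r0, 0.0
    while t < t_end:
        r += dt / r
        t += dt
    return r

r0, t_end = 1.0, 2.0
numeric = grow(r0, t_end)
exact = math.sqrt(r0**2 + 2.0 * t_end)
print(numeric, exact)  # the two agree to within the Euler truncation error
```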

Throughout the study, only cases with a single variable, ϕ, are considered explicitly for economy of presentation. However, when multiple variables are involved in a problem, as is typically the case in realistic applications, the only modification required is to replace the scalar, ϕ, by a vector. Examples of systems with multiple dimensions are presented in Yano (2024). Probably a more serious restriction in the following development of the formulation is the consideration of only cases with no spatial dependence. However, as it turns out, the generalization of the final formulation to spatially dependent cases with Eq. (1) is fairly straightforward, as discussed in Sect. 5.3.

In the present work, we proceed with the hypothesis that the physics of a system is already completely known in the form of Eq. (2), with a forcing term, F, completely specified. In practical applications, this hypothesis is satisfied by specifying all the terms in a system in a closed form, as parameterizations, if required. Especially, in applying the general formulation of the present study to the subgrid-scale distribution problems, this hypothesis means that we fully know the governing equation system of the small scales to be parameterized; cf. Sect. 4.3 for further discussion. In this spirit, for example, Yano (2014) proceeds with the hypothesis that we know perfectly the equations for cloud-resolving modeling and reconstructs the standard mass-flux-based convection parameterization based on this hypothesis. In the same way, the present method solves a subgrid-scale distribution problem assuming that we know the full equations for all the scales of a system. As will be seen, for this reason, we do not need any turbulence closures (cf. Mellor, 1973; Mellor and Yamada, 1974) in the present formulation.

3 Basic principles

The purpose of this section is to summarize well-known basic principles for describing PDFs (DDFs).

3.1 Distributions and moments

Let a PDF (DDF) for a variable, ϕ, be denoted by p(ϕ). Moments, ⟨ϕ^n⟩ (n = 1, 2, …), can be constructed from a given distribution, p, by

(3) ⟨ϕ^n⟩ = ∫ ϕ^n p dϕ.

Here, an unspecified integral range may be taken from −∞ to +∞ with many of the physical variables, but some physical variables are semi-positive definite (e.g., temperature, mixing ratios). In the latter case, the integral range above must be from 0 to +∞.

The series of moments may be interpreted analogously to the Taylor series, in the sense that it constrains a function. However, unlike the latter, there is no closed analytical formula for reconstructing the original distribution from a given series of moments: although a series of moments can be derived from a given distribution in a straightforward manner, the reverse is hardly the case. This is in spite of the extensive literature on the subject (e.g., Daniels, 1954; Butler, 2007; Dang and Xu, 2019). On the other hand, the usefulness of the moments for describing the turbulent flows can hardly be overemphasized (e.g., Stull, 1988; Garratt, 1992) either.
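The forward direction noted above, computing moments from a given distribution via Eq. (3), is indeed straightforward numerically. A minimal sketch, assuming a unit-mean exponential distribution and a truncated quadrature grid (both illustrative choices):

```python
import numpy as np

# Forward evaluation of Eq. (3): moments <phi^n> from a given p(phi) by
# numerical quadrature. Here p is a unit-mean exponential distribution,
# a common particle-size choice; the grid truncation at 50 is illustrative.

phi = np.linspace(0.0, 50.0, 200_001)
dphi = phi[1] - phi[0]
p = np.exp(-phi)                 # p(phi) = e^{-phi} on [0, inf)

def moment(n):
    """<phi^n> = integral of phi**n * p(phi) over the grid."""
    return float(np.sum(phi**n * p) * dphi)

# For the exponential distribution, <phi^n> = n!
print([round(moment(n), 3) for n in range(4)])  # -> [1.0, 1.0, 2.0, 6.0]
```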

3.2 Assumed PDF (DDF)

3.2.1 General formulation

The basic idea of an assumed-PDF (DDF) approach is to introduce a generic form of PDF (DDF) characterized by a few free parameters, say λi (i = 0, …, N, where N is kept as small as possible), but in such a manner that the distribution of a variable of concern can be represented:

(4a) p = p(ϕ, λ0, λ1, …, λN).

Here, λ0 will be used to designate a constant factor for a normalization of a distribution throughout the paper whenever the assumed-PDF formulation is discussed in a general manner. On the other hand, p0 will be adopted for the normalization factor whenever a specific PDF form is discussed; the latter choice is consistent with the fact that notations different than λi (i≠0) are also often adopted for the other assumed-PDF parameters.

Importantly, the distribution, p, is related to the cumulative probability, P, by

(4b) p = dP/dϕ,

where P is more precisely defined as the probability that the variable takes a value less than a specified threshold ϕ; i.e., P(ϕ) is the cumulative probability evaluated at ϕ.

Examples of distributions taking the form of Eq. (4a) are discussed in subsequent subsections (e.g., Eqs. 24 and 25, setting p0=λ0 therein). Once the functional form of a distribution is constrained by Eq. (4a), the problem of determining the distribution p(ϕ), which must be defined for every value of ϕ, reduces to that of determining a given finite set of parameters {λi}, which evolve with time by following the evolution of the distribution. However, keep in mind that the parameters {λi} should not depend on ϕ for obvious reasons. Note that {*} indicates a set of parameters throughout the paper.

Here, it is important to remember that an assumed-PDF form is only an approximation; to state this fact more emphatically, it may be better to state it as

(4c) p = p(ϕ, λ0, λ1, …, λN) + ε,

with ε designating the possible error under this approximation. Yet, in the following deductions, this error term is mostly neglected, except for a couple of places where it is added as a reminder.

An important exception to the above rule arises when the assumed PDF is an exact solution of the original equation: if the initial PDF follows the assumed form, there is no error. We may further expect that the error remains small even if the initial PDF does not follow the assumed form. Otherwise, there would be no way for an assumed-PDF form to predict the evolution of a distribution in any accurate manner.

The parameters {λi} defining the distribution (Eq. 4a) may be determined, for example, from a known set of moments, ⟨ϕ^n⟩ (Eq. 3); i.e.,

(5) λi = λi({⟨ϕ^n⟩}).

The prognostic equations for these moments, or diagnostic approximations of these equations, are, in turn, known from, e.g., the turbulence theories for the system (Eq. 1) in the context of the subgrid-scale distribution problem; thus the problem is closed in this manner. That is the current basic strategy of the assumed-PDF (DDF) approach (Larson, 2022).

However, there are problems with this strategy. First, the functional form of a PDF (DDF), p(ϕ,{λi}), must somehow be prescribed. However, no clear principle has been identified. A main thread of this paper is to use the maximum-entropy principle (cf. Sect. 3.3) for this purpose. The second problem is the difficulty of deriving a closed expression (Eq. 5) for defining the PDF (DDF) parameters from a given set of moments. Here, it is straightforward to compute the moments from a given PDF (DDF); thus we can readily write this down as

(6) ⟨ϕ^n⟩ = ⟨ϕ^n⟩({λi}).

However, inverting Eq. (6) into Eq. (5) is often not at all trivial due to the nonlinearity of the former, and the PDF parameters are defined only in an implicit manner from a set of moments. See Eq. (6) of Milbrandt and Yau (2005) for an example. Often, extra assumptions and approximations are required to make this inversion possible (cf. Machulskaya, 2015). Alternatively, an iterative procedure can be adopted in order to invert a given set of moments and deduce the PDF parameter values (e.g., Lewellen and Yoh, 1993). The difficulty of the inversion is exacerbated by predicting more (higher-order) moments and also by moving to multivariate PDFs. In Sect. 5, we will show, by more explicitly invoking the Liouville equation as introduced in Sect. 3.5, how a prognostic set of equations for {λi} can be written down explicitly. These equations are closed in the sense that no further inversion is required.
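The inversion difficulty can be illustrated with a small numerical sketch. Here we assume, purely for illustration, a Gaussian truncated to [0, ∞) as the assumed PDF; its moments (Eq. 6) have no simple closed-form inverse, so Eq. (5) is evaluated by a Newton iteration with a finite-difference Jacobian, in the spirit of the iterative procedures cited above (grids, tolerances, and function names are illustrative):

```python
import numpy as np

# Sketch of the moment inversion, Eq. (5), for an assumed PDF whose moments
# lack a closed-form inverse: a Gaussian truncated to [0, inf) with free
# parameters (mu, sigma). Eq. (6) is evaluated by quadrature and inverted
# by a Newton iteration with a finite-difference Jacobian.

phi = np.linspace(0.0, 30.0, 60_001)
dphi = phi[1] - phi[0]

def moments(mu, sigma):
    """First two moments of the truncated Gaussian (forward map, Eq. 6)."""
    w = np.exp(-0.5 * ((phi - mu) / sigma) ** 2)
    w /= np.sum(w) * dphi                        # normalize on [0, inf)
    return np.array([np.sum(phi * w), np.sum(phi**2 * w)]) * dphi

def invert(target, guess=(1.0, 1.0), tol=1e-8, h=1e-6):
    """Newton iteration recovering (mu, sigma) from target moments (Eq. 5)."""
    x = np.array(guess, float)
    for _ in range(50):
        f = moments(*x) - target
        if np.max(np.abs(f)) < tol:
            break
        J = np.column_stack([(moments(*(x + h * e)) - moments(*x)) / h
                             for e in np.eye(2)])
        x -= np.linalg.solve(J, f)
        x[1] = max(abs(x[1]), 1e-2)              # keep sigma positive
    return x

mu, sigma = invert(moments(2.0, 0.5))
print(mu, sigma)   # recovers approximately (2.0, 0.5)
```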

3.2.2 Choice of assumed-distribution forms

In current assumed-PDF (DDF) approaches in the context of the subgrid-scale distribution problem, distribution forms are chosen in a rather subjective manner, mostly based on computational convenience. For this reason, one popular choice is the double Gaussian, i.e., a sum of two Gaussians, for the purpose of representing skewness (e.g., Larson et al., 2002; Fitch, 2019; Naumann et al., 2013). Although those studies show some fits to distributions obtained from either observations or large-eddy simulations as support, we should not consider double-Gaussian distributions to have been verified by data; no objective comparisons with alternative possible distributions have been made.

On the other hand, observations suggest that the hydrometeor PSD follows an exponential distribution in size (Marshall and Palmer, 1948). However, Yano et al. (2016) point out the difficulty of observationally identifying the best fit for the PSDs from among the various exponential distribution forms that are derived from the maximum-entropy principle (cf. Sect. 3.3): it is indeed not possible to verify in any convincing manner that any one of those forms fits the observations better than the others, although it is possible to compare the errors of those fits.

3.3 Maximum-entropy principle: derivation

3.3.1 Derivation

To address the first issue, the choice of the PDF (DDF) form, we take the maximum entropy as a guiding principle. It must be emphasized that this is merely a mathematical principle: it is not a physical principle but one that guides the choice of an assumed-PDF form. Note especially that Boltzmann's entropy, which takes a mathematically identical form, can be derived by physical reasoning; however, it should not be confused with the information entropy in general. From a physical point of view, although the principle is plausible, there is no guarantee that it actually works. For this reason, we adopt it merely as a guide for identifying a necessary assumed form of a PDF (DDF). Such a guiding principle is useful when no other principle for choosing an assumed distribution is available. Nor should it be interpreted as a hypothesis, because a hypothesis implies something that may be disproved by experiments. The success of a guiding principle may vary from case to case; however, so long as it is used with caution, we expect it to remain useful for choosing a PDF form.

The maximum-entropy principle asks what the “most likely” distribution of a variable is under a given set of “constraints” (cf. Eq. 12 below). It simply argues that the “most likely” distribution is the distribution that is actually realized in a given system. The argument of this principle is simple and appealing enough to have gained extensive application (cf. Kapur, 1989), notably in the statistical description of geophysical flows (e.g., Robert and Sommeria, 1991; Verkley and Lynch, 2009; Verkley, 2011; Verkley et al., 2016). For this reason, the present study also invokes this principle. See Yano (2019) for further implications of this principle, as well as for further references on applications in atmospheric sciences and many other disciplines.

Here, “most likely” is defined in terms of the number of possible combinations for a given state of a variable (cf. Eq. 8 below). We develop the idea for a discrete system first for ease of explanation. Thus, we assume a variable, ϕ, takes m values, designated by ϕi (i = 1, …, m). For instance, in a cloud macrophysics application, ϕ might represent the liquid water content, whose values might be binned into m categories (0 to 1 g kg−1, 1 to 2 g kg−1, etc.). Let us assume that the total number of data (e.g., measurements, model outputs) is n and that, among them, ni take the value ϕi (i = 1, …, m). For instance, we might sample a cloud n times, each time drawing a value of liquid water content, and we might denote the number of samples that fall into the ith bin by ni. Thus, the frequency distribution of the variable, ϕ, is given by

(7) pi = ni/n,

with i = 1, …, m.

The total number of possible combinations for realizing this distribution is

(8) W = n!/(n1! ⋯ nm!).

By taking the logarithm of the above and applying Stirling's formula,

(9) log n! = n log n + O(n),

which is valid in the asymptotic limit n → ∞ for every integer involved in the definition of W, we can approximate

(10) (1/n) log W ≈ −∑_{i=1}^{m} pi log pi.

The right-hand side of Eq. (10) is the information entropy (Shannon, 1948), which we shall refer to as “entropy” for short. Thus, the problem of maximizing the number of possible combinations reduces to that of maximizing the entropy, and it leads to the notion of the maximum-entropy principle.
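The asymptotic equivalence in Eq. (10) can be checked directly by evaluating log W exactly (via the log-gamma function) for increasing n; the distribution p = (0.5, 0.3, 0.2) below is an arbitrary illustrative choice:

```python
import math

# Direct check of Eq. (10): (1/n) log W approaches the entropy
# -sum_i p_i log p_i as n grows, with W = n!/(n_1! ... n_m!) from Eq. (8).
# The distribution p below is an arbitrary illustrative choice.

def log_W(counts):
    """log of the multinomial coefficient, computed exactly via lgamma."""
    n = sum(counts)
    return math.lgamma(n + 1) - sum(math.lgamma(c + 1) for c in counts)

p = [0.5, 0.3, 0.2]
entropy = -sum(q * math.log(q) for q in p)

for n in (10, 1_000, 100_000):
    counts = [int(q * n) for q in p]   # occupation numbers n_i
    print(n, log_W(counts) / n, entropy)
```

Note that a distribution concentrated in a single bin gives W = 1 and hence zero entropy, consistent with the extreme case discussed below.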

The most extreme case of this distribution is when ϕ always takes only a particular value, say ϕj; thus pi=δij, using Kronecker's delta. In this case, there is no possibility of reshuffling the data; thus W=1 and the entropy is zero. Qualitatively, as a variable is more widely distributed, the entropy becomes larger.

A continuous version of the entropy is

(11) −∫ p log p dϕ,

where p=p(ϕ). However, some subtleties will be remarked upon later in Sect. 3.4.1.

In applying the maximum-entropy principle, here, we suppose that the distribution is constrained by L conditions given by

(12) ∫ Gl(p, ϕ) dϕ = Cl

for l = 1, …, L. Here, Gl(p, ϕ) = p σl(ϕ), where σl(ϕ) denotes functions of ϕ (they define the constraints by Eq. 12), and Cl denotes known constants. See Sect. 3.4.2 for specific examples (cf. Eq. 22) and Yano et al. (2016) for physical considerations of the choices. Also keep in mind that a distribution is normalized by

(13) ∫ p dϕ = 1.

The normalization can be considered a special case of the constraints (Eq. 12) with G0=p and C0=1 by extending the above series to l=0. Note that, exceptionally, when the PSDs are considered, C0 must be equal to the total particle number density.

Thus, the most likely distribution is obtained by maximizing Eq. (11) under the constraints (Eq. 12) with l = 0, …, L. This goal is accomplished by applying a variational principle, as defined by following a standard notation (cf. Chap. 2, Goldstein et al., 2002):

(14) δ[−∫ p log p dϕ − ∑_{l=0}^{L} λl ∫ Gl(p, ϕ) dϕ] = 0,

with Lagrange multipliers, λl. The above variation reduces to

(15) {δ/δp [∫ p log p dϕ + ∑_{l=0}^{L} λl ∫ Gl(p, ϕ) dϕ]} δp = ∫ [log p + ∑_{l=0}^{L} λ̃l (∂Gl/∂p)] δp dϕ = 0,

where the multipliers are re-set to

(16) λ̃l = { 1 + λ0, l = 0; λl, l ≠ 0 }.

Noting that ∂G0/∂p = 1 and ∂Gl/∂p = σl(ϕ) (l = 1, …, L), and further re-setting p0 to equal e^(−λ̃0), the most likely distribution under these constraints is

(17) p = p0 exp[−∑_{l=1}^{L} λl σl(ϕ)].

Here, the constants p0 and λl are determined from the constraints (Eqs. 12 and 13) by directly substituting the distribution form (Eq. 17) into them. This is the basic premise of the maximum-entropy principle: a distribution of a variable, ϕ, is completely determined from only L constraints if they are chosen properly. These L constraints determine the L parameters, {λj}, that characterize the distribution. Recall that whenever the general assumed-PDF formulation is discussed, we further re-set λ0 to equal p0 (cf. Eq. 44a).

3.3.2 Technical remarks

A rather ostensible limitation of the general result (Eq. 17) from the maximum-entropy principle is that it does not include, in any obvious manner, the possibility of a distribution that vanishes at zero, as is the case with many semi-positive definite atmospheric variables. However, this simply stems from the fact that results from the maximum entropy are not exact: the principle is based on an approximate logarithmic expression for the number, W, of possible combinations in the asymptotic limit n → ∞ (cf. Eq. 9). In this respect, the principle may be considered a special case of the large-deviation principle (e.g., Touchette, 2009): it can elucidate only the predominant exponential dependence seen in Eq. (17). A possible additional subdominant algebraic dependence is kept implicit because such a weak dependence drops out under the given asymptotic approximation. Thus, if required, an algebraic factor of, say, ϕ^μ can multiply this distribution without contradicting the given result (Eq. 17). Here, μ is an unspecified free positive parameter. This slight generalization ensures the condition p(0) = 0, as required for many atmospheric variables. See further discussions in Sect. 3.4.3 and further mathematical background in Guiasu (1977).
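The algebraic generalization just described can be made concrete: multiplying the exponential form by ϕ^μ yields a gamma-type distribution with p(0) = 0. A minimal sketch with illustrative parameter values:

```python
import numpy as np

# The phi**mu prefactor discussed above turns the exponential form of
# Eq. (17) into a gamma-type distribution with p(0) = 0, as required for
# many semi-positive definite variables. mu and lam are illustrative.

mu, lam = 1.5, 1.0
phi = np.linspace(0.0, 40.0, 40_001)
dphi = phi[1] - phi[0]
p = phi**mu * np.exp(-lam * phi)
p /= np.sum(p) * dphi                    # normalize numerically

mean = float(np.sum(p * phi) * dphi)     # gamma mean: (mu + 1)/lam
print(p[0], mean)   # -> 0.0 and approximately 2.5
```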

3.4 Maximum-entropy principle: examples

In order to better understand the general distribution given by Eq. (17), this subsection considers some special cases. Implications of the maximum-entropy principle are also remarked upon.

3.4.1 Homogeneous distribution

The simplest case for consideration is one without any constraints (i.e., L=0). Then Eq. (17) simply reduces to a homogeneous distribution:

(18) p = p 0 .

This means that a variable, ϕ, has an equal chance of having every possible value when there is nothing to constrain ϕ.

However, there are a few difficulties in applying this conclusion to arbitrary physical variables. First, the variable must be bounded both below and above in order for this distribution to be normalizable. Second, the conclusion depends on the choice of the physical variable. This is realized by noting that any physical variable, ϕ, can be transformed into another, φ, by assuming a relation, for example,

(19) \phi = \varphi^{\alpha},

with a constant α, and then the distribution is transformed by a relation

(20) p(\phi)\, d\phi = \alpha \varphi^{\alpha - 1} p(\varphi^{\alpha})\, d\varphi.

Here, recall the definition of the distribution, p, given by Eq. (4b).

Thus, although the system may be homogeneously distributed in terms of a particular variable, ϕ, it is no longer homogeneously distributed in terms of a related variable, φ. This is a contradiction: when the maximum-entropy principle is applied directly to the transformed variable, φ, a constant distribution is obtained for φ, and then the original variable, ϕ, no longer follows a constant probability, according to the relation of Eq. (20).
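This non-invariance under the transformation (Eq. 19) can be checked with a short Monte Carlo sketch (our illustration; the choice α = 2 is arbitrary):

```python
import random

# If phi is homogeneously distributed on [0, 1], the transformed variable
# varphi = phi**(1/alpha) is not: the Jacobian factor in Eq. (20) reshapes
# the density. Illustrative choice alpha = 2.
random.seed(0)
alpha = 2.0
phi = [random.random() for _ in range(100_000)]      # homogeneous in phi
varphi = [v**(1.0/alpha) for v in phi]               # transformed variable

# A homogeneous varphi would place half its mass below 0.5; instead the mass
# below 0.5 is 0.5**alpha = 0.25, as dictated by Eq. (20).
frac_phi = sum(1 for v in phi if v < 0.5)/len(phi)
frac_varphi = sum(1 for v in varphi if v < 0.5)/len(varphi)
```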

The source of this ambiguity, i.e., the result from the maximum-entropy principle depending on the choice of the distribution variable (ϕ or φ), stems from the fact that in translating a discrete expression for entropy (Eq. 10) into a continuous version (Eq. 11), it is assumed that a variable, ϕ, takes discrete values defined by a constant increment, Δϕ=(ϕm-ϕ1)/(m-1), over an interval, [ϕ1,ϕm]:

(21) ϕ i = ϕ 1 + ( i - 1 ) Δ ϕ ,

with i = 1, …, m. Then Eq. (11) is obtained from the right-hand side of Eq. (10) by multiplying the latter by Δϕ and taking the limit m→∞. Conversely, Eq. (11) can be approximated by the right-hand side of Eq. (10) multiplied by Δϕ under the discretization (Eq. 21). Note that the ambiguity with respect to an arbitrary algebraic factor is also consistent with the nature of the maximum-entropy principle, which is valid only in an asymptotic sense, as already suggested in Sect. 3.3.2.

3.4.2 Constraints by moments

When a variable is constrained by its first L moments, the general constraints (Eq. 12) reduce to

(22) \int \phi^l \, p \, d\phi = C_l,

with Cl being the value of the lth moment, i.e., σl = ϕ^l. The general distribution (Eq. 17) then reduces to

(23) p = p_0 \exp\left[ -\sum_{l=1}^{L} \lambda_l \phi^l \right].

In particular, when a system is constrained only by a mean (i.e., L=1), the distribution reduces to an exponential distribution,

(24) p = p_0 \exp\left[ -\lambda_1 \phi \right],

i.e., the probability of the first occurrence of an event under a Poisson process, and when a system is also constrained by a variance (i.e., L=2), it reduces to a Gaussian distribution,

(25) p = p_0 \exp\left[ -\lambda_2 (\phi - \langle \phi \rangle)^2 \right],

with a slight reconfiguration of the general form (Eq. 23). Here, the mean is given by ⟨ϕ⟩, with λ1 = −2λ2⟨ϕ⟩. These results are consistent with our common usage of these distributions: when only a mean (e.g., a waiting time) is of concern, an exponential distribution can be adopted. When a variance is also of interest, a Gaussian distribution is the most convenient.

3.4.3 Gamma distribution

Note that the general distribution forms obtained from the maximum-entropy principle, as seen in Eqs. (17) and (23), always take an exponential form without any algebraic factor. However, in many atmospheric applications, distributions with an algebraic dependence are observed. The best example is the gamma distribution, which is commonly adopted for representing PSDs in cloud microphysics (e.g., Khain et al., 2015). Furthermore, the gamma distribution is a favorable choice for representing various semi-positive definite variables (e.g., water vapor, mixing ratios of various microphysical water species), as argued by Bishop (2016).

The issue may be commented on from three perspectives. First, it is important to keep in mind the asymptotic nature of the maximum-entropy principle, which is derived under an asymptotic limit of n→∞. As noted in Sect. 3.3.2, for this reason, the maximum-entropy principle is best understood as a special application of the large-deviation principle, which is designed to express only the dominant exponential dependence, with the remaining subdominant algebraic dependence left implicit. From this perspective, the gamma distribution can be interpreted as a straightforward generalization of the exponential distribution, obtained by multiplying it by an arbitrary, subdominant algebraic factor.

The gamma distribution can be derived more explicitly as a consequence of a transformation of the distribution variable, as discussed in Sect. 3.4.1: by transforming to a new variable, φ, the transformed distribution can acquire an algebraic factor, as shown by Eq. (20). Lastly, it is in fact possible to obtain an algebraic dependence from the maximum-entropy principle simply by setting one of the constraints to be σl = log ϕ. The physical meaning of such a constraint is not immediately clear, but it may be a question worthy of further investigation. When the constraints are chosen to be σ1 = log ϕ and σ2 = ϕ, a gamma distribution is obtained.
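The last construction can be verified numerically. The sketch below (our illustration; the multiplier values are arbitrary) forms the maximum-entropy distribution with σ1 = log ϕ and σ2 = ϕ and confirms by quadrature that it is a normalized gamma distribution with shape k = 1 − λ1 and rate λ2, whose mean is k/λ2:

```python
import math

# With sigma_1 = log(phi) and sigma_2 = phi, the maximum-entropy form (Eq. 17)
# is p = p0*exp(-l1*log(phi) - l2*phi) = p0*phi**(-l1)*exp(-l2*phi),
# i.e., a gamma distribution with shape k = 1 - l1 and rate l2.
l1, l2 = -2.0, 1.5                  # example Lagrange multipliers => k = 3
k = 1.0 - l1
p0 = l2**k/math.gamma(k)            # normalization constant on (0, infinity)

def p(phi):
    return p0*phi**(-l1)*math.exp(-l2*phi)

# Midpoint-rule quadrature: check normalization and the gamma mean k/l2.
X, n = 40.0, 40_000
dx = X/n
mids = [(i + 0.5)*dx for i in range(n)]
norm = sum(p(x) for x in mids)*dx
mean = sum(x*p(x) for x in mids)*dx
```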

3.5 Liouville equation

When a system is governed by an equation of the form of Eq. (2), as introduced in Sect. 2, the Liouville equation,

(26) \frac{\partial p(\phi)}{\partial t} = -\frac{\partial}{\partial \phi}\left[ F\, p(\phi) \right],

describes the time evolution of a distribution density, p(ϕ), of a given physical variable, ϕ. Note that so long as the original full physics is exactly described by Eq. (2) in a deterministic manner, with F as a continuous function of ϕ, the associated evolution of the probability distribution density is also exactly described by Eq. (26). See Yano and Ouchtar (2017) for a very concise derivation. Generalization for the multiple-variable case is accomplished straightforwardly by replacing ϕ and F by vectors. See, e.g., Risken (1984) for systems with stochasticity. More general formulations for the partial differential equation (PDE) systems are presented, e.g., as Eq. (15) in Larson (2004), with a full derivation given by, e.g., Pope (1985) and Klimenko and Bilger (1999).

In spite of its advantage of directly evaluating the time evolution of a given distribution, the Liouville equation is unfortunately rarely adopted in atmospheric sciences (e.g., Ehrendorfer, 1994a, b, 2006; Yano and Ouchtar, 2017; Garret, 2019; Hermoso et al., 2020) due to its prohibitive computational cost. An efficient computational methodology, which may make much wider applications possible, will be presented in Sect. 5. The result can easily be generalized to a PDE system, as outlined in Sect. 5.3.
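As a toy illustration of the content of the Liouville equation (ours, not from the formulation above), the evolution of p(ϕ, t) for the system dϕ/dt = F(ϕ) = −ϕ can be emulated by an ensemble of characteristics; the Liouville equation then implies that the ensemble mean decays as ⟨ϕ⟩(0)e^(−t):

```python
import math
import random

# Toy system dphi/dt = F(phi) = -phi; the Liouville equation (Eq. 26)
# transports p(phi, t) along the characteristics dphi/dt = F, so the mean
# decays as <phi>(t) = <phi>(0)*exp(-t). The Liouville solution is emulated
# here by a forward-Euler ensemble of characteristics (illustrative setup).
random.seed(1)
F = lambda phi: -phi
n, dt, T = 5000, 0.005, 1.0
ens = [random.gauss(2.0, 0.5) for _ in range(n)]   # initial p: N(2, 0.5^2)
for _ in range(round(T/dt)):
    ens = [phi + dt*F(phi) for phi in ens]         # forward Euler step
mean = sum(ens)/n                                  # approx 2*exp(-1)
```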

3.6 PSD equation

A prognostic equation for a PSD, n(r), of hydrometeors can be considered in a manner analogous to the Liouville equation, but it differs in detail (cf. Khain et al., 2015). A PSD, being defined at a single macroscopic point, is advected, and its source term, S, does not generally take a flux-divergence form:

(27) \frac{\partial n(r)}{\partial t} + \nabla_h \cdot \left[ n(r)\,\mathbf{u} \right] + \frac{\partial}{\partial z}\left[ (w - w_t(r))\, n(r) \right] = S(r).

Here, r is the particle size, u the horizontal velocity, w the vertical velocity, and wt(r) the terminal velocity of a particle of size r. The source term may furthermore be separated into two distinct processes: collision processes, Scol, and non-collision processes, Sloc, namely the growth and reduction (e.g., evaporation) processes of individual particles:

(28a) S(r) = S_{\rm col}(r) + S_{\rm loc}(r).

The collision term may take the form

(28b) S_{\rm col}(m) = \frac{1}{2} \int_0^m n(m')\, n(m - m')\, K(m', m - m')\, dm' - \int_0^\infty n(m)\, n(m')\, K(m, m')\, dm',

setting the particle masses to be m = m(r) and m′ = m(r′). Here, K(m, m′) is the collision kernel between the particles of masses m and m′. Also note Scol(r) dr = Scol(m) dm. The first and second terms on the right-hand side above represent gain and loss, respectively, for a given particle size. The collision process prevents Eq. (27) from being reduced to the Liouville equation because this process makes the source, F, discontinuous as a function of the particle size. Nevertheless, Eq. (27) can be treated in a manner analogous to the Liouville equation (Eq. 26) by replacing the right-hand side of Eq. (26), −∂(pF)/∂ϕ, by the tendency of the PSD as given in Eq. (27). Furthermore, the PSD equation (Eq. 27) reduces to the Liouville equation when the advection and collision effects can be neglected, as seen in Sect. 6.
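The gain/loss structure of the collision term (Eq. 28b) can be illustrated with a minimal discrete-bin sketch (our toy configuration with a constant kernel, not from the formulation above); for a monodisperse initial condition, the total number density should follow the classical constant-kernel solution N(t) = N0/(1 + K N0 t/2):

```python
# Discrete analogue of Eq. (28b) on mass bins m = 1..M with a constant
# kernel K (toy configuration). For a monodisperse initial condition the
# total number density obeys dN/dt = -K*N**2/2.
K, M = 1.0, 200
n = [0.0]*(M + 1)
n[1] = 1.0                                  # monodisperse, N0 = 1
dt, steps = 0.01, 100                       # integrate to t = 1
for _ in range(steps):
    N = sum(n)
    dn = [0.0]*(M + 1)
    for m in range(2, M + 1):               # gain: coalescence into bin m
        dn[m] += 0.5*K*sum(n[mp]*n[m - mp] for mp in range(1, m))
    for m in range(1, M + 1):               # loss: collisions of bin m
        dn[m] -= K*n[m]*N
    n = [n[m] + dt*dn[m] for m in range(M + 1)]
total = sum(n)                              # approx 1/(1 + 0.5) = 2/3
mass = sum(m*n[m] for m in range(M + 1))    # mass should stay conserved
```

Note how each coalescence removes two particles (loss term, counted twice) and creates one (gain term, with the factor 1/2 avoiding double counting), so the particle number decreases while the total mass is conserved.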

4 Applications to atmospheric processes

In applying the statistical principles discussed in the last section to atmospheric problems, some additional considerations are required due to differences from typical statistical problems. This section discusses those differences. Our discussions may be rather abstract and philosophical. However, we believe that they provide insights into critical issues of atmospheric modeling that are often overlooked. Our discussions lead to a principle for choosing distribution constraints in atmospheric problems, as required for the maximum-entropy principle, which we call the output-constrained distribution principle.

There are three important differences between atmospheric applications and standard statistical applications. These are discussed in the following three subsections.

4.1 Static or non-static, diagnostic or prognostic

First, statistics or, more precisely, mathematical statistics is fundamentally static and diagnostic: methodologies of statistics and probability (e.g., hypothesis testing, probabilities within a binomial system) as found in standard textbooks (e.g., Feller, 1968; Wonnacott and Wonnacott, 1969; Jaynes, 2003; Gregory, 2005) do not involve any time-dependent problems. The extensive time-dependent statistical models in the literature belong to statistical mechanics and stochastic modeling rather than to mathematical statistics. This fundamentally static nature of statistics is also reflected in more modern statistical theories; for example, a standard textbook on deep learning (Goodfellow et al., 2016) does not address any time-dependent problems. Very symbolically, the notion of “updating” a prior distribution in Bayesian probability theories (e.g., Bernardo and Smith, 1997), contrary to its connotation, involves not a concept of time but just an update of our knowledge at a fixed time.

This is in rather stark contrast to the atmospheric system, which continuously evolves with time: we are inherently interested in forecasts. Allegorically speaking, there is no time to update the priors in the atmospheric system because, as soon as information is updated, the original prior is already obsolete: the system itself has changed. Still allegorically speaking, the best we can do is to update (in a meteorological sense but not in a statistical sense) the priors themselves with time. In other words, in describing atmospheric processes, the key issue is to predict the time evolution of the probabilities and statistics: atmospheric problems are fundamentally non-static and prognostic.

The data assimilation problem falls between the two. As in any other atmospheric problem, the prediction of the evolution of the data uncertainty is a key aspect of data assimilation. At the same time, the statistical update of data by incorporating observational information is another key aspect. In the present study, the focus is exclusively on the first aspect.

4.2 Output-constrained distribution principle

Second, in many statistical applications as well as in standard equilibrium statistical mechanics, as summarized by Jaynes (1978), a final aim is to know a distribution of a given variable. For this goal, the integrated quantities are inputs to a problem that constrains a distribution. Under these constraints, we define the most likely distribution from the maximum-entropy principle.

However, in atmospheric modeling, knowing a distribution itself, though it may be of theoretical interest, is not an ultimate aim. It is merely a means of obtaining certain integrated quantities (e.g., microphysical tendencies, grid-box-averaged quantities, standard deviation error measures) for a modeling purpose. For this goal, a precise form of a distribution is not of interest, but it must be just accurate enough for providing these required final outputs. This is a very different problem compared to the problems in standard equilibrium statistical mechanics. Here, we must clearly recognize that these are two different problems: although a more accurate distribution may help to evaluate the required statistical quantities more accurately, there should be a way of making the latter more accurate without making the former more accurate than necessary.

This observation leads to an interesting possibility for constructing a prescribed PDF (DDF) form in atmospheric-science applications: take the necessary outputs rather than the available inputs as constraints. Thus, for example, if the purpose is to know a mean value (e.g., a waiting time), take an exponential distribution (Eq. 24). If the purpose is to know a variance (e.g., a standard deviation error in temperature measurements), take a Gaussian distribution (Eq. 25). We propose to call these output-constrained distributions. Note that an assumed PDF obtained under this principle may provide a poor fit to the actual distribution. Our basic argument here is that, nevertheless, an assumed PDF will work reasonably well for the purpose of estimating the required output values (i.e., “constraints”) because the given distribution is obtained from the maximum-entropy principle by taking those required outputs as the constraints.

The proposed re-interpretation is consistent with a basic requirement for an assumed-PDF form: if we need to fit a PDF to L statistical variables, L parameters must be introduced. This is not the case with the popular approach of introducing an assumed double-Gaussian distribution for the sake of representing the skewness of an actual distribution (e.g., Larson et al., 2002; Fitch, 2019; Naumann et al., 2013): the number of parameters of the assumed PDF becomes greater than that of the required outputs. For example, with a single variable, a double-Gaussian distribution introduces five parameters when only three outputs are required (mean, variance, skewness). In contrast, the proposed principle suggests how to choose a distribution that contains the minimum number of parameters compatible with the required number of outputs. Here, we invoke the maximum-entropy principle for this purpose.

A current standard approach of updating the PDF parameters is from some moments (e.g., mean, variance) of variables. The output-constrained distribution principle dictates the need to update those parameters using the actual output variables that are required for a host model or quantities that are crucial for predicting the evolution of the system of concern. This rather philosophical statement poses an important practical question of, for example, whether it is optimal to choose the radar reflectivity as a third constraint in bulk microphysics.

Here, the notion of “output variables that are required for a host model” is most specifically relevant to the subgrid-scale distribution for a parameterization. Recall that the goal of a parameterization is to provide not every detail of the subgrid-scale processes but only the so-called apparent sources, Q1 and Q2, i.e., the tendencies of temperature and moisture due to the subgrid-scale processes (cf. Yanai et al., 1973), and only as grid-scale averages. All the other details serve only for a consistent calculation of the subgrid-scale processes. In the case of cloud microphysics with explicit cloud modeling (thus the cloud processes themselves are not “parameterized”), certain variables must be passed to different components of the model, which play the role of the “host model” in this context. For example, the mixing ratios of cloud and rain, qc and qr, must be accounted for in an accurate definition of the buoyancy in the momentum equation. Some radiation schemes require as inputs the mean radii of the cloud and rain droplets, rc and rp, although these are typically not prognostic variables of the cloud microphysics. Those variables are considered the “necessary variables (outputs) for the host model”. Thus, especially in the context of cloud microphysics, the “necessary variables (outputs)” should be clearly distinguished from the prognostic variables of the cloud microphysics. The case of data assimilation is more subtle because there is neither a host model nor another model component to which information must be passed. Yet, for operational purposes, we are not interested in knowing the full shape of the probability distribution of a variable in order to quantify its uncertainty. In traditional assimilation formulations, we merely ask for the standard deviation errors/uncertainties in variables; those are considered the “necessary outputs” for the data assimilation.

4.3 Availability of input data

A third major difference of atmospheric problems compared to standard statistical problems is the availability of input data (e.g., initial constraints). Regarding the latter, we assume a situation whereby available input data (information) are rather limited. For example, the Maxwell–Boltzmann distribution is derived by assuming that only the total energy is known. It is rarely asked how to obtain more information so that, for example, a higher-order correction to the Maxwell–Boltzmann distribution can be obtained. A limited amount of information is the given starting point.

On the other hand, in atmospheric problems, available information is rather unlimited or, at least, we believe that we can obtain more data by either modeling or observation if necessary. In other words, the input data are rather unconstrained. For subgrid-scale distributions, more explicit models, such as cloud-resolving models (CRMs) or large-eddy simulations (LESs), can be run at will for detailed simulations of the subgrid-scale processes of concern, especially with enhancements of computing power. Such an abundance of information tends to obscure the basic idea of a statistical description. The same issue can also be identified from the perspective of the assumed-PDF (DDF) approach: we can take as many moments as required in principle. The only issues are the computational cost and the accuracy benefit.

In other words, in principle, the number of available inputs is hardly limited in atmospheric problems. Thus, if this available information is simply adopted as constraints for defining the most likely distribution under the maximum-entropy principle, the number of distribution parameters can be increased arbitrarily to obtain as accurate a distribution as desired. This consideration also strengthens the argument of the last subsection. In the context of the assumed-PDF (DDF) approach, the number of “constraints” (e.g., moments) must be decided in such a manner that the required outputs can be evaluated sufficiently accurately. In other words, the problem must be constrained by the required outputs rather than by the available inputs; that is the essence of the output-constrained distribution principle.

4.4 Validation: diffusion problem

Over the last two subsections (Sects. 4.2 and 4.3), arguments have been developed for re-interpreting the maximum-entropy principle in such a manner that the actual variables required as outputs are adopted as the “constraints” for determining a distribution. Recall that the output-constrained distribution principle is proposed merely as a guiding principle for choosing an assumed form for distributions; thus we do not expect the chosen distribution forms to make perfect predictions. Nevertheless, it is helpful to quantify the degree of accuracy of such predictions; that is the purpose of this subsection.

Thus, this subsection tests the proposed output-constrained distribution principle by taking, as an example, a one-dimensional diffusion equation:

(29) \left( \frac{\partial}{\partial t} - \frac{\partial^2}{\partial x^2} \right) p = 0,

with a diffusion coefficient set to unity for simplicity. When a system evolves purely under white-noise forcing, as in the case of Brownian motion, the prognostic equation for the distribution, p, reduces to this form. Also note that Eq. (29) is a special case of the Fokker–Planck equation.

Let us assume that we are interested in predicting only a mean and a variance of the position, x, under the distribution, p. In other words, the required outputs for our problem are only a mean, ⟨x⟩, and a variance, ⟨(x−⟨x⟩)²⟩. In this case, the output-constrained distribution principle suggests that it suffices to take a Gaussian distribution, say

(30) p(x,t) = p_0(t)\, e^{-\lambda_2(t)\, (x - \langle x \rangle(t))^2}.

Note that in this particular case, the adopted distribution form also corresponds to an exact solution of the system (Eq. 29).

The time dependence of the parameters, λ2(t) and ⟨x⟩(t), introduced in the above solution (Eq. 30) can be derived by directly substituting Eq. (30) into Eq. (29); cf. Sect. 5.7. When the initial conditions are given by

(31a) \langle x \rangle |_{t=0} = x_0, \qquad (31b) \langle (x - \langle x \rangle)^2 \rangle |_{t=0} = \frac{1}{2\lambda_*},

the time evolution of the distribution (Eq. 30) is given by

(32) p(x,t) = \frac{1}{(1 + 4\lambda_* t)^{1/2}} \left( \frac{\lambda_*}{\pi} \right)^{1/2} \exp\left[ -\frac{\lambda_* (x - x_0)^2}{1 + 4\lambda_* t} \right].

Here, the distribution, p, is also normalized so that its integral over the whole domain becomes unity, and the time evolution of the mean and the variance is given by

(33a) \langle x \rangle = x_0, \qquad (33b) \langle (x - \langle x \rangle)^2 \rangle = \frac{1 + 4\lambda_* t}{2\lambda_*} = \frac{1}{2\lambda_*} + 2t.

Note that the mean is a constant with time in the diffusion problem if the initial condition is Gaussian, whereas the variance increases linearly with time.
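These two properties can be checked against a direct finite-difference integration of Eq. (29); the sketch below (our illustration, with λ* set to 1) confirms that the mean stays fixed and that the variance follows Eq. (33b):

```python
import math

# Explicit finite-difference check of the diffusion problem (Eq. 29):
# starting from a Gaussian with lambda_* = 1, the variance should grow as
# 1/(2*lambda_*) + 2*t (Eq. 33b) while the mean stays at zero (Eq. 33a).
lam_star = 1.0
L, nx = 20.0, 401
dx = L/(nx - 1)
xs = [-0.5*L + i*dx for i in range(nx)]
p = [math.exp(-lam_star*x*x) for x in xs]
s = sum(p)*dx
p = [v/s for v in p]                        # normalize the initial Gaussian

dt = 0.2*dx*dx                              # stable explicit time step
T = 0.5
for _ in range(round(T/dt)):
    lap = [0.0]*nx
    for i in range(1, nx - 1):
        lap[i] = (p[i-1] - 2.0*p[i] + p[i+1])/dx**2
    p = [p[i] + dt*lap[i] for i in range(nx)]

mean = sum(x*v for x, v in zip(xs, p))*dx   # stays at 0 (Eq. 33a)
var = sum(x*x*v for x, v in zip(xs, p))*dx  # approx 1/(2*lam_star) + 2*T
```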

The basic idea behind the output-constrained distribution principle is to define a PDF in such a manner that the required output variables (mean and variance here) can be evaluated most effectively with the minimum possible parameters. Fitting an actual distribution under an assumed-PDF form is not a goal. To test the workings of this principle, we consider two examples in which we set both the initial mean and the variance of the output-constrained distribution (Eq. 32) to be equal to those of an actual initial distribution and compare the evolutions of means and variances of both distributions.

4.4.1 Example 1: double-Gaussian distribution

As a first example, we take an initial distribution consisting of two Gaussian distributions:

(34) p(x, t=0) = \alpha_1 \left( \frac{\mu_1}{\pi} \right)^{1/2} e^{-\mu_1 (x - x_1)^2} + \alpha_2 \left( \frac{\mu_2}{\pi} \right)^{1/2} e^{-\mu_2 (x - x_2)^2}.

Here, the two Gaussians are centered at x = x1 and x = x2, respectively. By the normalization condition, we may set

(35a) α 1 + α 2 = 1 .

We may further set

(35b) α 1 x 1 + α 2 x 2 = 0

so that the initial mean is ⟨x⟩ = 0.

It is immediately seen that the time evolution of this system is given by

(36) p(x,t) = \frac{\alpha_1}{(1 + 4\mu_1 t)^{1/2}} \left( \frac{\mu_1}{\pi} \right)^{1/2} e^{-\mu_1 (x - x_1)^2/(1 + 4\mu_1 t)} + \frac{\alpha_2}{(1 + 4\mu_2 t)^{1/2}} \left( \frac{\mu_2}{\pi} \right)^{1/2} e^{-\mu_2 (x - x_2)^2/(1 + 4\mu_2 t)}.

Here, the mean remains constant with time, thus ⟨x⟩ = 0, and the evolution of the variance is given by

(37) \langle x^2 \rangle = \frac{\alpha_1}{2}\, \frac{1 + 4\mu_1 t}{\mu_1} + \frac{\alpha_2}{2}\, \frac{1 + 4\mu_2 t}{\mu_2} = \frac{1}{2}\left( \frac{\alpha_1}{\mu_1} + \frac{\alpha_2}{\mu_2} \right) + 2t.

We can evaluate the statistical evolution of this system with an output-constrained distribution (i.e., a single-Gaussian distribution) by setting its initial mean and variance to be identical to those of the double Gaussian. Thus, we obtain x0 = 0 and

(38) \lambda_* = \left( \frac{\alpha_1}{\mu_1} + \frac{\alpha_2}{\mu_2} \right)^{-1}

in Eq. (32). Substitution of Eq. (38) into Eq. (33b) shows that this single-Gaussian model can predict the time evolution of both the mean (rather trivially) and the variance of a two-Gaussian system perfectly.

Note that in this case, a single Gaussian hardly fits a double-Gaussian distribution well, especially when the two Gaussians are well separated from each other. However, if our interest is merely to predict a variance, then in this example a single-Gaussian approximation serves the purpose perfectly, consistent with the proposed re-interpretation of the maximum-entropy principle. This example may appear rather special and artificial. Nevertheless, it makes the case well that accurately predicting the evolution of the whole distribution is not necessarily a requirement for predicting a limited number of statistical quantities satisfactorily.
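The perfect variance prediction can also be confirmed by direct quadrature. In the sketch below (our illustration, with arbitrary parameter values and with both Gaussians centered at the origin so that Eqs. 37 and 38 apply as written), the variance of the evolving double Gaussian (Eq. 36) grows by exactly 2t, as the single-Gaussian surrogate (Eq. 33b) predicts:

```python
import math

# Quadrature check of Sect. 4.4.1 with x1 = x2 = 0 (illustrative values):
# the variance of the evolving double Gaussian (Eq. 36) matches the
# single-Gaussian prediction 1/(2*lambda_*) + 2*t (Eqs. 33b and 38).
a1, a2, mu1, mu2 = 0.3, 0.7, 4.0, 0.5

def p(x, t):                                # Eq. (36) with x1 = x2 = 0
    g1 = a1*math.sqrt(mu1/(math.pi*(1 + 4*mu1*t)))*math.exp(-mu1*x*x/(1 + 4*mu1*t))
    g2 = a2*math.sqrt(mu2/(math.pi*(1 + 4*mu2*t)))*math.exp(-mu2*x*x/(1 + 4*mu2*t))
    return g1 + g2

def variance(t, L=40.0, n=4001):
    dx = L/(n - 1)
    xs = [-0.5*L + i*dx for i in range(n)]
    return sum(x*x*p(x, t) for x in xs)*dx

lam_star = 1.0/(a1/mu1 + a2/mu2)            # Eq. (38)
v0 = variance(0.0)                          # equals 1/(2*lam_star)
v1 = variance(1.0)                          # equals v0 + 2*t at t = 1
```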

4.4.2 Example 2: a skewed Gaussian distribution

The second example is a skewed initial distribution given by

(39) p(x, t=0) = \left( \frac{\lambda}{\pi} \right)^{1/2} (1 + \alpha x)\, e^{-\lambda x^2}.

Here, a constant parameter, α, controls the skewness of this distribution. This example examines how well an assumed Gaussian distribution predicts the statistics when an actual distribution is not Gaussian.

The time evolution of this system is solved by, for example, a Fourier transform method, as summarized in Appendix A. The final answer is

(40) p(x,t) = \frac{1}{(1 + 4\lambda t)^{1/2}} \left[ 1 + \frac{\alpha x}{(1 + 4\lambda t)^2} \right] \left( \frac{\lambda}{\pi} \right)^{1/2} \exp\left[ -\frac{\lambda x^2}{1 + 4\lambda t} \right].

From this solution, the time evolution of mean and variance is readily evaluated as

(41a) \langle x \rangle = \frac{\alpha}{2\lambda (1 + 4\lambda t)}, \qquad (41b) \langle (x - \langle x \rangle)^2 \rangle = \frac{1 + 4\lambda t}{2\lambda} \left[ 1 - \frac{\alpha^2}{2\lambda (1 + 4\lambda t)^3} \right].

The “fit” to this problem under the assumed single-Gaussian distribution is given by

(42a) x_0 = \frac{\alpha}{2\lambda}, \qquad (42b) \lambda_* = \lambda \left( 1 - \frac{\alpha^2}{2\lambda} \right)^{-1}.

In this case, the assumed PDF fails to predict the gradual shift of the mean toward the origin from its initial position (cf. Eqs. 41a and 42a; Fig. 1a). This discrepancy is hardly a surprise because an assumed-PDF evolution is not expected to predict the evolution of an actual distribution in any perfect manner. However, the deviation of the variance under the assumed single-Gaussian distribution (Eq. 42b; Fig. 1b, long dash) from the real value (Eq. 41b; Fig. 1b, solid) grows relatively slowly, and the underestimate is only 25 % even at λt = 10, with a relatively large nondimensional skewness parameter (i.e., α/λ^(1/2) = 0.5) assumed.


Figure 1. Plots of (a) 2λ⟨x⟩/α and (b) 2λ⟨(x−⟨x⟩)²⟩ with an initial skewed Gaussian distribution (Eq. 39) with α/λ^(1/2) = 0.5: exact (solid) and with a single-Gaussian approximation (long dash). Note that in panel (a), although the exact mean (solid) decreases to zero with time, the assumed Gaussian solution (long dash) totally fails to predict this tendency. Nevertheless, the prediction of the variance (b) with the assumed Gaussian (long dash) is still qualitatively correct, even though a tendency for underestimation compared to the exact value (solid) may be noted.


5 Liouville equation constrained by an assumed PDF

As emphasized in the last section, in atmospheric problems the full physical system, as given by Eq. (2) in the present study, is in principle known; this prompts us to exploit the Liouville equation (Eq. 26) for predicting the time evolution of a distribution. The next question is how it can be exploited efficiently. The output-constrained distribution principle introduced in the last section provides half of the answer: adopt a distribution form that is defined from the maximum-entropy principle, but take the required outputs for the host model as the “constraints”, σl (l = 1, …, N). As a result, a distribution with a finite number of parameters, {λi}, is obtained. Thus, the problem of defining the time evolution of a continuous distribution function reduces to that of describing the time evolution of a finite number of parameters, {λi(t)}. The remaining half of the question is how to actually calculate the time evolution of {λi} directly by a set of ordinary differential equations. The present section addresses this remaining half. Importantly, the formulation presented in this section does not rely on the output-constrained distribution (maximum-entropy) principle; rather, it can be applied to any form of assumed PDF.

The first key step of this procedure is, as just suggested, to replace the time derivative, ∂p/∂t, of the distribution by the time derivatives, λ̇i, of the PDF parameters. This is accomplished simply by applying the chain rule to the time derivative, noting that the time dependence of the distribution (Eq. 4a) arises solely from the parameters, {λi}:

(43) \frac{\partial p}{\partial t} = \sum_{i=0}^{N} \frac{\partial p}{\partial \lambda_i}\, \dot{\lambda}_i.

A specific example of this procedure is given by Eq. (55) below, which can be obtained either directly or by using Eq. (43) explicitly.

5.1 General formulation

This subsection derives a prognostic set of equations for the PDF parameters {λl} in a general manner. The derivation is repeated in Sect. 5.5 by taking the exponential distribution as an example so that the basic idea can be seen in a more concrete manner.

We assume a distribution in the general form (Eq. 4a), defined by N+1 free parameters, λi (i = 0, …, N). Here, we assume that λ0 is a constant factor that is required for normalizing the distribution, as already noted when introducing the assumed distribution in the general form of Eq. (4a); thus

(44a) p \propto \lambda_0

or

(44b) \frac{\partial p}{\partial \lambda_0} = \frac{p}{\lambda_0}.

Substitution of Eq. (44b) into Eq. (43) leads to

(45) \frac{\partial p}{\partial t} = p\, \frac{\dot{\lambda}_0}{\lambda_0} + \sum_{i=1}^{N} \frac{\partial p}{\partial \lambda_i}\, \dot{\lambda}_i.

Replacing the left-hand side of Eq. (45) by the right-hand side is the key step in reducing the problem of the evolution of the distribution, p, as a whole to that of a fixed number of parameters, λi (i = 1, …, N).

The time evolution of this distribution is constrained in two ways: first, by a normalization condition (Eq. 13) and, second, by the Liouville equation (Eq. 26). The normalization condition (Eq. 13) can also be cast into a prognostic form by taking the time derivative:

(46) \frac{\partial}{\partial t} \int p \, d\phi = 0,

with the normalization (Eq. 13) introduced as an initial condition. Note that in cases with PSDs, the right-hand side of Eq. (13) must be replaced by C˙0, with C0 standing for the total particle number density. Thus, the following reduction must also be modified accordingly.

When the integration range is fixed in time, the time derivative can be moved inside the integral in Eq. (46), operating only on the distribution, p. Further substituting the expression for the time derivative (Eq. 45), we obtain

(47) \frac{\dot{\lambda}_0}{\lambda_0} = -\sum_{i=1}^{N} \left[ \int \frac{\partial p}{\partial \lambda_i}\, d\phi \right] \dot{\lambda}_i.

By means of Eq. (45), the Liouville equation (Eq. 26) also reduces to

(48) p\, \frac{\dot{\lambda}_0}{\lambda_0} + \sum_{i=1}^{N} \frac{\partial p}{\partial \lambda_i}\, \dot{\lambda}_i + \frac{\partial}{\partial \phi}(pF) = 0,

and furthermore, by substituting Eq. (47) into the above, to

(49a) \sum_{i=1}^{N} \left[ \frac{\partial p}{\partial \lambda_i} - p \int \frac{\partial p}{\partial \lambda_i}\, d\phi \right] \dot{\lambda}_i + \frac{\partial}{\partial \phi}(pF) = 0.

Equation (49a) is the key result of the present study because it constitutes a prognostic equation for evaluating the time evolution of {λi}. Note that when the same procedure is applied to the diffusion equation (Eq. 29), by replacing ∂(pF)/∂ϕ with −∂²p/∂x², it reduces to a set of ordinary differential equations for λ2 and ⟨x⟩, as shown in Sect. 5.4.1. As in the case of the diffusion equation, more generally, when the assumed-PDF form constitutes an exact solution of a given system, Eq. (49a) separates into its ϕ and λi dependencies, and the latter can be solved separately, independent of the distribution variable, ϕ. This point can be understood directly from the fact that Eq. (49a) is equivalent to the original Liouville equation (Eq. 26) under the given assumed-PDF form.

However, because the assumed PDF generally constitutes merely an approximation of a true distribution, ϕ dependence in Eq. (49a) cannot be separated out in a general case; thus this equation cannot be solved in any consistent manner merely in terms of the assumed-PDF parameters, {λi}. The consequence of the approximate nature of the assumed PDF in Eq. (49a) is more explicitly seen by substituting Eq. (4c) into Eq. (26):

(49b) $\sum_{i=1}^{N}\left[\frac{\partial p}{\partial\lambda_i} - p\int\frac{\partial p}{\partial\lambda_i}\,d\phi\right]\dot{\lambda}_i + \frac{\partial}{\partial\phi}(pF) = \mathcal{E}$,

with $\mathcal{E}$ suggesting a possible error. Keep well in mind that this fact does not change regardless of the manner in which we attempt to predict the evolution of a distribution by an assumed PDF. In other words, Eq. (49a) itself is not defective; the difficulty here is a simple consequence of the assumed-PDF approach, which attempts to solve the evolution of distributions by assuming forms that are not actual solutions.

Thus, the next goal is to derive from Eq. (49a) a closed set of equations, not depending on ϕ, so that the set of distribution parameters, {λi}, can be solved in a consistent manner. For this purpose, we need to remove the ϕ dependence from Eq. (49a), and the only option that we can see is simply to integrate it over ϕ. The goal is to obtain N differential equations for λi (i=1,…,N). To this end, we apply a set of weights, σl (l=1,…,N), to Eq. (49a) and integrate over ϕ. Here, each weight, σl, is an unspecified function of ϕ but independent of {λi}. After integration in ϕ, we obtain

(50a) $\sum_{i=1}^{N}\left[\int\sigma_l\frac{\partial p}{\partial\lambda_i}\,d\phi - \left(\int\sigma_l p\,d\phi\right)\int\frac{\partial p}{\partial\lambda_i}\,d\phi\right]\dot{\lambda}_i = -\int\sigma_l\frac{\partial}{\partial\phi}(pF)\,d\phi$

for l=1,,N. As a result, we obtain N ordinary differential equations for N unknowns. The set of Eq. (50a) is linear in terms of λ˙i; thus it can be inverted in principle, and the tendencies, λ˙i, can explicitly be evaluated. Here, keep in mind that Eq. (50a) is valid only approximately; thus it may be more emphatically stated as

(50b) $\sum_{i=1}^{N}\left[\int\sigma_l\frac{\partial p}{\partial\lambda_i}\,d\phi - \left(\int\sigma_l p\,d\phi\right)\int\frac{\partial p}{\partial\lambda_i}\,d\phi\right]\dot{\lambda}_i = -\int\sigma_l\frac{\partial}{\partial\phi}(pF)\,d\phi + \mathcal{E}_l$,

with $\mathcal{E}_l$ suggesting an associated error. Note further that the right-hand side of Eq. (50a) can be re-written as

(51) $\int\sigma_l\frac{\partial}{\partial\phi}(pF)\,d\phi = -\int pF\frac{\partial\sigma_l}{\partial\phi}\,d\phi$

by an integration by parts, assuming that pF vanishes at the edges of the integral range. Realize that the key step introduced in the formulation here is to predict the PDF parameters, {λl(t)}, by Eq. (50a). In this manner, we circumvent the principal difficulty of the current assumed-PDF approaches of inverting the relations of Eq. (6) into the form of Eq. (5). The major remaining open question with this procedure is the choice of the weights, {σl}, which is addressed next.
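The update defined by Eqs. (50a) and (51) amounts to a small linear solve over quadrature sums. The following is a minimal numerical sketch under the assumption of a fixed grid in ϕ; all function and variable names here are illustrative, not from the paper:

```python
import numpy as np

# Weighted-residual update, Eq. (50a): the tendencies λ̇_i solve A λ̇ = b, with
#   A_{li} = ∫ σ_l (∂p/∂λ_i) dϕ − ⟨σ_l⟩ ∫ (∂p/∂λ_i) dϕ,
#   b_l    = ∫ p F (∂σ_l/∂ϕ) dϕ    (the right-hand side, after Eq. 51).

def lambda_tendencies(phi, p, dp_dlam, sigmas, dsigmas_dphi, F):
    """phi: quadrature grid; p: assumed PDF on the grid;
    dp_dlam: list of ∂p/∂λ_i arrays; sigmas, dsigmas_dphi: weights σ_l
    and their ϕ derivatives on the grid; F: source term on the grid."""
    dx = phi[1] - phi[0]
    w = np.full_like(phi, dx)          # trapezoid-rule quadrature weights
    w[0] = w[-1] = 0.5 * dx
    integ = lambda f: float(np.sum(f * w))
    N = len(dp_dlam)
    A, b = np.empty((N, N)), np.empty(N)
    for l in range(N):
        mean_s = integ(sigmas[l] * p)
        for i in range(N):
            A[l, i] = integ(sigmas[l] * dp_dlam[i]) - mean_s * integ(dp_dlam[i])
        b[l] = integ(p * F * dsigmas_dphi[l])
    return np.linalg.solve(A, b)       # Eq. (50a) is linear in λ̇_i
```

For an exponential PDF with the weight σ1=ϕ and a constant source, this numerically recovers Eq. (58a), λ̇1 = −λ1²⟨F⟩, derived analytically in Sect. 5.5.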

5.2 Choice of the weights, {σl}

Here, the most appropriate choice of the weights, {σl}, becomes immediately clear by noting that the left-hand side of Eq. (50a) corresponds to the temporal tendency, $d\langle\sigma_l\rangle/dt$, of the “averaged” weight:

(52a) $\frac{d}{dt}\langle\sigma_l\rangle = \int\sigma_l\frac{\partial p}{\partial t}\,d\phi = \sum_{i=0}^{N}\left(\int\sigma_l\frac{\partial p}{\partial\lambda_i}\,d\phi\right)\dot{\lambda}_i$,

where the last expression reduces to the left-hand side of Eq. (50a) with the help of Eq. (47). Thus, symbolically, Eq. (50a) is equivalent to

(52b) $\frac{d}{dt}\langle\sigma_l\rangle = \langle F_{\sigma_l}\rangle$,

with $\langle F_{\sigma_l}\rangle$ the source term that defines the tendency of $\langle\sigma_l\rangle$. By deduction from Eq. (52a), we can conclude that Eq. (50a) predicts the time evolution of $\langle\sigma_l\rangle$, as given by Eq. (52b), where

(52c) $\langle\sigma_l\rangle = \int p\,\sigma_l\,d\phi$.

It also follows that if {σl} is chosen as the outputs to be used in the host model, following the output-constrained distribution principle (cf. Sect. 4.2), Eq. (50a) predicts those required outputs consistently under a given assumed PDF (DDF), being equivalent to solving Eq. (52b). Thus, we choose {σl} to be the same as the constraints of Eq. (12) with Gl=pσl and L=N.

A standard choice following the assumed-PDF (DDF) approach is to set σl = ϕ^l. This procedure is equivalent to time-integrating the moments for predicting {λi}. Equation (50a) or (52b) further reduces to the diagnostic method based on moments typically adopted in subgrid-scale assumed-PDF formulations when {σl} is taken to be moments and a diagnostic limit is also taken. As already emphasized in introducing the governing equation (Eq. 2) of the system in Sect. 2, the source term, F, includes all the physics associated with a variable, ϕ. A multi-variable extension is also straightforward. Thus, in principle, this formulation can be applied to any assumed-PDF approach, including those in cloud microphysics and data assimilation.

In subsequent subsections, more specific versions of Eq. (50a) for various assumed-PDF forms are presented, as a demonstration that this general formulation can actually be used. These results can readily be used as recipes for applying the formulation to any physical problem under a given assumed-PDF form, once the source term, F, is specified. However, as a detour, in the next subsection we first discuss the generalization of the formulation to the PDE system, and in Sect. 5.4 we discuss its link to the existing assumed-PDF approaches and to bulk microphysics.

5.3 Generalization to the PDE system (Eq. 1)

The discussion of the last subsection suggests that the derivation of the prognostic equations for the assumed-PDF parameters in the one-dimensional dynamical system (Eq. 2) can be generalized into partial differential equation (PDE) systems, described by Eq. (1), in a relatively straightforward manner.

First note that in a PDE system, the time derivative on the left-hand side of Eq. (52a) is replaced by a partial time derivative:

(53a) $\frac{\partial}{\partial t}\langle\sigma_l\rangle = \int\sigma_l\frac{\partial p}{\partial t}\,d\phi = \sum_{i=0}^{N}\left(\int\sigma_l\frac{\partial p}{\partial\lambda_i}\,d\phi\right)\dot{\lambda}_i$.

Note next that a time-evolution equation for σl can be derived from the basic governing equation (Eq. 1) by taking a chain rule:

(53b) $\frac{\partial\sigma_l}{\partial t} = \frac{\partial\sigma_l}{\partial\varphi}\frac{\partial\varphi}{\partial t} = \frac{\partial\sigma_l}{\partial\varphi}\left(-\mathbf{v}\cdot\nabla\varphi + F\right)$.

Thus,

(53c) $F_{\sigma_l} = \frac{\partial\sigma_l}{\partial\varphi}\left(-\mathbf{v}\cdot\nabla\varphi + F\right)$,

and with the help of Eq. (53c), the PDE version of Eq. (52b) becomes

(53d) $\frac{\partial}{\partial t}\langle\sigma_l\rangle = \langle F_{\sigma_l}\rangle$.

By combining Eqs. (53a) and (53d), the prognostic set of equations for λ˙i is given by

(53e) $\frac{\partial}{\partial t}\langle\sigma_l\rangle = \sum_{i=0}^{N}\left(\int\sigma_l\frac{\partial p}{\partial\lambda_i}\,d\phi\right)\dot{\lambda}_i = \langle F_{\sigma_l}\rangle$.

Keep in mind that λ̇i in the above designates the partial time derivative. The generality of the final result (Eq. 53e) hardly needs to be emphasized.

5.4 Link to the existing assumed-PDF approaches and the bulk microphysics

The formulation presented in the last three subsections constitutes a generalization of the existing assumed-PDF approaches in the following manner. Recall that with the help of Eq. (52a), the general equation (Eq. 50a) can be written more symbolically as Eq. (52b). Here, keep in mind that a spatial dependence of variables with Eq. (1) can also be taken into account by simply replacing the time derivative on the left-hand side with a partial derivative (cf. Eq. 53d). Also keep in mind the possibility of generalizations to systems with multiple variables (cf. Yano2024), though they remain implicit here. In this manner, Eq. (52b) constitutes a general form of the governing equations considered in the existing assumed-PDF approaches, as more specifically presented by, e.g., Eqs. (3.1)–(3.10) in Larson (2022).

The first generalization to be noted is that σl can be any output variable required by a host model, so long as σl is properly defined as a function of the dependent variables. Also note that the integral range in definition (Eq. 12) can be taken in any manner, although such a full generalization itself is left for future studies (cf. Yano2024). Thus, for example, a cloud fraction can also be introduced as one of the output variables of the form σl under this definition. On the other hand, the existing assumed-PDF approaches rather arbitrarily restrict these statistical variables, σl, to moments (e.g., Golaz et al.2002). The formulation introduced here demonstrates that this restriction is not necessary; a very wide range of choices can be made for σl.

Furthermore, the existing assumed-PDF approaches perform time integrals of the statistical variables, ⟨σl⟩, by Eq. (52b). Note that for some variables, Eq. (52b) is solved diagnostically by setting the left-hand side to zero (cf. Larson2022). After updating ⟨σl⟩, the PDF parameters, {λl}, are diagnosed from the given set of {⟨σl⟩} by the following relations:

(54) $\langle\sigma_j\rangle = \sigma_j(\{\lambda_l\})$

for j=1,…,N. As already discussed in Sect. 3.2.1, the inversion of {⟨σl⟩} to {λl} is not always easy.

The introduced general formulation, in turn, shows that the left-hand side of Eq. (52b) or Eq. (53d) can be replaced by the left-hand side of Eq. (50a) or Eq. (53a); thus the problem directly reduces to the prognosis of the PDF parameters of a given distribution. As a result, there is no longer a need to perform a cumbersome inversion. This modification greatly facilitates the computational procedure. Note especially that when the same set of moments is taken for {σl}, with identical assumed-PDF forms, the present formulation is exactly equivalent to an existing assumed-PDF model, apart from the fact that the PDF parameters, {λl}, rather than the moment set, {⟨σl⟩}, are directly predicted.

Another strength of the present formulation is that it shows more explicitly that the right-hand side of Eq. (52b) or Eq. (53d) can be expressed entirely in terms of the PDF parameters. As a result, there is no longer a need to introduce further closures into the assumed-PDF formulation, as also noted by Golaz et al. (2002). However, this rather obvious point is not always recognized, and some assumed-PDF approaches introduce additional closures to close their formulations (e.g., Fitch2019; Naumann et al.2013).

The standard formulations in bulk microphysics (cf. Milbrandt and Yau2005, and the references therein) adopt the mixing ratio, q; the total number density, NT; and the radar reflectivity, Z, as the prognostic variables, adopted in this order with decreasing levels of truncation. Under the present formulation, neglecting multiplication factors, q and Z correspond to setting σ1=r³ and σ2=r⁶, respectively, whereas NT is predicted in a standalone manner by separating the number density, n, into two components by setting n=NTp. More or less the same remarks apply to these formulations, with further flexibility gained by re-writing them in terms of the general form of Eq. (52b) or Eq. (53d). Probably most importantly, the choice of the radar reflectivity, Z, as a “constraint” can be questioned from the point of view of the output-constrained distribution principle (cf. Sect. 4.2): though the reflectivity, Z, may be a useful variable to compare with observations, it is not directly required in any microphysical tendency within a model.3

5.5 Application 1: exponential distribution

In this subsection, we repeat the general derivation presented in Sect. 5.1 by taking the exponential distribution (Eq. 24 with λ0=p0) as an example. Here, we immediately obtain $\partial p/\partial\lambda_0 = p/\lambda_0$, $\partial p/\partial\lambda_1 = -p\phi$, and

(55) $\frac{\partial p}{\partial t} = \frac{\dot{\lambda}_0}{\lambda_0}\,p - p\phi\,\dot{\lambda}_1$.

The normalization condition (Eq. 46) is obtained by integrating the above equation with respect to ϕ:

(56a) $\frac{\dot{\lambda}_0}{\lambda_0}\int p\,d\phi - \dot{\lambda}_1\int p\phi\,d\phi = 0$.

Noting that $\int p\,d\phi = 1$ and $\int p\phi\,d\phi = \langle\phi\rangle = 1/\lambda_1$, it reduces to

(56b) $\frac{\dot{\lambda}_0}{\lambda_0} - \frac{\dot{\lambda}_1}{\lambda_1} = 0$,

which can be immediately integrated into

(56c) $\frac{\lambda_0}{\lambda_1} = \mathrm{const}$.

That is the constraint under the normalization condition. Alternatively, the normalization condition can be obtained directly by performing an integral of the distribution analytically:

(56d) $\lambda_0 = \lambda_1$.

Substitution of Eq. (56b) into Eq. (55), in the same manner as that of Eq. (47) into Eq. (48), makes the right-hand side dependent only on λ1. Substituting this final expression into the Liouville equation (Eq. 26), we obtain

(57a) $p\left(\frac{1}{\lambda_1} - \phi\right)\dot{\lambda}_1 + \frac{\partial}{\partial\phi}(pF) = 0$.

Here, this equation contains ϕ dependence; thus it cannot be directly used to predict λ1.

We remove the ϕ dependence from Eq. (57a) by multiplying it by a weight, σ1, that depends only on ϕ and integrating over [0, +∞). We choose the weight σ1=ϕ because the exponential distribution is to be used for predicting the mean value, based on the argument in Sect. 4.2.

Thus,

(57b) $\left(\frac{\langle\phi\rangle}{\lambda_1} - \langle\phi^2\rangle\right)\dot{\lambda}_1 - \langle F\rangle = 0$,

or by further noting $\langle\phi\rangle = 1/\lambda_1$ and $\langle\phi^2\rangle = 2/\lambda_1^2$,

(58a) $\dot{\lambda}_1 = -\lambda_1^2\langle F\rangle$.

This equation states that when there is a positive mean source, ⟨F⟩>0, the slope of the distribution becomes gentler, transporting probability to larger values, whereas a mean sink (negative source) steepens the distribution. The above equation can readily be solved analytically, and we obtain

(58b) $\lambda_1(t) = \left[\frac{1}{\lambda_1(0)} + \int_0^t\langle F\rangle\,dt\right]^{-1}$.

The significance of the above result may be best interpreted by re-writing it for the mean value:

(58c) $\langle\phi\rangle = \lambda_1(t)^{-1} = \langle\phi\rangle|_{t=0} + \int_0^t\langle F\rangle\,dt$.

This is the consistent evolution of the mean state under the assumed exponential distribution.
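As a minimal numerical sketch (with an assumed constant mean source ⟨F⟩ and illustrative variable names), Eq. (58a) can be integrated directly and checked against the closed form (Eq. 58c):

```python
# Evolve the exponential-PDF parameter λ1 by Eq. (58a), λ̇1 = −λ1²⟨F⟩,
# for a constant mean source ⟨F⟩ (an assumed value for this sketch).

def evolve_lambda1(lam0, mean_F, t, nsteps=100000):
    lam, dt = lam0, t / nsteps
    for _ in range(nsteps):            # forward Euler on Eq. (58a)
        lam += dt * (-lam**2 * mean_F)
    return lam

lam = evolve_lambda1(2.0, 0.5, 3.0)
# Eq. (58c): ⟨ϕ⟩(t) = ⟨ϕ⟩(0) + ⟨F⟩ t; here 1/2 + 0.5 × 3 = 2
mean_exact = 1.0 / 2.0 + 0.5 * 3.0
```

The mean 1/λ1 obtained from the time-stepped parameter coincides with the analytic evolution of the mean, illustrating the built-in consistency of the constraint σ1=ϕ.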

Here, the weight, σ1=ϕ, has been chosen above in a manner consistent with the fact that the exponential distribution has been derived from the maximum-entropy principle, taking the mean as the constraint. Yet, the general formulation presented in Sect. 5.1 can be used to predict any constraint defined by the weight, σ1, consistently with time under a given assumed distribution. Thus, a natural question to ask is, how sensitive is the evolution of the PDF parameter, λ1, to the choice of the weight, σ1, for the constraint? To address this question, we now set the weight more generally to σ1=ϕ^n with an unspecified integer, n. In this case, we evaluate the evolution of the assumed distribution (Eq. 24) in such a manner that ⟨ϕ^n⟩ evolves consistently. As a result, the prediction of the evolution of the parameter, λ1, is modified to achieve the best prediction of ⟨ϕ^n⟩ for the specified n, at the cost of deteriorating the prediction of the other moments. In particular, when we set n≠1, the prediction of the mean value is no longer optimized by the constraint with σ1=ϕ^n.

Consequently, instead of Eqs. (58b) and (58c), we obtain

(59a) $\lambda_1(t) = \left[\frac{1}{\lambda_1^n(0)} + \frac{n}{n!}\int_0^t\langle F\phi^{n-1}\rangle\,dt\right]^{-1/n}$, (59b) $\langle\phi^n\rangle = \langle\phi^n\rangle|_{t=0} + n\int_0^t\langle F\phi^{n-1}\rangle\,dt$,

noting that $\langle\phi^n\rangle = n!/\lambda_1^n$. In this case, Eq. (59b) presents a consistent evolution of ⟨ϕ^n⟩.

Keep in mind that the solution of Eq. (59a) is implicit when F itself also depends on ϕ. A more explicit solution can be derived by setting it more specifically as, say, F=ϕ^m. Solving an equation equivalent to Eq. (57b), we find

(59c) $\lambda_1(t) = \left(\lambda_1^{m-1}(0) - \gamma_m t\right)^{1/(m-1)}$,

where

(59d) $\gamma_m = (m-1)\,\frac{(n+m-1)!}{n!}$

when m≠1.

It is clear that the parameter, λ1, evolves with a qualitatively different “rate”, γm, as n varies; thus the evolution of the assumed PDF is sensitive to the choice of the weight, σ1. It also follows that a proper choice of σ1 is crucial to ensure that an output, ⟨σ1⟩, of particular interest is consistently predicted. A simple way to achieve this consistency is to solve a prognostic equation for ⟨σ1⟩ in terms of λ1. The general formulation presented in Sect. 5.1 is constructed in this manner.

Here, however, there is a serious problem with the above solution (Eq. 59c): the exponential distribution continuously flattens with time, and the distribution becomes totally homogeneous at $t = \lambda_1^{m-1}(0)/\gamma_m$; then λ1=0, and the solution breaks down beyond this point, with λ1 becoming a complex number. Such a collapse of the distribution is a dramatic example suggesting an inherent limitation of the assumed-PDF approach. Figure 2 plots the obtained time series of λ1 with m=0, 2, 3, 4 in panels (a)–(d) with the weight exponents n=1 (solid), n=2 (long dash), and n=3 (short dash). The discrepancy between the solutions with different weights, σ1, is rapidly exacerbated with a higher-order dependence of F on ϕ (i.e., with increasing m: Fig. 2b–d). The exception is the case with m=0, where γ0<0; thus this singularity is never reached (Fig. 2a). Here, we set the initial condition as λ1(0)=1. This is equivalent to normalizing the PDF parameter and the time into $\lambda_1/\lambda_1(0)$ and $\lambda_1^{1-m}(0)t$, respectively.
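Equations (59c) and (59d), together with the breakdown time $t^* = \lambda_1^{m-1}(0)/\gamma_m$, can be evaluated in a few lines; this is a sketch with our own variable names:

```python
from math import factorial

def gamma_m(m, n):
    # Eq. (59d): γ_m = (m−1)(n+m−1)!/n!
    return (m - 1) * factorial(n + m - 1) / factorial(n)

def lam1(t, m, n, lam1_0=1.0):
    # Eq. (59c); valid for m ≠ 1 and, when γ_m > 0, only for t < t*
    return (lam1_0**(m - 1) - gamma_m(m, n) * t) ** (1.0 / (m - 1))

def t_star(m, n, lam1_0=1.0):
    # breakdown time at which λ1 → 0 and the distribution flattens out
    return lam1_0**(m - 1) / gamma_m(m, n)
```

For m=2 and n=1, for example, γ2=2 and the solution collapses at t*=1/2; for m=0, γ0 is negative, so the breakdown time is never reached.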


Figure 2. Plot of the obtained time series of λ1 with m=0 (a), 2 (b), 3 (c), and 4 (d) and with the weight exponents n=1 (solid), n=2 (long dash), and n=3 (short dash). The values of λ1 evaluated from ⟨ϕ^n⟩ obtained using the exact solutions are further overlaid as green curves.


The values of λ1 evaluated from ⟨ϕ^n⟩ with the exact solutions in Appendix B are further overlaid as green curves; further discussion of the exact solutions is found therein. In the case with m=0, we find satisfactory agreement of the PDF parameter, λ1, between the solution (Eq. 59c) and the exact results (Fig. 2a). However, due to the singularities discussed above for m≥2, the diagnosed solution (Eq. 59c) rapidly deviates from the exact results (Fig. 2b–d).

The case with m=1 must be considered separately, and in this case we find, for all n values,

(59e) $\lambda_1(t) = \lambda_1(0)\,e^{-t}$.

As it turns out, this is an exact solution of the evolution of the system, as shown in Appendix B.

5.6 Application 2: Gaussian distribution

The second application is the Gaussian distribution, especially because this is a standard distribution assumed in data assimilation. The Gaussian distribution is also often assumed in subgrid-scale distribution problems. Here, the simplest case with a single variable is considered, as given by Eq. (25). Here, p0=λ0, and we take the mean value, $\langle\phi\rangle = -\lambda_1/2\lambda_2$, to be a PDF parameter in place of λ1. From the normalization condition,

(60) $p_0 = \left(\frac{\lambda_2}{\pi}\right)^{1/2}$.

This diagnostic relation can be used to update the distribution constant p0 in place of updating it by integrating Eq. (47).

We further note that

(61a) $\frac{\partial p}{\partial\langle\phi\rangle} = 2\lambda_2(\phi - \langle\phi\rangle)\,p$, (61b) $\frac{\partial p}{\partial\lambda_2} = -(\phi - \langle\phi\rangle)^2\,p$.

By substituting these expressions into Eq. (50a), we obtain

(62) $2\lambda_2\,\dot{\langle\phi\rangle}\left[\langle\sigma_l(\phi-\langle\phi\rangle)\rangle - \langle\sigma_l\rangle\langle\phi-\langle\phi\rangle\rangle\right] + \dot{\lambda}_2\left[-\langle\sigma_l(\phi-\langle\phi\rangle)^2\rangle + \langle\sigma_l\rangle\langle(\phi-\langle\phi\rangle)^2\rangle\right] = \left\langle F\frac{\partial\sigma_l}{\partial\phi}\right\rangle$.

Note that an integration by parts is applied to the last term in Eq. (50a) to obtain the right-hand side above. Note also that $\langle\phi-\langle\phi\rangle\rangle = 0$, and thus

(63) $2\lambda_2\,\dot{\langle\phi\rangle}\,\langle\sigma_l(\phi-\langle\phi\rangle)\rangle + \dot{\lambda}_2\left[-\langle\sigma_l(\phi-\langle\phi\rangle)^2\rangle + \langle\sigma_l\rangle\langle(\phi-\langle\phi\rangle)^2\rangle\right] = \left\langle F\frac{\partial\sigma_l}{\partial\phi}\right\rangle$.

Recall that the Gaussian distribution is obtained from the maximum-entropy principle when a system is constrained by the mean and the variance; thus we set σ1 = ϕ−⟨ϕ⟩ and σ2 = (ϕ−⟨ϕ⟩)² in the above. With l=1, we obtain

(64a) $2\lambda_2\langle(\phi-\langle\phi\rangle)^2\rangle\,\dot{\langle\phi\rangle} - \langle(\phi-\langle\phi\rangle)^3\rangle\,\dot{\lambda}_2 = \langle F\rangle$

and, with l=2,

(64b) $2\lambda_2\langle(\phi-\langle\phi\rangle)^3\rangle\,\dot{\langle\phi\rangle} + \left[-\langle(\phi-\langle\phi\rangle)^4\rangle + \langle(\phi-\langle\phi\rangle)^2\rangle^2\right]\dot{\lambda}_2 = 2\langle(\phi-\langle\phi\rangle)F\rangle$.

Note that the Gaussian distribution is not skewed; thus $\langle(\phi-\langle\phi\rangle)^3\rangle = 0$. Also note the relations

(65a) $\langle(\phi-\langle\phi\rangle)^2\rangle = \frac{1}{2\lambda_2}$, (65b) $\langle(\phi-\langle\phi\rangle)^4\rangle = 3\langle(\phi-\langle\phi\rangle)^2\rangle^2$.

By substituting these relations into Eqs. (64a) and (64b), we obtain the final results:

(66a) $\dot{\langle\phi\rangle} = \langle F\rangle$, (66b) $\frac{d}{dt}\frac{1}{\lambda_2} = 4\langle(\phi-\langle\phi\rangle)F\rangle$.

Equation (66a) simply means that the mean value evolves by following a tendency defined by the mean source, whereas Eq. (66b) suggests that the distribution is more dispersed when a more positive source is found away from the mean value.
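The Gaussian moment relations invoked above (Eqs. 65a and 65b) are easy to verify by direct quadrature; the parameter values in the following sketch are arbitrary illustrative choices:

```python
import numpy as np

# Quadrature check of Eqs. (65a) and (65b) for the Gaussian
# p = (λ2/π)^(1/2) exp(−λ2 (ϕ − ⟨ϕ⟩)²); λ2 and the mean are arbitrary.

lam2, mean = 0.8, 1.5
phi = np.linspace(mean - 20.0, mean + 20.0, 200001)
dphi = phi[1] - phi[0]
p = np.sqrt(lam2 / np.pi) * np.exp(-lam2 * (phi - mean)**2)

moment = lambda k: float(np.sum((phi - mean)**k * p) * dphi)

var = moment(2)       # Eq. (65a): should equal 1/(2 λ2)
fourth = moment(4)    # Eq. (65b): should equal 3 ⟨(ϕ−⟨ϕ⟩)²⟩²
```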

5.7 Diffusion problem

The diffusion equation (Eq. 29) considered in Sect. 4.4 is a particular problem that can be solved exactly by the Gaussian distribution (Eq. 25). We note that the diffusion equation (Eq. 29), re-setting x=ϕ here, is obtained from the Liouville equation (Eq. 26) by setting the forcing to be an operator, $F = -\partial/\partial\phi$. As a result, the general formulation presented so far can directly be applied to the diffusion equation: by substituting the derivative relations (Eqs. 61a and b) into Eq. (49a) and also noting that

(67) $\frac{\partial^2 p}{\partial\phi^2} = \left[-2\lambda_2 + 4\lambda_2^2(\phi-\langle\phi\rangle)^2\right]p$,

we obtain

(68) $2\lambda_2(\phi-\langle\phi\rangle)\,\dot{\langle\phi\rangle} - \left[(\phi-\langle\phi\rangle)^2 - \frac{1}{2\lambda_2}\right]\left[\dot{\lambda}_2 + 4\lambda_2^2\right] = 0$.

Note that the same can also be obtained by directly substituting Eq. (25) into Eq. (29). The above equation can be solved for the two parameters, ϕ and λ2, independently of ϕ by setting

(69a) $\dot{\langle\phi\rangle} = 0$, (69b) $\dot{\lambda}_2 + 4\lambda_2^2 = 0$.

The same pair as the above is more directly obtained by substituting $\langle F\rangle = 0$ and $\langle(\phi-\langle\phi\rangle)F\rangle = 1$ into Eqs. (66a) and (66b). Solving Eqs. (69a) and (69b), we arrive at the solution (Eq. 32).
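The solution pair (Eqs. 69a and 69b) can be checked against a direct finite-difference integration of the diffusion equation; this is an illustrative sketch with an arbitrary grid and initial parameters:

```python
import numpy as np

# The Gaussian (Eq. 25) with ⟨ϕ⟩ fixed and 1/λ2 = 1/λ2(0) + 4t (from
# Eq. 69b) should match an explicit finite-difference solution of
# ∂p/∂t = ∂²p/∂ϕ² started from the same initial Gaussian.

def gauss(phi, mean, lam2):
    return np.sqrt(lam2 / np.pi) * np.exp(-lam2 * (phi - mean)**2)

phi = np.linspace(-10.0, 10.0, 401)
dx = phi[1] - phi[0]
dt = 0.2 * dx**2                       # stable explicit time step
mean, lam2_0, t_end = 0.0, 1.0, 0.5

p = gauss(phi, mean, lam2_0)
t = 0.0
while t < t_end - 1e-12:
    lap = np.zeros_like(p)
    lap[1:-1] = (p[2:] - 2 * p[1:-1] + p[:-2]) / dx**2
    p = p + dt * lap                   # explicit Euler diffusion step
    t += dt

lam2_t = 1.0 / (1.0 / lam2_0 + 4.0 * t)    # Eq. (69b) integrated
err = float(np.max(np.abs(p - gauss(phi, mean, lam2_t))))
```

The maximum pointwise discrepancy is at the level of the finite-difference truncation error, confirming that the assumed Gaussian is an exact solution here.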

5.8 Application 3: gamma distribution

The third example to consider is the gamma distribution (cf. Sect. 3.4.3):

(70a) $p = p_0\,\phi^{\mu}e^{-\lambda\phi}$,

where

(70b) $p_0 = \frac{\lambda^{\mu+1}}{\Gamma(\mu+1)}$

from the normalization condition. Recall that the gamma function, Γ(x), is defined by

(70c) $\Gamma(x) = \int_0^{+\infty}\xi^{x-1}e^{-\xi}\,d\xi$.

We note the expressions for the derivatives by the distribution parameters:

(71a) $\frac{\partial p}{\partial\mu} = p\log\phi$, (71b) $\frac{\partial p}{\partial\lambda} = -\phi\,p$.

By substituting these two expressions into Eq. (50a), we obtain

(72) $\dot{\mu}\left[\langle\sigma_l\log\phi\rangle - \langle\sigma_l\rangle\langle\log\phi\rangle\right] + \dot{\lambda}\left[-\langle\sigma_l\phi\rangle + \langle\sigma_l\rangle\langle\phi\rangle\right] = \left\langle F\frac{\partial\sigma_l}{\partial\phi}\right\rangle$

for l=1, 2. As before, we set σ1=ϕ and σ2=ϕ². We note especially

(73a) $\langle\phi\log\phi\rangle = \frac{1}{\lambda}\left[(\mu+1)\langle\log\phi\rangle + 1\right]$, (73b) $\langle\phi^2\log\phi\rangle = \frac{1}{\lambda^2}\left[(\mu+1)(\mu+2)\langle\log\phi\rangle + 2\mu+3\right]$.

Thus, with l=1 and 2, respectively, we obtain

(74a) $\frac{\dot{\mu}}{\lambda} - \frac{(\mu+1)\dot{\lambda}}{\lambda^2} = \langle F\rangle$, (74b) $\frac{2\mu+3}{\lambda^2}\dot{\mu} - \frac{2(\mu+2)(\mu+1)}{\lambda^3}\dot{\lambda} = 2\langle\phi F\rangle$.

By combining Eqs. (74a) and (74b), we obtain the equations for the tendencies of the two PDF parameters as

(75a) $\dot{\mu} = 2\lambda\left[(\mu+2)\langle F\rangle - \lambda\langle\phi F\rangle\right]$, (75b) $\dot{\lambda} = \frac{\lambda^2}{\mu+1}\left[(2\mu+3)\langle F\rangle - 2\lambda\langle\phi F\rangle\right]$.
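The pair (75a)–(75b) translates directly into code; the sketch below (with our own variable names) also illustrates the built-in consistency that, for a constant source F=c, the implied mean tendency $d\langle\phi\rangle/dt$ recovers ⟨F⟩=c:

```python
def gamma_tendencies(mu, lam, mean_F, mean_phiF):
    """Eqs. (75a) and (75b): tendencies of the gamma-PDF parameters,
    given the moments mean_F = ⟨F⟩ and mean_phiF = ⟨ϕF⟩."""
    mu_dot = 2.0 * lam * ((mu + 2.0) * mean_F - lam * mean_phiF)
    lam_dot = lam**2 / (mu + 1.0) * ((2.0 * mu + 3.0) * mean_F
                                     - 2.0 * lam * mean_phiF)
    return mu_dot, lam_dot

# Consistency check for a constant source F = c: ⟨ϕF⟩ = c(μ+1)/λ, and the
# implied mean tendency d⟨ϕ⟩/dt = μ̇/λ − (μ+1)λ̇/λ² must equal c.
mu, lam, c = 1.0, 2.0, 0.5
mu_dot, lam_dot = gamma_tendencies(mu, lam, c, c * (mu + 1.0) / lam)
dmean = mu_dot / lam - (mu + 1.0) * lam_dot / lam**2
```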
6 Demonstration: condensation growth of cloud droplets

The general formulation for directly predicting the evolution of the assumed-PDF (DDF) parameters was presented in the last section. The purpose of this section is to demonstrate the steps of this formulation by taking the condensation growth of cloud droplets as an example. The output-constrained distribution introduced in Sect. 4.2 is also invoked in choosing a distribution form. It is known that the size, r, of a cloud droplet grows with a rate proportional to 1/r under a fixed state of super-saturation (cf. Chap. 6, Rogers and Yau1989). Thus, by setting a proportionality constant to unity, the governing equation (Eq. 2) of this system becomes

(76a) $\dot{r} = \frac{1}{r}$,

with ϕ=r and F=1/r. In this case, p(r) becomes a number density of drops with a radius r, or PSD. Here, we normalize the PSD with Eq. (13).

The system of Eq. (76a) can be solved analytically, and the general solution is

(76b) $r(t) = \left(r_0^2 + 2t\right)^{1/2}$,

with r0 being the initial condition. Consequently, when an initial distribution of droplets is given by

(77) $p(r, t=0) = p_0(r)$,

its subsequent evolution is defined by

(78a) $p(r,t)\,dr = p_0(r_0)\,dr_0$

by following the chain rule of Eq. (20). Here, the initial condition, r0=r(0|r,t), is related to the droplet size, r, at the time t by

(78b) $r_0 = r(0|r,t) = \left(r^2 - 2t\right)^{1/2}$.

From Eq. (78b), we find $dr_0/dr = r/r_0$, and by substituting this result into Eq. (78a), we obtain

(78c) $p = (r/r_0)\,p_0$,

but p(r,t)=0 for $r^2 < 2t$. In this manner, the problem of the time evolution of the droplet distribution under condensation growth is solved analytically.

As a specific initial distribution, we set

(79) $p(r, t=0) = \frac{\lambda^{\mu+1}}{\Gamma(\mu+1)}\,r^{\mu}e^{-\lambda r}$,

with μ=1 and λ=1. The exact time evolution following this initialization, which obeys Eq. (78a) or (78c), is shown in Fig. 3a with curves for t=0–4 at an interval of Δt=1. The whole distribution moves with time to larger sizes, r, as seen in Fig. 3a, because of condensational growth, as expected, but it is also squeezed by the factor, r/r0, with time, as predicted from the analytical solution (Eq. 78c). The rate of squeezing is larger towards smaller r0, forming a shockwave front at the minimum droplet size,

(80) $r = (2t)^{1/2}$.

The peak also sharpens as it moves to larger values with time. The pronounced narrowing tendency of the distribution with time is well known in the literature. Remarkably, it leads to a decrease in the standard deviation with time, as seen below.
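The analytic solution (Eqs. 78b, 78c, and 79) is straightforward to tabulate numerically. The sketch below (with μ=λ=1, and our own variable names) confirms that the total droplet number stays normalized and that, since $d(r^2)/dt = 2r\dot{r} = 2$ for every droplet, ⟨r²⟩ grows as $\langle r^2\rangle|_{t=0} + 2t$:

```python
import numpy as np

t = 1.0
r = np.linspace(np.sqrt(2 * t) + 1e-9, 40.0, 400001)
r0 = np.sqrt(r**2 - 2 * t)        # Eq. (78b): inverse characteristic
p0 = r0 * np.exp(-r0)             # Eq. (79) with μ = 1 and λ = 1
p = (r / r0) * p0                 # Eq. (78c); p = 0 for r² < 2t

dr = r[1] - r[0]
trap = lambda f: float(np.sum(0.5 * (f[1:] + f[:-1])) * dr)
norm = trap(p)                    # should remain 1 (number conservation)
m2 = trap(r**2 * p)               # ⟨r²⟩ = 6 + 2t = 8 here
```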


Figure 3. Time evolution of the cloud-droplet size distribution under condensation growth with the curves corresponding to t=0 (solid) to t=4 (double-chain dash) at an interval of Δt=1: (a) exact result and (b) when a gamma distribution is assumed. Note that the whole distribution moves to a larger droplet size with time.


In the following, the two assumed PSDs are considered for demonstrations of the general formulation of the last section.

6.1 Gamma distribution

Considering the fact that our example is a microphysical problem, the most natural choice to consider is the gamma distribution (Eq. 70a), as a commonly adopted distribution in microphysics (cf. Sect. 3.4.3). A general formulation for this case is presented in Sect. 5.8. The only additional information required is to note $\langle F\rangle = \lambda/\mu$ and $\langle rF\rangle = 1$ for the system (Eq. 76a), recalling the definition (Eq. 52c) of the angle brackets and F=1/r. Substituting these, Eqs. (75a) and (75b) are solved by the fourth-order Runge–Kutta method with a time step of 10−2 and the initial condition given by μ=1 and λ=1. The result is shown in Fig. 3b with the same curves as in Fig. 3a. Obviously, it is not possible to reproduce the shockwave structure at the minimum size (Eq. 80) with a gamma distribution. However, it is still remarkable that the overall evolution of the size distribution is reasonably reproduced.
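A sketch of this integration follows (our own variable names; RK4 with the stated time step). As a check, because σ2=r² is one of the weights used in deriving Eqs. (75a) and (75b), the predicted $\langle r^2\rangle = (\mu+1)(\mu+2)/\lambda^2$ should follow the exact growth 6+2t:

```python
import numpy as np

def tendencies(y):
    # Eqs. (75a), (75b) with ⟨F⟩ = λ/μ and ⟨rF⟩ = 1 for F = 1/r
    mu, lam = y
    mean_F, mean_rF = lam / mu, 1.0
    mu_dot = 2.0 * lam * ((mu + 2.0) * mean_F - lam * mean_rF)
    lam_dot = lam**2 / (mu + 1.0) * ((2.0 * mu + 3.0) * mean_F
                                     - 2.0 * lam * mean_rF)
    return np.array([mu_dot, lam_dot])

def rk4_step(y, dt):
    k1 = tendencies(y)
    k2 = tendencies(y + 0.5 * dt * k1)
    k3 = tendencies(y + 0.5 * dt * k2)
    k4 = tendencies(y + dt * k3)
    return y + dt / 6.0 * (k1 + 2 * k2 + 2 * k3 + k4)

y, dt = np.array([1.0, 1.0]), 1e-2   # initial μ = λ = 1, as in the text
for _ in range(100):                 # integrate to t = 1
    y = rk4_step(y, dt)
mu, lam = y
mean_r = (mu + 1.0) / lam            # predicted mean size ⟨r⟩
m2 = (mu + 1.0) * (mu + 2.0) / lam**2
```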

The result here is especially remarkable because the computation of the evolution of the distribution involves only the time integration of two parameters, μ and λ; the full distribution is then automatically determined from the assumed gamma-distribution form. In a standard approach of directly evaluating the evolution of a distribution by integrating Eq. (26) with time, we need to introduce a number of values of r at which we wish to evaluate it. For example, in Fig. 3a and b, 100 points are used for plotting the distribution curves at each time. Furthermore, for numerically integrating the Liouville equation (Eq. 26), we need to take r large enough that we can set p=0 as the boundary condition at the largest r: this requirement further increases the number of points required for the computation by, say, a factor of 10. In standard bin microphysics, 30 points are considered the minimum (cf. Khain et al.2015). The assumed-PDF (DDF) approach adopted here enables us to compute the same evolution with only the two parameters, instead of many bins in r.

To verify the prediction of statistics, Fig. 4 plots the time evolutions of (a) the average size, ⟨r⟩, and (b) the standard deviation, $\langle(r-\langle r\rangle)^2\rangle^{1/2}$, with $\langle r\rangle = (\mu+1)/\lambda$ and $\langle(r-\langle r\rangle)^2\rangle = (\mu+1)/\lambda^2$. Here, the exact evolutions are shown as solid curves, whereas the approximate predictions with the assumed gamma distribution are shown as long-dash curves. Keep in mind that from the point of view of the maximum-entropy principle, the gamma distribution is designed to predict only a mean value properly; the additional algebraic correction factor does not result from any “constraint” from the point of view of the large-deviation principle (cf. Sect. 3.3.2). These predictions are accurate only within the limits of an assumed distribution. In fact, the prediction of the mean (Fig. 4a) is almost perfect, although that of the standard deviation deviates noticeably from the actual evolution with time (Fig. 4b).


Figure 4. Time evolutions of (a) the average size, ⟨r⟩, and (b) the standard deviation, $\langle(r-\langle r\rangle)^2\rangle^{1/2}$, of droplets undergoing condensation growth. Here, exact evolutions are shown as solid curves, whereas the predictions with the assumed gamma distribution are shown as long-dash curves.


6.2 Exponential distribution

We further simplify the assumed distribution into an exponential so that there is only a single parameter, λ1, in the distribution (cf. Eq. 57a). In this case, the assumed distribution no longer fits the actual evolution initiated with a gamma distribution (Eq. 79) as r→0.

Here, we face a technical problem in adopting the exponential distribution for this system because the integral for ⟨F⟩ diverges due to the singularity of the source, F, at r=0. To avoid this problem, we set λ1 of the exponential distribution such that it gives the same mean size, ⟨r⟩, as the gamma distribution given by Eq. (79) (i.e., λ1=λ/(1+μ)), and then we adopt the formula, ⟨F⟩=λ/μ, directly from the gamma distribution. Thus, ⟨F⟩=(1+μ)λ1/μ. Here, μ is kept the same as in the initial condition of the distribution (Eq. 79). Under this assumption, the time integral of Eq. (58a) can be performed, and the evolution of the mean size can be evaluated by Eq. (58c). The evolution of the standard deviation is also evaluated as 1/λ1.
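Under this borrowed formula, Eq. (58a) becomes $\dot{\lambda}_1 = -(1+\mu)\lambda_1^3/\mu$ and integrates in closed form; the following is a sketch with μ=1 and our own variable names:

```python
mu = 1.0   # kept at the initial value of Eq. (79)

def lam1_closed(t, lam1_0):
    # d(λ1⁻²)/dt = 2(1+μ)/μ  ⇒  λ1(t) = [λ1(0)⁻² + 2(1+μ)t/μ]^(−1/2)
    return (lam1_0**-2 + 2.0 * (1.0 + mu) * t / mu) ** -0.5

def lam1_euler(t, lam1_0, nsteps=100000):
    # forward Euler on Eq. (58a) with ⟨F⟩ = (1+μ)λ1/μ
    lam, dt = lam1_0, t / nsteps
    for _ in range(nsteps):
        lam += dt * (-(1.0 + mu) * lam**3 / mu)
    return lam
```

The predicted mean size, 1/λ1, then grows as $[\langle r\rangle^2(0) + 4t]^{1/2}$ for μ=1, mirroring the square-root growth of the exact solution.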

These results are shown in Fig. 5 in the same format as in Fig. 4; the evolution of the standard deviation (long-dash curve in panel b) now becomes totally opposite to the actual tendency (solid curve). However, the evolution of the mean size is still predicted in an overall consistent manner (panel a). These results are simply consistent with the assertion in Sect. 4.2 that if the sole purpose of using a distribution is to predict the evolution of a mean, the exponential distribution is sufficient for the purpose. Here, this assertion is supported to a reasonable extent, even though the actual distribution does not fit the exponential distribution at all.


Figure 5. The same as in Fig. 4, but the long-dash curves are obtained when an exponential distribution is assumed for the prediction.


Appendix C presents further demonstrative mathematical examples.

7 Conclusions and discussions

Distribution problems are identified in various contexts of atmospheric sciences: subgrid-scale distributions, the size distribution of hydrometeors, and probabilities in data assimilation. Considering the dispersed literature on these three distribution problems, the present paper has tried to set up a general perspective encompassing them all. As noted in the introduction, the majority of approaches to the distribution problems are based on a framework that can be termed “assumed PDF”, in which a distribution to be simulated is approximated by a distribution form characterized by only a few free parameters (cf. Golaz et al.2002).

The present work has attempted to answer the basic questions of how such a simple assumed PDF can best be chosen and how the evolution of its PDF parameters can be evaluated consistently. To address these basic questions, it is important to realize that not every statistical aspect of a distribution can be predicted by the assumed-PDF approaches either accurately or consistently. More specifically, the number of statistical variables (not limited to simple moments) that can be consistently predicted by an assumed-PDF approach cannot exceed the number of parameters introduced by the given assumed PDF. Thus, the best that we can accomplish under an assumed-PDF approach is to predict the same number of statistical variables as that of the assumed-PDF free parameters. The formulation proposed in the present study is designed to accomplish exactly this.

The present study next notes (Sect. 4.2) that in many atmospheric applications, a given host model does not require the full form of a distribution as an output from a distribution model; instead, it requires only a limited number of statistical quantities that correspond to the physical variables of interest in the host model (e.g., domain-averaged total condensed water, cloud fraction) as outputs. Thus, the most desirable formulation would be to predict the evolution of those statistical outputs required in a given host model consistently. The final general formulation presented in Sect. 5.1 is constructed exactly by following this recipe: the time evolution of the assumed-PDF parameters, {λl}, is evaluated by Eq. (50a), which is equivalent to solving the prediction (Eq. 52b) for the required model outputs, {⟨σl⟩}, in terms of {λl}.

Probably the most novel aspect of the present study is in deriving equations for the assumed-PDF parameters and predicting them directly. This proposed alternative approach can overcome the difficulties of the current standard method, which requires a difficult mapping between the moments of the PDF (mean, variance, skewness, etc. of the overall PDF) and the PDF parameters (e.g., the mean of each Gaussian component in a double-Gaussian PDF; cf. Lewellen and Yoh1993; Machulskaya2015; Milbrandt and Yau2005). Furthermore, by introducing a re-interpretation of the maximum-entropy principle (cf. Sect. 3.3) in Sect. 4.2, we propose to adopt those required statistical outputs as “constraints” defining the assumed-PDF form for a given problem, thereby resolving the common problem of how to choose that form, which is usually done rather arbitrarily. The paper has also emphasized that approaches trying to choose a distribution form based on analyses of observational or simulation data found in the literature lack objectivity (cf. Sect. 3.2.2).

In the present study, the possible model outputs and constraints have been limited to relatively simple statistical quantities of the form {σl}. However, generalizations to physically more significant variables, such as the domain-averaged total condensed water and the cloud fraction, are conceptually straightforward. For example, the cloud fraction can formally be defined in terms of the DDF, p(qt), of the total water, qt, as

(81) \int_{q^*}^{+\infty} p(q_t) \, \mathrm{d}q_t ,

where q^* is the saturation moisture at a given height. Following a similar line of argument to that presented in Sect. 5, a prognostic equation for the cloud fraction, as defined above, can be derived by integrating Eq. (49a) over the range [q^*, +\infty), although a full derivation is left for a future paper. Alternatively, a prognostic equation for the consistent DDF parameters can be derived more directly from the prognostic equation for the cloud fraction (Eq. 81), written down similarly to Eq. (52b). A consistent assumed-distribution form can also be derived by introducing the integral (81) as a constraint for the maximum-entropy principle.
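Once a specific DDF is assumed, the tail integral (81) reduces to a closed form. The following sketch is our own illustration rather than part of the formulation above: it assumes a single-Gaussian DDF for q_t, and the function name and parameter values are hypothetical.

```python
import math

def cloud_fraction(q_mean, q_sigma, q_sat):
    """Cloud fraction C = integral of p(q_t) from q* to +infinity (Eq. 81),
    assuming a single-Gaussian DDF with mean q_mean and standard deviation
    q_sigma; the Gaussian tail integral reduces to a complementary error
    function."""
    z = (q_sat - q_mean) / (math.sqrt(2.0) * q_sigma)
    return 0.5 * math.erfc(z)

# A grid box that is saturated on average is exactly half cloudy here:
print(cloud_fraction(q_mean=10.0, q_sigma=1.0, q_sat=10.0))  # 0.5
```

Under a different assumed DDF (e.g., a double Gaussian), the erfc expression would simply be replaced by the corresponding tail integral.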

An obvious consequence of this formulation is that the evolution of the PDF (DDF) is predicted differently depending on the choice of the weights, {σl}. However, this should not be considered an inherent shortcoming of the present formulation. Rather, one should realize that it reflects the fundamental limitation of the assumed-PDF (DDF) approach: as already emphasized, such an approach cannot predict every statistic of a distribution accurately. Simply, a different choice of the set, {σl}, consistently predicts a different set of statistical outputs. This point has been explicitly demonstrated in Sect. 5.5. Thus, our best advice is to choose {σl} to be the outputs actually required by the host model so that precisely those are predicted consistently under a given assumed-PDF form.

The present work is built upon the solid basis of two well-known mathematical and physical principles: (1) the maximum-entropy principle, which guides the determination of the most likely distribution of a problem, and (2) the Liouville equation, which predicts the time evolution of distributions. The stochastic-collection equation for the hydrometeor size distribution can also be treated in a manner analogous to the Liouville equation. However, adopting these two basic principles in practical atmospheric problems is not straightforward, and we face two major difficulties. First, application of the maximum-entropy principle to atmospheric problems is hampered by the fact that it is often not obvious how to identify the physical constraints required for determining a distribution (cf. Yano et al.2016). Second, though the Liouville equation permits a formal prediction of distributions, for both DDF and PDF, its direct use would entail enormous numerical cost. We have addressed these two difficulties by adopting a simple distribution form containing only a small number of parameters so that its evolution with time can be predicted in terms of those parameters alone. In principle, the formulations presented in this study are applicable to any atmospheric model in a straightforward manner, albeit with the required coding and testing, because the term, F, for the physical tendency in the Liouville Eq. (26) is left unspecified in the general formulations. This straightforwardness includes no need to introduce any extra closure, say, one based on a turbulence model, as is partially the case with CLUBB (Cloud Layers Unified By Binormals; Golaz et al.2002; Larson and Golaz2005; Larson et al.2019), so that higher-order moments can be expressed in terms of the moments considered under a given truncation of the system.

As a demonstrative example with a specified F, the paper has considered the condensational growth of cloud droplets. A rather unusual feature of this demonstration is that its performance is compared with an exact numerical result: the flexibility of the proposed formulation makes such a comparison straightforward. Additional mathematical demonstrative examples are found in Appendix C. Further examples, with further elaborations of the methodology, are found in the study by Yano (2024). We also anticipate that fuller applications of the developed formulation are still to follow, especially because the only way to evaluate the accuracy of the method is to directly compare the results with more accurate, sophisticated evaluations, as presented in Sects. 5.5 and 6 and Appendix C herein, as well as in Yano (2024). Yet, it is hoped that this simple example provides a concrete idea of how to use this general formulation and that readers can already apply the formulation to their own problems.

Two relatively distinct steps are involved in the present formulation: (1) determination of the PDF form based on the output-constrained distribution principle, as a re-interpretation of the maximum-entropy principle, and (2) derivation of prognostic equations for the PDF parameters from the Liouville equation. Thus, whenever new types of output “constraints” are introduced into a problem, a new general PDF form must first be derived based on step 1, and a new set of prognostic equations must then be derived based on step 2. Alternatively, the two steps can be adopted separately. For example, the evolution of a PDF form defined under the output-constrained distribution principle can be evaluated by a more traditional assumed-PDF approach based on moments. Conversely, currently existing assumed-PDF schemes can be re-written based on step 2 without changing their assumed-PDF forms.

The general formulation for the distribution problem presented here constitutes, first of all, a natural extension of the existing assumed-PDF (DDF) approaches for subgrid-scale distributions, as discussed in Sect. 5.3. The work further suggests that the same general formulation is also applicable to other distribution problems, including cloud microphysics and data assimilation. Extensive further possibilities of the proposed formulation are yet to be fully explored. For example, the constraints introduced by Eq. (12) have assumed a fixed integration range. The cloud-fraction problem, on the other hand, takes as an output an integral with respect to the total water above the saturation value (cf. Eq. 81), whose lower limit changes with time. This requires a further generalization of the formulation (e.g., with Eq. 47).

The proposed prognostic PDF formulation predicts the required outputs (“constraints”) with good overall accuracy, except for cases where the evolution of the assumed-PDF parameters presents singularities. Such undesirable behaviors stem from the highly truncated PDF form of the assumed-PDF formulation. When a multi-dimensional system is considered, as in Yano (2024), the evolution of the PDF parameters can even become chaotic when the given system behaves chaotically, even though the evolution of the distribution itself is statistically stable. This is an inherent limitation of an assumed-PDF approach with a limited number of PDF parameters.

Furthermore, there is a key issue left unaddressed: the evaluation of the integrals denoted by 〈…〉 throughout the paper. In the present study, all the integrals of the problem have been performed analytically, apart from those including an unspecified physical-tendency term, F. Performing the latter integrals in general, especially with a complex physical-tendency term, is, however, not trivial. The most flexible approach currently available is Monte Carlo integration (e.g., Gentle2003; Larson and Schanen2013). However, this approach is numerically rather expensive and can hardly be considered an ultimate answer. A key to the success of the prognostic assumed-PDF formulation presented herein, especially our key result, Eq. (50a), is to perform those integrals in an efficient manner, albeit possibly with numerical aid. The integral problems can in general be greatly facilitated by adopting Laplace's method of asymptotic expansion, and more specifically by invoking Watson's lemma (cf. Chap. 6, Bender and Orszag1978), considering the fact that more or less all the types of distributions of interest here decay exponentially away from the maximum. In particular, all these integrals reduce to an asymptotic expansion in terms of gamma functions if a Taylor expansion of a subdominant contribution is possible. The more strongly a distribution is peaked, the fewer terms are required.
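As a minimal sketch of the gamma-function expansion (our own example, not one from the paper): for an exponential distribution with rate λ, the average ⟨cos ϕ⟩ = ∫₀^∞ cos(ϕ) λ e^{-λϕ} dϕ has the closed form λ²/(1+λ²), and Watson's lemma recovers it by integrating the Taylor series of cos ϕ term by term against the moments ∫₀^∞ ϕⁿ λ e^{-λϕ} dϕ = n!/λⁿ.

```python
lam = 5.0  # rate of the exponential weight; larger lam means a sharper peak

# Closed form of <cos phi> = integral of cos(phi) lam e^{-lam phi} over [0, inf).
exact = lam ** 2 / (1.0 + lam ** 2)

def watson_partial_sum(lam, n_terms):
    """Watson's-lemma series: cos(phi) = sum (-1)^k phi^(2k)/(2k)! integrates
    term by term, with the moments (2k)!/lam^(2k), to sum (-1)^k lam^(-2k)."""
    return sum((-1.0) ** k / lam ** (2 * k) for k in range(n_terms))

for n in (1, 2, 4):
    print(n, abs(watson_partial_sum(lam, n) - exact))
# The error shrinks with each added term; the sharper the peak (larger lam),
# the fewer terms are required.
```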

Another aspect that has not been explicitly taken into account is stochasticity; its potential importance in atmospheric modeling can hardly be overemphasized (cf. Berner et al.2017). Formulations already exist for taking stochasticity into account as a generalization of the Liouville equation (cf. Risken1984).

Appendix A: Mathematical details of Sect. 4.4.2

We apply a Fourier integral

(A1) p(x, t) = \int_{-\infty}^{+\infty} \tilde{p}(k, t) \, e^{ikx} \, \mathrm{d}k

to the diffusion problem (Eq. 29). The Fourier transform for (Eq. A1) is given by

(A2) \tilde{p}(k, t) = \frac{1}{2\pi} \int_{-\infty}^{+\infty} p(x, t) \, e^{-ikx} \, \mathrm{d}x .

Thus, the initial condition is transformed into

(A3) \tilde{p}(k, t = 0) = \frac{1}{2\pi} \int_{-\infty}^{+\infty} (1 + \alpha x) \, e^{-\lambda x^2 - ikx} \, \mathrm{d}x .

The transform of the first term is straightforward, and

(A4) \int_{-\infty}^{+\infty} e^{-\lambda x^2 - ikx} \, \mathrm{d}x = e^{-k^2/4\lambda} \int_{-\infty}^{+\infty} e^{-\lambda (x + ik/2\lambda)^2} \, \mathrm{d}x = \sqrt{\frac{\pi}{\lambda}} \, e^{-k^2/4\lambda} .

To evaluate the second term in the transform (Eq. A3), we note the following relation:

(A5) x \, e^{-\lambda (x + ik/2\lambda)^2} = -\frac{1}{2\lambda} \frac{\partial}{\partial x} e^{-\lambda (x + ik/2\lambda)^2} - \frac{ik}{2\lambda} \, e^{-\lambda (x + ik/2\lambda)^2} .

Finally, in performing the inverse Fourier transform (Eq. A1), we re-factorize the exponent by noting

(A6) k^2 \left( t + \frac{1}{4\lambda} \right) - ikx = \left( t + \frac{1}{4\lambda} \right) \left[ k - \frac{ix}{2} \left( t + \frac{1}{4\lambda} \right)^{-1} \right]^2 + \frac{x^2}{4} \left( t + \frac{1}{4\lambda} \right)^{-1} .
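The transform integrals of this appendix can be spot-checked numerically; the following is our own verification sketch (parameter values illustrative), confirming Eq. (A4) by direct midpoint quadrature.

```python
import cmath
import math

lam, k = 1.5, 2.0  # illustrative values

# Left-hand side of Eq. (A4): the integrand decays like e^{-lam x^2},
# so the window [-10, 10] is effectively the whole real line.
h, n = 1.0e-3, 20000
lhs = h * sum(cmath.exp(-lam * x * x - 1j * k * x)
              for x in ((-10.0 + (j + 0.5) * h) for j in range(n)))

# Right-hand side: sqrt(pi/lam) e^{-k^2/(4 lam)}.
rhs = math.sqrt(math.pi / lam) * math.exp(-k * k / (4.0 * lam))
print(abs(lhs - rhs))  # agrees up to quadrature error
```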
Appendix B: Exact solutions of the problem in Sect. 5.5

The system with F = ϕ^m in Sect. 5.5 can be solved analytically, and the evolution of the PDF can also be obtained in closed form by following the methodology of Sect. 6; the resulting PDF is

(B1) p(\phi, t) = p(\phi_0(\phi, t), t = 0) \, \frac{\mathrm{d}\phi_0}{\mathrm{d}\phi} ,

where ϕ0=ϕ0(ϕ,t) is an initial condition that leads to a state, ϕ, at time, t.

Once an explicit PDF form is given, numerical integrals further provide the required moments, ⟨ϕ^n⟩. As a minor technicality, the actual integral is performed with respect to ϕ0 after transforming ϕ into ϕ0 because the distribution stretches to increasingly large values with time, so that the upper limit of the integral with respect to ϕ would need to be increased accordingly. The variable transformation stretches the integration range with respect to ϕ automatically while the integral is performed over a fixed range with respect to ϕ0. Once a moment, ⟨ϕ^n⟩, for a given n is obtained, a consistent exponent, λ1, can be evaluated by the formula

(B1a) \lambda_1 = \left( \frac{n!}{\langle \phi^n \rangle} \right)^{1/n}

by invoking the fact that the moments are given by

(B1b) \langle \phi^n \rangle = \frac{n!}{\lambda_1^n}

under an assumption of the exponential distribution. The parameter, λ1, estimated from Eq. (B1a) is added as green curves in Fig. 2.

The following subsections discuss the exact solutions for the specific choice of the power, m, that is required to accomplish those steps.

B1 When m=0

The system simply reduces to \dot{\phi} = 1, which is solved as ϕ = ϕ0 + t. It follows that ϕ0 = ϕ − t and dϕ0/dϕ = 1. By substituting these relations into Eq. (B1), we find

(B2) p(\phi, t) = \begin{cases} \lambda_1 e^{-\lambda_1 (\phi - t)} , & \phi \ge t , \\ 0 , & 0 \le \phi < t . \end{cases}

From this distribution, the moments can be readily evaluated analytically, and the first three moments are given by

(B3a) \langle \phi \rangle = \frac{1}{\lambda_1(0)} + t ,
(B3b) \langle \phi^2 \rangle = 2 \left[ \frac{1}{\lambda_1^2(0)} + \frac{t}{\lambda_1(0)} + \frac{t^2}{2} \right] ,
(B3c) \langle \phi^3 \rangle = 6 \left[ \frac{1}{\lambda_1^3(0)} + \frac{t}{\lambda_1^2(0)} + \frac{t^2}{2 \lambda_1(0)} + \frac{t^3}{6} \right] .

The values of λ1 obtained from the above results can directly be compared with those obtained by setting m=0 in Eqs. (59c) and (59d):

(B4) \lambda_1 = \left[ \frac{1}{\lambda_1(0)} + \frac{t}{n} \right]^{-1} ,

with varying n. The assumed formulation predicts a value of λ1 consistent with the constraint σ1 = ⟨ϕ^n⟩ up to O(t) in general, and the agreement is perfect for n = 1.
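This comparison can be reproduced with a short script (our own sketch; λ₁(0) = 1 is an illustrative choice). The exact λ₁ follows from the moments of the shifted exponential (Eq. B2) inverted through Eq. (B1a), and the assumed-PDF prediction is Eq. (B4).

```python
import math

lam0 = 1.0  # initial lambda_1; illustrative

def lam_exact(n, t):
    """Exact lambda_1: moments of the shifted exponential (Eq. B2),
    <phi^n> = sum_k C(n,k) t^(n-k) k!/lam0^k, inverted via Eq. (B1a)."""
    mom = sum(math.comb(n, kk) * t ** (n - kk) * math.factorial(kk) / lam0 ** kk
              for kk in range(n + 1))
    return (math.factorial(n) / mom) ** (1.0 / n)

def lam_assumed(n, t):
    """Eq. (B4): lambda_1 = [1/lam0 + t/n]^(-1) under the assumed
    exponential PDF."""
    return 1.0 / (1.0 / lam0 + t / n)

for n in (1, 2, 3):
    print(n, lam_exact(n, 0.1), lam_assumed(n, 0.1))
# n = 1 agrees exactly; n = 2, 3 agree only to O(t), and the discrepancy
# grows with time.
```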

B2 When m=1

In this case, we find

(B5a) \phi = \phi_0 e^{t} ,
(B5b) \phi_0 = \phi e^{-t} ,
(B5c) \frac{\mathrm{d}\phi_0}{\mathrm{d}\phi} = e^{-t} .

From these relations, we find that this system evolves exactly as prescribed by Eq. (59e), maintaining the exponential distribution.

B3 When m>1

In this case, the exact solution of the system is

(B6) \phi = \left[ \phi_0^{-(m-1)} - (m - 1) t \right]^{-1/(m-1)} ,

and it follows that

(B7a) \phi_0 = \left[ \phi^{-(m-1)} + (m - 1) t \right]^{-1/(m-1)} ,
(B7b) \frac{\mathrm{d}\phi_0}{\mathrm{d}\phi} = \frac{1}{\phi^m} \left[ \phi^{-(m-1)} + (m - 1) t \right]^{-m/(m-1)} .

By substituting Eqs. (B7a) and (B7b) into Eq. (B1), an explicit form of the PDF evolution is obtained.

Note that the solution (Eq. B6) becomes singular at t = \phi_0^{-(m-1)}/(m-1), where ϕ → +∞. Due to this tendency, the distribution presents an increasingly long tail with time, with an increasingly significant deviation from an exponential distribution. This tendency is also aggravated with increasing n; thus the estimate of λ1 based on the constraint, σ1 = ⟨ϕ^n⟩, also deteriorates faster with time with increasing n (cf. Eqs. 59c and 59d), as seen in Fig. 2.
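The closed-form solution (Eq. B6) and its finite-time singularity can be verified against a direct numerical integration; the sketch below is our own check with a standard fourth-order Runge-Kutta scheme and illustrative parameters.

```python
def phi_exact(phi0, t, m):
    """Eq. (B6): closed-form solution of d(phi)/dt = phi^m for m > 1,
    valid below the blow-up time t* = phi0^{-(m-1)}/(m-1)."""
    return (phi0 ** (-(m - 1)) - (m - 1) * t) ** (-1.0 / (m - 1))

def phi_rk4(phi0, t, m, steps=10000):
    """Fourth-order Runge-Kutta integration of the same system."""
    h, phi = t / steps, phi0
    for _ in range(steps):
        k1 = phi ** m
        k2 = (phi + 0.5 * h * k1) ** m
        k3 = (phi + 0.5 * h * k2) ** m
        k4 = (phi + h * k3) ** m
        phi += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return phi

# m = 2 with phi0 = 1 blows up at t* = 1; at t = 0.5 both evaluations give 2:
print(phi_exact(1.0, 0.5, 2), phi_rk4(1.0, 0.5, 2))
```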

Appendix C: Further demonstrative examples

This appendix further compares the exact and assumed-PDF solutions for three simple dynamical systems whose basic behaviors can be inferred from their forcing forms. These comparisons follow on from the demonstration of the assumed-PDF approach for a simple physical system in Sect. 6 and further expose basic characteristics of the assumed-PDF solutions.

As simple dynamical systems, we consider the following three forms of forcing:

  • (i)

    F = −ϕ: the only stable fixed point of this system is ϕ = 0, and the system approaches this fixed point regardless of the initial condition, ϕ0, with ϕ = ϕ0 e^{−t}.

  • (ii)

    F = ϕ(ϕ−1): the system has a stable fixed point at ϕ = 0 and an unstable fixed point at ϕ = 1. Every initial point below ϕ = 1 exponentially approaches ϕ = 0, and every initial point above ϕ = 1 diverges to infinity in finite time. An explicit solution is

    (C1) \phi = \frac{\phi_0}{\phi_0 - (\phi_0 - 1) e^{t}} ,

    with ϕ0 being the initial condition.

  • (iii)

    F = −ϕ(ϕ−1)(ϕ−2): the system has two stable fixed points at ϕ = 0 and ϕ = 2 and an unstable fixed point at ϕ = 1. Every initial point below ϕ = 1 exponentially approaches ϕ = 0, and every initial point above ϕ = 1 exponentially approaches ϕ = 2. An explicit solution is

    (C2a) \phi = 1 \pm \left( 1 - A e^{-2t} \right)^{-1/2} ,

    where

    (C2b) A = \frac{\phi_0 (\phi_0 - 2)}{(\phi_0 - 1)^2}

    and the sign in the solution is chosen by the sign of ϕ0−1.
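The closed-form solutions quoted above can be checked against a standard numerical integration; the following sketch is our own verification with illustrative initial values.

```python
import math

# Forcings (i)-(iii) and their closed-form solutions.
def F1(p): return -p
def F2(p): return p * (p - 1.0)
def F3(p): return -p * (p - 1.0) * (p - 2.0)

def sol1(p0, t):  # phi = phi0 e^{-t}
    return p0 * math.exp(-t)

def sol2(p0, t):  # Eq. (C1)
    return p0 / (p0 - (p0 - 1.0) * math.exp(t))

def sol3(p0, t):  # Eqs. (C2a)-(C2b); the sign follows the sign of phi0 - 1
    A = p0 * (p0 - 2.0) / (p0 - 1.0) ** 2
    s = 1.0 if p0 > 1.0 else -1.0
    return 1.0 + s * (1.0 - A * math.exp(-2.0 * t)) ** -0.5

def rk4(F, p0, t, steps=20000):
    """Fourth-order Runge-Kutta integration of d(phi)/dt = F(phi)."""
    h, p = t / steps, p0
    for _ in range(steps):
        k1 = F(p); k2 = F(p + 0.5 * h * k1)
        k3 = F(p + 0.5 * h * k2); k4 = F(p + h * k3)
        p += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return p

for F, sol, p0 in ((F1, sol1, 0.7), (F2, sol2, 0.7), (F3, sol3, 1.2)):
    print(sol(p0, 1.0), rk4(F, p0, 1.0))  # each pair agrees closely
```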

We examine the evolution of these systems by initializing the distribution as a Gaussian of the form (Eq. 25) with ⟨ϕ⟩|t=0 = 1 and λ1|t=0 = 10. Here, the initial peak of the distribution is taken to be sharp enough that we can focus on the evolution of the system initialized in the vicinity of ϕ = 1. This peak point is not a stable fixed point for any of these systems; more precisely, it is an unstable fixed point for the latter two. Thus, the evolution is expected to move away from the initial peak in all three cases.

The exact evolutions of those three systems with this initial distribution can be derived using the following relations:

(C3) (i) \phi_0 = \phi e^{t} , \quad \mathrm{d}\phi_0 / \mathrm{d}\phi = e^{t} .

(C4a) (ii) \phi_0 = \frac{\phi}{\phi - (\phi - 1) e^{-t}} ,
(C4b) \frac{\mathrm{d}\phi_0}{\mathrm{d}\phi} = \frac{e^{-t}}{\left[ \phi - (\phi - 1) e^{-t} \right]^2} .

(C5a) (iii) \phi_0 = 1 \pm \left( 1 - B e^{2t} \right)^{-1/2} ,
(C5b) \frac{\mathrm{d}\phi_0}{\mathrm{d}\phi} = \pm \frac{e^{2t}}{(\phi - 1)^3} \left( 1 - B e^{2t} \right)^{-3/2} ,

with

(C5c) B = \frac{\phi (\phi - 2)}{(\phi - 1)^2} .

The exact evolutions of the distributions are obtained by substituting these relations into Eq. (B1).

Evolution of a distribution under the assumed PDF (i.e., Gaussian) is evaluated in terms of the evolutions of the two PDF parameters, ⟨ϕ⟩ and λ1, as given by Eqs. (66a) and (66b). For this purpose, the following relations for the three systems are invoked:

(C6a) (i) \langle F \rangle = -\langle \phi \rangle ,
(C6b) \langle (\phi - \langle \phi \rangle) F \rangle = -\frac{1}{2\lambda} .

(C7a) (ii) \langle F \rangle = \frac{1}{2\lambda} + F(\langle \phi \rangle) ,
(C7b) \langle (\phi - \langle \phi \rangle) F \rangle = \frac{1}{2\lambda} \left( 3 \langle \phi \rangle - 1 \right) .

(C8a) (iii) \langle F \rangle = -\frac{3}{2\lambda} \left( \langle \phi \rangle - 1 \right) + F(\langle \phi \rangle) ,
(C8b) \langle (\phi - \langle \phi \rangle) F \rangle = -\frac{3}{2\lambda^2} + \frac{1}{\lambda} \left( 3 \langle \phi \rangle^2 - \frac{9}{2} \langle \phi \rangle + 1 \right) .
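Relations of this kind follow from expectations under the assumed Gaussian. As our own numerical check, the sketch below assumes the normalized form p(ϕ) = √(λ/π) e^{-λ(ϕ-⟨ϕ⟩)²}, whose variance is 1/(2λ), and verifies Eqs. (C6a) and (C6b) for model (i) by quadrature.

```python
import math

mu, lam = 1.0, 10.0  # PDF parameters, matching the initialization above

def gauss_expect(g, half_width=5.0, steps=100000):
    """<g(phi)> under p(phi) = sqrt(lam/pi) e^{-lam (phi-mu)^2},
    by the midpoint rule over [mu - half_width, mu + half_width]."""
    h = 2.0 * half_width / steps
    norm = math.sqrt(lam / math.pi)
    total = 0.0
    for j in range(steps):
        x = mu - half_width + (j + 0.5) * h
        total += g(x) * norm * math.exp(-lam * (x - mu) ** 2)
    return total * h

F = lambda p: -p  # model (i)
print(gauss_expect(F))                          # <F> = -<phi> = -1      (C6a)
print(gauss_expect(lambda p: (p - mu) * F(p)))  # -1/(2 lam) = -0.05     (C6b)
```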

Figures C1–C3 show the obtained evolutions of the distributions for these three systems, respectively. In each figure, the upper and lower frames, (a) and (b), show the results of the exact computations and of the assumed PDFs (Gaussian), respectively. Here, the plotting intervals are set to Δt=0.6, 0.2, and 0.3 for the three cases, respectively. Different intervals are chosen so that the sequence of the evolution of the distributions can be followed visually in each case.


Figure C1 Evolution of the distribution for model i with the interval of Δt=0.6, with the initial distribution given by the solid line, for (a) the exact solution and (b) the assumed Gaussian distribution, shown with varying curve types. In both cases, the distribution peak moves towards smaller values with time.


With the first dynamical system (i), the distribution peak moves towards ϕ=0, a stable fixed point of the system, with both exact and assumed-PDF-based calculations, as seen in Fig. C1. The exact evolution (Fig. C1a) presents a strong tendency towards the sharpening of the peak as well. This sharpening tendency is substantially weaker with the assumed-PDF calculation (Fig. C1b).

With the second system (ii), the distribution tends to spread with time in both directions: towards the stable fixed point at ϕ=0 and towards ϕ→+∞. Both calculations present the expected spreading of the distribution with time but in different manners. The exact calculation (Fig. C2a) shows that the distribution peak moves to smaller ϕ with time but, at the same time, with a tendency for a long tail to stretch towards larger ϕ. The calculation with the assumed Gaussian distribution, by design, cannot reproduce this tendency of the distribution to become skewed with time. The assumed-PDF calculation (Fig. C2b) instead shows the distribution spreading as its peak moves towards larger ϕ with time. The spreading tendency is also less dramatic.


Figure C2 The same as Fig. C1 but for model ii with the interval of Δt=0.2. With this model, the peak moves towards smaller values with the exact distribution and towards larger values with the assumed distribution.


With the third dynamical system (iii), the system initialized in the vicinity of the unstable fixed point, ϕ=1, tends to evolve towards either of the two stable fixed points at ϕ=0 and ϕ=2. As a result, as the exact calculation shows (Fig. C3a), an initial distribution peaked at ϕ=1 splits with time into two peaks centered close to those two stable fixed points. Of course, it is not possible to reproduce such a tendency with a Gaussian as the assumed form. Thus, under the assumed-PDF calculation (Fig. C3b), we only see a tendency for the distribution to spread gradually with time. One may also note that the rate of spread is not sufficient to reproduce the drastic tendency towards two peaks seen in the exact calculation.


Figure C3 The same as Fig. C1 but for model iii with the interval of Δt=0.4. The distribution peak monotonically decreases with time in both cases.


From the overview so far, it is clear that the assumed-PDF approach reproduces the actual evolution of the distribution unsatisfactorily, even qualitatively. However, as emphasized in the main text, the purpose of the assumed PDF is not to predict the evolution of the whole distribution. Its purpose is solely to reproduce the limited set of statistics specified under the output-constrained distribution principle. More specifically, under the given assumed-PDF formulation with the Gaussian, the sole goal is to predict the mean and the variance consistently. Thus, the main question to be addressed is how well those statistics are predicted.

To address this question, Fig. C4a and b plot the time series of the mean and the standard deviation, respectively, with the assumed-PDF (black) and exact (green) calculations. The curves for models i, ii, and iii are shown by the solid, long-dash, and short-dash curves, respectively. Here, the assumed PDF predicts the means of models i and iii perfectly. The predictions of the standard deviations of these two models are overall consistent with the exact results, although the predicted tendencies are less pronounced than the exact ones, especially with model i. On the other hand, the agreement of the evolution of the standard deviation for model iii (short dash) is rather remarkable, considering the fact that the assumed PDF does not reproduce the overall evolution of the distribution even qualitatively. This provides another example, in addition to the case of Sect. 4.4.1, demonstrating that it is not indispensable to fit the distribution quantitatively well in order to obtain a realistic prediction of a required statistic.


Figure C4 Statistics of the dynamical systems under consideration: the (a) average and (b) standard deviation for models i (solid), ii (long dash), and iii (short dash). Black curves are the results with the assumed PDF, and green ones are the results from direct integrals of the exact solutions. Here, the means for models i and iii with the assumed PDF agree perfectly with the exact results.


In the case of model ii, the calculations with the exact distribution have their own problem: the numerical integral of the distribution is extremely sensitive to the integration range and the total number of points used. In Fig. C4, the integral is performed over the range of −49 to 51 with 10^4 points with respect to ϕ0. Examination of these sensitivities shows that the overall behavior of the curve (long-dash line in green) is still correct, although the random-looking oscillatory behavior changes sensitively with the integration range and the number of points adopted. This inherent difficulty fundamentally stems from the fact that the distribution presents an increasingly long tail towards the positive direction with time, as seen in Fig. C2a. This tendency is reflected in the singularities in Eqs. (C4a) and (C4b), which lead to a more direct difficulty in the integrals.

Comparing the curves (long dash) for model ii with the exact (green) and assumed-PDF (black) calculations, we conclude that the assumed-PDF approach predicts the tendencies of the statistics (mean and variance) only up to t≃0.5; beyond that point, the actual evolution begins to deviate dramatically from the monotonically increasing tendencies predicted by the assumed-PDF method. The exact calculation shows that, due to the shift of the peak towards smaller values with time, both the average and the standard deviation begin to decrease after t≃0.5. This later transition simply fails to be captured by the assumed-PDF solution. Here, the singularities in Eqs. (C4a) and (C4b) lead to a divergence of both the mean and the standard deviation in finite time.

Code availability

All the Fortran codes used in the present study are available upon request to the first author.

Data availability

The present study uses no data, whether numerically generated or observationally obtained.

Author contributions

The three authors contributed equally to the conceptualization, formal analysis, investigation, methodology, and writing.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

Jun-Ichi Yano thanks Wim Verkley for carefully reading an earlier version of this paper. Thanks are also due to Craig Bishop for his active inputs.

Review statement

This paper was edited by Franziska Glassmeier and reviewed by two anonymous referees.

References

Bannister, R. N.: A review of operational methods of variational and ensemble-variational data assimilation, Q. J. Roy. Meteor. Soc., 143, 607–633, https://doi.org/10.1002/qj.2982, 2017. a

Bechtold, P., Fravalo, C., and Pinty, J. P.: A model of marine boundary–layer cloudiness for mesoscale applications, J. Atmos. Sci., 49, 1723–1744, https://doi.org/10.1175/1520-0469(1992)049<1723:AMOMBL>2.0.CO;2, 1992. a

Bechtold, P., Cuijpers, J. W. M., Mascart, P., and Trouilhet, P.: Modeling of trade–wind cumuli with a low–order turbulence model – Towards a unified description of Cu and Sc clouds in meteorological models, J. Atmos. Sci., 52, 455–463, https://doi.org/10.1175/1520-0469(1995)052<0455:MOTWCW>2.0.CO;2, 1995. a

Bender, C. M. and Orszag, S. A.: Advanced Mathematical Methods for Scientists and Engineers, McGraw-Hill, New York, 593 pp., ISBN 978-1-4757-3069-2, 1978. a

Bernardo, J. M. and Smith, A. F. M.: Bayesian Theory, John Wiley & Sons, Chichester, 586 pp., ISBN 0 471 92416 4, 1997. a

Berner, J., Achatz, U., Batte, L., Bengtsson, L., de la Cámara, A., Crommelin, D., Christensen, H., Colangeli, M., Dolaptchiev, S., Franzke, C. L. E., Friederichs, P., Imkeller, P., Järvinen, H., Juricke, S., Kitsios, V., Lott, F., Lucarini, V., Mahajan, S., Palmer, T. N., Penland, C., von Storch, J.-S., Sakradžija, M., Weniger, M., Weisheimer, A., Williams, P. D., and Yano, J.-I.: Stochastic Parameterization: Towards a new view of weather and climate models, B. Am. Meteorol. Soc., 98, 565–588, https://doi.org/10.1175/BAMS-D-15-00268.1, 2017. a

Bishop, C. H.: The GIGG-EnKF: ensemble Kalman filtering for highly skewed non-negative uncertainty distributions, Q. J. Roy. Meteor. Soc., 142, 1395–1412, https://doi.org/10.1002/qj.2742, 2016. a

Bony, S. and Emanuel, K. A.: A parameterization of the cloudiness associated with cumulus convection: Evaluation using TOGA COARE data, J. Atmos. Sci., 58, 3158–3183, https://doi.org/10.1175/1520-0469(2001)058<3158:APOTCA>2.0.CO;2, 2001. a

Bougeault, P.: Modeling the trade–wind cumulus boundary–layer. Part I: Testing the ensemble cloud relations against numerical data, J. Atmos. Sci., 38, 2414–2428, https://doi.org/10.1175/1520-0469(1981)038<2414:MTTWCB>2.0.CO;2, 1981. a

Butler, R. W.: Saddlepoint Approximations with Applications, Cambridge University Press, https://doi.org/10.1017/CBO9780511619083, 2007. a

Carrassi, A., Bocquet, M., Bertino, L., and Evensen, G.: Data assimilation in the geosciences: An overview of methods, issues, and perspectives, WIREs Clim. Change, 9, e535, https://doi.org/10.1002/wcc.535, 2018. a, b

Dang, C. and Xu, J.: Novel algorithm for reconstruction of a distribution by fitting its first-four statistical moments, App. Math. Model., 71, 505–524, https://doi.org/10.1016/j.apm.2019.02.040, 2019. a

Daniels, H. E.: Saddlepoint Approximations in Statistics, Ann. Math. Stat., 25, 631–650, https://doi.org/10.1214/aoms/1177728652, 1954. a

Ehrendorfer, M.: The Liouville equation and its potential usefulness for the prediction of forecast skill. Part I: Theory, Mon. Weather Rev., 122, 703–713, https://doi.org/10.1175/1520-0493(1994)122<0703:TLEAIP>2.0.CO;2, 1994a. a

Ehrendorfer, M.: The Liouville equation and its potential usefulness for the prediction of forecast skill. Part II: Applications, Mon. Weather Rev., 122, 714–728, https://doi.org/10.1175/1520-0493(1994)122<0714:TLEAIP>2.0.CO;2, 1994b. a

Ehrendorfer, M.: The Liouville equation and atmospheric predictability, in: Predictability of Weather and Climate, edited by: Palmer, T. and Hagedorn, R., Cambridge University Press, Cambridge, 59–98, https://doi.org/10.1017/CBO9780511617652.005, 2006. a

Feller, W.: An Introduction to Probability Theory and Its Applications, Vol. 1, 3rd edn., John Wiley and Sons, U. K., 509 pp., ISBN-10 0471257087, 1968. a

Fitch, A. C.: An improved double-Gaussian closure for the subgrid vertical velocity probability distribution function, J. Atmos. Sci., 76, 285–304, https://doi.org/10.1175/JAS-D-18-0149.1, 2019. a, b, c

Garratt, J. R.: The Atmospheric Boundary Layer, Cambridge University Press, U. K., 316 pp., ISBN 0 521 38052 9, 1992. a

Garrett, T. J.: Analytical solutions for precipitation size distributions at steady state, J. Atmos. Sci., 76, 1031–1037, https://doi.org/10.1175/JAS-D-18-0309.1, 2019. a

Gentle, J. E.: Random Number Generation and Monte Carlo Methods, 2nd edn., Springer, Berlin, ISBN-10 9780387001784, 2003. a

Golaz, J.-C., Larson, V. E., and Cotton, W. R.: A PDF–based model for boundary layer clouds. Part I: Method and model description, J. Atmos. Sci., 59, 3540–3551, https://doi.org/10.1175/1520-0469(2002)059<3519:SSAMVI>2.0.CO;2, 2002. a, b, c, d, e

Goldstein, H., Poole, C., and Safko, J.: Classical Mechanics, 3rd edn., Addison Wesley, San Francisco, 638 pp., ISBN-10 9780201657029, 2002. a

Goodfellow, I., Bengio, Y., and Courville, A.: Deep Learning, MIT Press, Cambridge, MA, 785 pp., ISBN 9780262035613, 2016. a

Gregory, P.: Bayesian Logical Data Analysis for the Physical Sciences, Cambridge University Press, Cambridge, 468 pp., ISBN 0 521 84150 X, 2005. a

Guiasu, S.: Information Theory with Applications, McGraw Hill, New York, 439 pp., ISBN 10: 0070251096, 1977. a

Hermoso, A., Hommar, V., and Yano, J.-I.: Exploring the limits of ensemble forecasting via solutions of the Liouville equation for realistic geophysical models, Atmos. Res., 246, 105127, https://doi.org/10.1016/j.atmosres.2020.105127, 2020. a

Jaynes, E. T.: Where do we stand on maximum entropy?, in: The Maximum Entropy Formalism, edited by: Levine, R. D. and Tribus, M., MIT Press, Cambridge, MA, 15–118, http://philsci-archive.pitt.edu/22626/1/MaximalEntropy.pdf (last access: 25 August 2025), 1978. a

Jaynes, E. T.: Probability Theory, The Logic of Science, Cambridge University Press, Cambridge, UK, 725 pp., ISBN-10 0521592712, 2003. a

Jazwinski, A. H.: Stochastic Processes and Filtering Theory, Academic Press, New York, 376 pp., ISBN: 9780080960906, 1970. a

Kapur, J. N.: Maximum–Entropy Models in Science and Engineering, John Wiley & Sons, U. K., 643 pp., ISBN 9780470214596, 1989. a

Khain, A. P. and Pinsky, M.: Physical Processes in Clouds and Cloud Modeling, Cambridge University Press, Cambridge, 626 pp., ISBN 9780521767439, 2018. a, b

Khain, A. P., Beheng, K. D., Heymsfield, A., Korolev, A., Krichak, S. O., Levin, Z., Pinsky, M., Phillips, V., Prabhakaran, T., Teller, A., van den Heever, S. C., and Yano, J. I.: Representation of microphysical processes in cloud–resolving models: spectral (bin) microphysics vs. bulk–microphysics, Rev. Geophys., 53, 247–322, https://doi.org/10.1002/2014RG000468, 2015. a, b, c, d, e

Klimenko, A. Y. and Bilger, R. W.: Conditional moment closure for turbulent combustion, Prog. Energ. Combust., 25, 596–687, https://doi.org/10.1016/S0360-1285(99)00006-4, 1999. a

Larson, V. E.: Prognostic equations for cloud fraction and liquid water, and their relation to filtered density functions, J. Atmos. Sci., 61, 338–351, https://doi.org/10.1175/1520-0469(2004)061<0338:PEFCFA>2.0.CO;2, 2004. a

Larson, V. E.: CLUBB-SILHS: A parameterization of subgrid variability in the atmosphere, arXiv [preprint], https://doi.org/10.48550/arXiv.1711.03675, 26 March 2022. a, b, c

Larson, V. E. and Golaz, J.: Using probability density functions to derive consistent closure relationships among higher-order Moments, Mon. Weather Rev., 133, 1023–1042, https://doi.org/10.1175/MWR2902.1, 2005. a

Larson, V. E. and Schanen, D. P.: The Subgrid Importance Latin Hypercube Sampler (SILHS): a multivariate subcolumn generator, Geosci. Model Dev., 6, 1813–1829, https://doi.org/10.5194/gmd-6-1813-2013, 2013. a

Larson, V. E., Golaz, J. C., and Cotton, W. R.: Small–scale and mesoscale variability in cloudy boundary layers: Joint probability density functions, J. Atmos. Sci., 59, 3519–3539, https://doi.org/10.1175/1520-0469(2002)059<3519:SSAMVI>2.0.CO;2, 2002. a, b

Larson, V. E., Domke, S., and Griffin, B. M.: Momentum transport in shallow cumulus clouds and its parameterization by higher-order closure, J. Adv. Model. Earth Sy., 11, 3419–3442, https://doi.org/10.1029/2019MS001743, 2019.

Le Treut, H. and Li, Z. X.: Sensitivity of an atmospheric general circulation model to prescribed SST changes: Feedback effects associated with the simulation of cloud optical properties, Clim. Dynam., 5, 175–187, https://doi.org/10.1007/BF00251808, 1991.

Lewellen, W. S. and Yoh, S.: Binormal model of ensemble partial cloudiness, J. Atmos. Sci., 50, 1228–1237, https://doi.org/10.1175/1520-0469(1993)050<1228:BMOEPC>2.0.CO;2, 1993.

Lockwood, F. C. and Naguib, A. S.: The prediction of the fluctuations in the properties of free, round-jet, turbulent, diffusion flames, Combust. Flame, 24, 109–124, https://doi.org/10.1016/0010-2180(75)90133-9, 1975.

Machulskaya, E.: Clouds and convection as subgrid-scale distributions, in: Parameterization of Atmospheric Convection, Volume II, edited by: Plant, R. S. and Yano, J.-I., World Scientific, Imperial College Press, U. K., 377–422, https://doi.org/10.1142/p1005, 2015.

Marshall, J. S. and Palmer, W. M. K.: The distribution of raindrops with size, J. Meteorol., 5, 165–166, 1948.

Mellor, G. L.: Analytic prediction of the properties of stratified planetary surface layers, J. Atmos. Sci., 30, 1061–1069, https://doi.org/10.1175/1520-0469(1973)030<1061:APOTPO>2.0.CO;2, 1973.

Mellor, G. L.: The Gaussian cloud model relations, J. Atmos. Sci., 34, 356–358, https://doi.org/10.1175/1520-0469(1977)034<0356:TGCMR>2.0.CO;2, 1977.

Mellor, G. L. and Yamada, T.: A hierarchy of turbulence closure models for planetary boundary layers, J. Atmos. Sci., 31, 1791–1806, https://doi.org/10.1175/1520-0469(1974)031<1791:AHOTCM>2.0.CO;2, 1974.

Milbrandt, J. A. and Yau, M. K.: A multimoment bulk microphysics parameterization. Part I: Analysis of the role of the spectrum shape parameter, J. Atmos. Sci., 62, 3051–3064, https://doi.org/10.1175/JAS3534.1, 2005.

Naumann, A. K., Seifert, A., and Mellado, J. P.: A refined statistical cloud closure using double-Gaussian probability density functions, Geosci. Model Dev., 6, 1641–1657, https://doi.org/10.5194/gmd-6-1641-2013, 2013.

Pope, S. B.: The statistical theory of turbulent flames, Philos. T. R. Soc. S.-A, 291, 529–568, 1979.

Pope, S. B.: PDF methods for turbulent reactive flows, Prog. Energ. Combust., 11, 119–192, https://doi.org/10.1016/0360-1285(85)90002-4, 1985.

Richard, J. L. and Royer, J. F.: A statistical cloud scheme for use in an AGCM, Ann. Geophys., 11, 1095–1115, 1993.

Risken, H.: The Fokker–Planck Equation, Springer-Verlag, Berlin, 454 pp., ISBN 0387504982, 1984.

Robert, R. and Sommeria, J.: Statistical equilibrium states for two-dimensional flows, J. Fluid Mech., 229, 291–310, https://doi.org/10.1017/S0022112091003038, 1991.

Rogers, R. R. and Yau, M. K.: A Short Course in Cloud Physics, 3rd edn., Pergamon Press, Oxford, 290 pp., ISBN 9780750632157, 1989.

Seifert, A. and Beheng, K. D.: A double-moment parameterization for simulating autoconversion, accretion and selfcollection, Atmos. Res., 59–60, 265–281, https://doi.org/10.1016/S0169-8095(01)00126-0, 2001.

Seifert, A. and Beheng, K. D.: A two-moment cloud microphysics parameterization for mixed-phase clouds. Part 1: Model description, Meteorol. Atmos. Phys., 92, 45–66, https://doi.org/10.1007/s00703-005-0112-4, 2006.

Shannon, C. E.: A mathematical theory of communication, Bell Syst. Tech. J., 27, 379–423, 623–656, 1948.

Sommeria, G. and Deardorff, J. W.: Subgrid-scale condensation in models of nonprecipitating clouds, J. Atmos. Sci., 34, 344–355, https://doi.org/10.1175/1520-0469(1977)034<0344:SSCIMO>2.0.CO;2, 1977.

Stull, R. B.: An Introduction to Boundary Layer Meteorology, Kluwer Academic Publishers, Dordrecht, 666 pp., ISBN 978-90-277-2768-8, 1988.

Tompkins, A. M.: A prognostic parameterization for the subgrid-scale variability of water vapor and clouds in large-scale models and its use to diagnose cloud cover, J. Atmos. Sci., 59, 1917–1942, https://doi.org/10.1175/1520-0469(2002)059<1917:APPFTS>2.0.CO;2, 2002.

Touchette, H.: The large deviation approach to statistical mechanics, Phys. Rep., 478, 1–69, https://doi.org/10.1016/j.physrep.2009.05.002, 2009.

Verkley, W.: A maximum entropy approach to the problem of parameterization, Q. J. Roy. Meteor. Soc., 137, 1872–1886, https://doi.org/10.1002/qj.860, 2011.

Verkley, W. and Lynch, P.: Energy and enstrophy spectra of geostrophic turbulent flows derived from a maximum entropy principle, J. Atmos. Sci., 66, 2216–2236, https://doi.org/10.1175/2009JAS2889.1, 2009.

Verkley, W., Kalverla, P., and Severijns, C.: A maximum entropy approach to the parameterization of subgrid-scale processes in two-dimensional flow, Q. J. Roy. Meteor. Soc., 142, 2273–2283, https://doi.org/10.1002/qj.2817, 2016.

Wikle, C. K. and Berliner, L. M.: A Bayesian tutorial for data assimilation, Physica D, 230, 1–16, 2007.

Wonnacott, T. H. and Wonnacott, R. J.: Introductory Statistics, John Wiley & Sons, New York, 402 pp., ISBN 9780471615187, 1969.

Yanai, M., Esbensen, S., and Chu, J.-H.: Determination of bulk properties of tropical cloud clusters from large-scale heat and moisture budgets, J. Atmos. Sci., 30, 611–627, 1973. 

Yano, J.-I.: Formulation structure of mass-flux convection parameterization, Dynam. Atmos. Oceans, 67, 1–28, https://doi.org/10.1016/j.dynatmoce.2014.04.002, 2014.

Yano, J.-I.: Scale separation, in: Parameterization of Atmospheric Convection, Vol. I, edited by: Plant, R. S. and Yano, J.-I., World Scientific, Imperial College Press, U. K., 73–99, https://doi.org/10.1142/p1005, 2015a.

Yano, J.-I.: Subgrid-scale parameterization problem, in: Parameterization of Atmospheric Convection, Vol. I, edited by: Plant, R. S. and Yano, J.-I., World Scientific, Imperial College Press, U. K., https://doi.org/10.1142/p1005, 2015b.

Yano, J. I.: Subgrid-scale physical parameterization in atmospheric modelling: How can we make it consistent?, J. Phys. A-Math. Theor., 49, 284001, https://doi.org/10.1088/1751-8113/49/28/284001, 2016.

Yano, J.-I.: What is the Maximum Entropy Principle?: Comments on “Statistical Theory on the Functional Form of Cloud Particle Size Distributions”, J. Atmos. Sci., 76, 3955–3960, https://doi.org/10.1175/JAS-D-18-0223.1, 2019.

Yano, J.-I.: Prognostic assumed-probability-density-function (distribution density function) approach: further generalization and demonstrations, Nonlin. Processes Geophys., 31, 359–380, https://doi.org/10.5194/npg-31-359-2024, 2024.

Yano, J.-I. and Manzato, A.: Does more moisture in the atmosphere lead to more intense rains?, J. Atmos. Sci., 79, 663–681, https://doi.org/10.1175/JAS-D-21-0117.1, 2022.

Yano, J.-I. and Ouchtar, E.: Convective initiation uncertainties without trigger or stochasticity: probabilistic description by the Liouville equation and Bayes' theorem, Q. J. Roy. Meteor. Soc., 143, 2015–2035, https://doi.org/10.1002/qj.3064, 2017.

Yano, J.-I., Redelsperger, J.-L., Guichard, F., and Bechtold, P.: Mode decomposition as a methodology for developing convective-scale representations in global models, Q. J. Roy. Meteor. Soc., 131, 2313–2336, https://doi.org/10.1256/qj.04.44, 2005.

Yano, J.-I., Geleyn, J.-F., Köhler, M., Mironov, D., Quaas, J., Soares, P. M. M., Phillips, V. T. J., Plant, R. S., Deluca, A., Marquet, P., Stulic, L., and Fuchs, Z.: Basic Concepts for Convection Parameterization in Weather Forecast and Climate Models: COST Action ES0905 Final Report, Atmosphere-Basel, 6, 88–147, 2014.

Yano, J.-I., Heymsfield, A. J., and Phillips, V. T. J.: Size distributions of hydrometeors: Analysis with the maximum entropy principle, J. Atmos. Sci., 73, 95–108, https://doi.org/10.1175/JAS-D-15-0097.1, 2016.

Yano, J.-I., Ziemiański, M., Cullen, M., Termonia, P., Onvlee, J., Bengtsson, L., Carrassi, A., Davy, R., Deluca, A., Gray, S. L., Homar, V., Köhler, M., Krichak, S., Michaelides, S., Phillips, V. T. J., Soares, P. M. M., and Wyszogrodzki, A.: Scientific Challenges of Convective-Scale Numerical Weather Prediction, B. Am. Meteorol. Soc., 99, 699–710, https://doi.org/10.1175/bams-d-17-0125.1, 2018.

1

In the literature, higher moments (n≥2) are often defined in terms of a deviation from the mean (i.e., the first moment). Here, the definition is presented without discriminating between the first and the higher moments for mathematical lucidity.
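For concreteness, the two conventions can be written side by side (a sketch in generic notation, with x the distributed variable and P(x) its distribution; the symbols are illustrative, not the paper's specific ones):

```latex
% Raw (non-central) moment of order n, as adopted here for all n:
\mu'_n = \int x^n \, P(x) \, \mathrm{d}x
% Central moment of order n (deviation from the mean), the common
% convention in the literature for n >= 2:
\mu_n = \int \left( x - \mu'_1 \right)^n P(x) \, \mathrm{d}x
```

Either set determines the other, so treating all orders uniformly as raw moments loses no information.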

2

However, this entropy should not be confused with thermodynamic entropy (Jaynes1978).

3

A standard argument for adopting the sixth moment, r^6, as a prognostic variable for predicting the PSD is that it describes the spread of the PSD well. In distribution problems, on the other hand, the standard measure of spread is the variance; thus a given distribution should be constrained by the second moment, r^2, rather than by the sixth moment, r^6. The choice of σ = r^6 would be justified only when a mass distribution rather than a size distribution is considered, because in that case the variance is defined by σ = m^2 ∝ r^6, where m is the particle mass. On the other hand, from the point of view of the output-constrained distribution principle, what matters is to predict the set of output variables required by the host model, rather than simply to predict the spread accurately. In this respect, as already pointed out, the reflectivity, r^6, is usually not a variable that is directly required within a cloud model. Probably the most important process to be predicted is coalescence, which is, very crudely speaking, controlled by n_c^2; thus a weight to adopt would be σ = n_c, noting that there is already a factor, n_c, in the definition of the integral with σ. For precipitating particles, the same would apply to the sedimentation rate, which is proportional to a certain power, say a, of the particle size, r; then σ = r^a would be the choice.
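The weight choices discussed in this footnote all fit a single integral form (a sketch assuming a size distribution n(r) over particle radius r and the σ-weighted output integrals of the main text; the listing of weights is illustrative):

```latex
% Generic output variable obtained from the size distribution n(r)
% with weight sigma(r):
F_{\sigma} = \int_{0}^{\infty} \sigma(r)\, n(r)\, \mathrm{d}r
% Weights mentioned in the text:
%   reflectivity:          \sigma = r^{6}
%   variance constraint:   \sigma = r^{2}
%   sedimentation rate:    \sigma = r^{a} \quad (a:\ \text{fall-speed power})
```

Under the output-constrained distribution principle, the constraints on the PDF are chosen to be exactly those F_σ that the host model consumes.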

Short summary
Distribution problems appear in almost every corner of atmospheric science when describing diverse processes. This paper presents a general formulation for addressing all of these problems.