
61
Goodfellow et al. pioneered the concept of
Generative Adversarial Networks (GANs) as an
adversarial process (Sharma et al. 2024). This
framework involves the simultaneous training of
two models: a Generator and a Discriminator. As
depicted in Figure 1, the Generator serves as a
generative model designed to approximate the
data distribution, while the Discriminator acts as
a discriminative model tasked with estimating the
probability that a given sample originates from the
training data rather than the Generator (Nayak et
al., 2024; Yadav et al., 2023; Dutta et al., 2020).
One of the most prevalent applications of GANs is
in privacy protection, where they create synthetic
datasets that mimic the statistical properties of
original data without exposing sensitive information
(Choi et al., 2017).
Beyond GANs, alternative methods exist for
generating statistically synthetic data. Ping et
al. demonstrated the utility of Bayesian models
for capturing the relationships within synthetic
data generation frameworks (Hindistan & Yetkin,
2023). However, the primary advantage of GANs
over traditional statistical approaches lies in their
superior capability to approximate real-world data
distributions. Xu and Veeramachaneni (2023)
highlighted the potential of GANs in producing
high-quality synthetic datasets benecial
for data science applications. For instance,
techniques such as Recurrent Conditional GANs
(RCGANs) (Yilmaz & Korn, 2022), Time-Series
GANs (TimeGANs) (Esteban et al., 2017), and
Wasserstein-based models, including Conditional
Wasserstein GANs (CWGANs) (Arjovsky, 2017)
and Recurrent Conditional Wasserstein GANs
(RCWGANs), have been explored for generating
synthetic data with high delity.
Traditional methods like ARIMA or recurrent neural
networks (RNNs) have also been applied to synthetic
data generation but often fall short in capturing
complex, nonlinear relationships. GANs have
emerged as a robust alternative, nding applications
in sectors such as healthcare and cybersecurity.
However, their integration into the energy sector
remains at an early stage (Fekri, 2020).
Amasyali and El-Gohary (2018) conducted
an extensive review of energy forecasting
methodologies, reporting that 67% of the
analyzed studies utilized real data, 19% employed
simulated data, and 14% relied on publicly
available reference datasets. This reliance on real
data underscores the importance of historical
records and highlights the urgent need to develop
larger, high-quality datasets to advance energy
prediction capabilities. Although some real
datasets are publicly accessible, many studies
depend on private, proprietary data derived from
real-world scenarios (Sehovac & Grolinger, 2019).
In their review, Amasyali and El-Gohary (2018)
emphasized the role of simulation-based
approaches using tools such as EnergyPlus,
eQUEST, and Ecotect. These physical models
estimate energy consumption based on detailed
environmental and building characteristics.
However, acquiring such granular information
is often impractical. In contrast, data-driven
approaches leverage sensor-derived data and do
not require the same level of specicity. Simulation
techniques are predominantly utilized in the
design phase, whereas data-driven methods are
more commonly applied to demand and supply
management scenarios. Both approaches are
complementary and are selected based on
the specic objectives and constraints of each
application.
Deb et al. (2017) reviewed time-series forecasting
techniques for building energy consumption and
noted the eectiveness of simulation tools like
EnergyPlus, IES, and Ecotect in modeling energy
use for new buildings. When historical data is
unavailable, simulations oer a viable alternative.
Nevertheless, accurately forecasting energy
consumption involves accounting for numerous
complex factors, such as material properties,
climate conditions, and occupant behavior. While
simulations can approximate these variables, data-
driven methods often achieve greater accuracy for
existing buildings with accessible historical data.
Lazos et al. (2014) categorized energy forecasting
approaches into statistical, machine learning, and
physics-based models. Physics-based models
provide detailed, explainable predictions without
requiring historical data but demand extensive input
on structural, thermodynamic, and operational
parameters. Modeling occupant behavior within
these systems remains a signicant challenge.
Conectando mentes, energizando el futuro