Quality Prediction of Complex Sand Castings Using Multi-Source Heterogeneous Data

In modern industrial manufacturing, sand casting remains a fundamental and irreplaceable process for producing metal components with complex geometries. Its primary advantages include exceptional design flexibility, the ability to manufacture very large parts, cost-effectiveness for medium to low volumes, and compatibility with a wide range of ferrous and non-ferrous alloys. The process involves creating a mold from a sand aggregate and pouring molten metal into its cavity. However, the final quality of a casting is highly sensitive to a multitude of interacting factors. Despite strict control over process parameters, defects such as cold shuts, porosity, sand inclusions, and shrinkage cavities frequently occur, leading to significant scrap rates, increased costs, and production delays. Therefore, accurate prediction and prevention of these sand casting defects is a critical challenge for foundries aiming to improve yield and competitiveness.

The root causes of sand casting defects are complex and multifaceted. They originate from the interplay between two primary data sources: structured process parameters and the unstructured three-dimensional (3D) geometry of the part itself. Structured data encompasses all measurable process variables, including sand properties (e.g., moisture, compactability, green strength), melting and pouring parameters (e.g., chemical composition, pouring temperature, inoculation amount), and other operational settings. Simultaneously, the 3D shape of the casting profoundly influences solidification patterns, heat distribution, and flow dynamics of the molten metal. Features like thin-to-thick section transitions, isolated hot spots, complex internal cavities, and large flat surfaces create inherent vulnerabilities to specific sand casting defects. For instance, shrinkage porosity tends to form in isolated heavy sections, while cold shuts are likely in thin-walled areas distant from the gating system.

Traditionally, quality analysis in foundries has heavily relied on expert experience and statistical process control (SPC) of structured工艺 parameters. A significant gap exists in quantitatively incorporating and analyzing the part’s geometry due to the inherent complexity of representing 3D shapes in a form suitable for computational analysis. This non-structured data is difficult to characterize with traditional metrics. Consequently, most data-driven quality models fail to account for geometric influences, limiting their predictive accuracy and making it impossible to determine why defects consistently appear at certain complex geometric locations. To achieve a breakthrough in predicting sand casting defects, it is imperative to develop methods that can fuse these two heterogeneous data streams—quantitative process parameters and qualitative geometric features—into a unified predictive model.

Analysis of Sand Casting Process and Key Data Sources

The typical sand casting process using a green sand molding line involves several sequential stages, each generating critical data. A systematic breakdown is essential for data collection.

Process Stage	Key Parameters & Data Generated	Potential Impact on Sand Casting Defects
Sand Preparation & Molding	Moisture content, compactability, green strength, permeability; Mold hardness.	Inconsistent sand properties lead to sand inclusions, erosion, veining, and gas-related porosity.
Core Making	Core sand type, binder content, curing parameters, core gas permeability.	Poor core strength causes breakages (sand defects); inadequate venting leads to core gas porosity.
Melting & Alloying	Charge materials, chemical composition (C, Si, Mn, S, P, etc.), melting temperature, holding time.	Incorrect composition affects fluidity and shrinkage behavior, leading to misruns, cold shuts, and shrinkage porosity.
Inoculation & Treatment	Inoculant type/amount, treatment method (e.g., nodularization for ductile iron), post-inoculation temperature drop.	Poor treatment results in undesired microstructure, weakened mechanical properties, and increased shrinkage tendency.
Pouring	Pouring temperature, pouring time, ladle transfer time.	Low temperature causes cold shuts and mistuns; high temperature promotes metal-mold reaction and penetration defects.
Cooling & Shakeout	Time in mold, shakeout temperature.	Premature shakeout can cause hot tears or distortions; slow cooling in certain zones can promote coarse microstructure.

The ultimate goal is to establish a per-casting traceability system. This involves linking every single casting, via a unique identifier, to the precise set of process parameters that governed its production. This is achieved by integrating data from Manufacturing Execution Systems (MES) modules like furnace log, spectral analysis, treatment station records, and final inspection logs. By cross-referencing timestamps, lot numbers, and casting IDs, a comprehensive and unique “data fingerprint” for each physical casting is created, forming the structured-data foundation for predictive modeling.

Quantitative Characterization of 3D Casting Geometry

To integrate geometric information, the 3D CAD model of the casting must be transformed into a quantifiable feature vector. Traditional handcrafted geometric descriptors often fail to capture the complex, high-level shape semantics relevant to casting solidification physics. Deep learning, particularly unsupervised learning models, offers a powerful solution for automated feature extraction from 3D data.

The core idea is to use a Deep Convolutional Autoencoder (3D-DCAE). An autoencoder is a neural network designed to learn an efficient, compressed representation (encoding) of input data by training the network to reconstruct its own input. The model consists of an encoder that compresses the input into a latent-space representation, and a decoder that reconstructs the input from this representation. The loss function is the reconstruction error. By successfully learning to reconstruct the 3D shape, the model’s latent vector (the bottleneck layer) inherently contains the essential features that define that shape.

For 3D castings, the CAD model is first voxelized, converting it into a 3D grid of binary values (1 for metal, 0 for air). This 3D voxel grid serves as the input to a 3D Convolutional Autoencoder. The encoder uses 3D convolutional and pooling layers to progressively downsample the spatial dimensions while increasing the depth (feature channels), effectively summarizing the geometric information.

The decoder mirrors this process with 3D transposed convolutional layers to upsample the latent vector back to the original voxel grid size. The training objective is to minimize the difference between the original voxel grid $\mathbf{X}$ and the reconstructed grid $\mathbf{\hat{X}}$. A common loss function is the Binary Cross-Entropy:

$$
\mathcal{L}_{recon} = -\frac{1}{N} \sum_{i=1}^{N} [x_i \log(\hat{x}_i) + (1-x_i) \log(1-\hat{x}_i)]
$$

where $N$ is the total number of voxels, $x_i$ is the original voxel value, and $\hat{x}_i$ is the reconstructed value. Through this unsupervised training, the latent vector $\mathbf{z}$ of the autoencoder becomes a powerful, low-dimensional numeric descriptor of the casting’s 3D geometry:

$$
\mathbf{z} = Encoder(\mathbf{X}) \quad \text{,} \quad \mathbf{\hat{X}} = Decoder(\mathbf{z})
$$

The superiority of a 3D-DCAE over a 2D approach (which treats the 3D model as a stack of 2D slices) is evident. The 3D convolutional kernels can capture spatial correlations in all three dimensions simultaneously, which is crucial for understanding volumetric phenomena like heat dissipation and solidification fronts. Comparative experiments show that a 3D-DCAE achieves a significantly higher reconstruction accuracy (e.g., >99.5%) compared to a 2D-DCAE, proving its superior capability in extracting meaningful 3D shape features relevant to the physical processes that cause sand casting defects.

Building a Hybrid Data-Driven Defect Prediction Model

With both data streams prepared—the structured process parameter vector $\mathbf{P}$ and the unstructured geometric feature vector $\mathbf{z}$—the next step is to fuse them into a single predictive model. The target is to predict the occurrence of one or more defect types $D$ (e.g., cold shut, porosity, sand inclusion, shrinkage) for a given casting.

The model architecture is a hybrid neural network. The processed structured parameters and the geometric latent vector are concatenated to form a combined input feature vector $\mathbf{F}$:

$$
\mathbf{F} = \mathbf{P} \oplus \mathbf{z}
$$

This fused vector is then fed into a supervised learning network designed for classification. A effective architecture involves several fully connected (dense) layers with non-linear activation functions (e.g., ReLU) to learn complex interactions between process and geometry:

$$
\begin{aligned}
\mathbf{h}_1 &= \text{ReLU}(\mathbf{W}_1 \mathbf{F} + \mathbf{b}_1) \\
\mathbf{h}_2 &= \text{ReLU}(\mathbf{W}_2 \mathbf{h}_1 + \mathbf{b}_2) \\
\vdots \\
\mathbf{y} &= \text{Softmax}(\mathbf{W}_k \mathbf{h}_{k-1} + \mathbf{b}_k)
\end{aligned}
$$

where $\mathbf{y}$ is the output vector representing the probability of each defect class. Given the typical class imbalance in defect data (few defective samples compared to good ones), the loss function must be carefully designed. A Cross-Entropy loss weighted by class frequency or a Focal Loss is often employed to prevent the model from ignoring the minority defect classes:

$$
\mathcal{L}_{Focal} = -\sum_{c=1}^{C} \alpha_c (1 – p_c)^{\gamma} y_c \log(p_c)
$$

where $C$ is the number of defect classes, $y_c$ is the true label, $p_c$ is the predicted probability, $\alpha_c$ is a class balancing weight, and $\gamma$ is a focusing parameter.

The training procedure involves splitting the per-casting traceability dataset (with fused features $\mathbf{F}$ and defect labels) into training, validation, and test sets. The model’s hyperparameters, such as layer sizes, learning rate, and batch size, are optimized on the validation set. Key training parameters are summarized below:

Parameter	Setting / Value
Data Split Ratio (Train/Validation/Test)	70% / 15% / 15%
Optimization Algorithm	Adam Optimizer
Initial Learning Rate	0.001 (with decay scheduler)
Batch Size	32 or 64
Primary Loss Function	Weighted Categorical Cross-Entropy / Focal Loss
Key Regularization	Dropout, L2 Weight Regularization

Experimental results demonstrate that the hybrid model, leveraging both process and 3D geometric data, significantly outperforms models based on process data alone. For example, a traditional model using only process parameters might achieve an accuracy of 85-90% on a test set for predicting major sand casting defects. In contrast, the proposed multi-source heterogeneous data model can improve accuracy to 93-96%, with particularly notable gains in predicting defects that are strongly geometry-dependent, such as shrinkage porosity in complex junctions or cold shuts in extended thin sections. This performance boost confirms the critical hypothesis that geometry is a fundamental, quantifiable factor in the formation of sand casting defects.

Conclusion and Outlook

This work presents a comprehensive framework for predicting sand casting defects by integrating multi-source heterogeneous data. The key innovation lies in the quantitative bridging of the gap between unstructured 3D geometric information and structured process data. By employing a 3D Deep Convolutional Autoencoder, complex casting geometries are transformed into compact, informative feature vectors that encapsulate shape characteristics critical to solidification and flow. These geometric features are then fused with traditional process parameters to train a high-performance hybrid neural network classifier.

The results affirm that a substantial portion of the variance in defect occurrence, often attributed to “special causes” or unknown interactions, can be explained and predicted by explicitly accounting for part geometry. This methodology moves quality control from a reactive, inspection-based paradigm to a proactive, prediction-driven one. Foundries can potentially use such a model in the pre-production stage to perform a “virtual quality audit” on new part designs, identifying geometric features that are high-risk for specific sand casting defects and suggesting design or process modifications before tooling is even created.

Future work will focus on enhancing the model’s interpretability (e.g., using attention mechanisms to highlight which geometric region and which process parameter most contributed to a predicted defect), extending it to predict defect severity or location, and integrating real-time sensor data from the foundry floor for dynamic quality prediction and closed-loop process control. The ultimate goal is to create a robust, intelligent system that continuously learns from production data, reducing the incidence of sand casting defects, improving yield, and driving the foundry industry toward greater efficiency and sustainability.