## 1.

## Introduction

Mask, process, and lithography-tool parameters are the most important aspects for achieving high lithography performance. Various methods have been proposed to optimize these parameters in recent years. Optical proximity correction is the main technology to optimize the mask pattern to enhance the resolution.^{1} Source mask optimization (SMO) improves the lithography performance by co-optimizing the illumination source and mask.^{2} The optimized mask structure and source shape via SMO suffers from extreme complexity, especially for the pixelated mask and source,^{3}4.^{–}^{5} which leads to difficulty in manufacturing. Furthermore, the parameters related to the process, such as the film stacks, postexposure bake (PEB), and photoresist development, have a strong impact on the process window (PW).^{6}^{,}^{7} However, most published optimization technologies were implemented under fixed process conditions.^{3}^{,}^{8}^{,}^{9} Moreover, lithography-tool parameters such as the numerical aperture (NA) and source parameter also determine the PW.^{1} Actually, the parameters related to the mask, process, and lithography tool simultaneously impact the lithography performance.^{10}^{,}^{11} The lithography effects caused by multiple parameter errors could compensate for each other,^{12} indicating that the co-optimization of multiple parameters could improve the PW. However, no effective methods have been published for co-optimizing the mask, process, and lithography-tool parameters.

In this study, the first time to our knowledge, a co-optimization method is developed to simultaneously optimize the mask, process, and lithography-tool parameters. The PW is extended by co-optimizing the parameters rather than optimizing them independently. An effective co-optimization flow is built to realize this goal. Conjugate gradient algorithm is used to calculate the search direction in optimization. The parameters are optimized along the search direction, where a line search is applied within the parameters’ boundary. Mask, process, and lithography-tool parameter co-optimization (MPLCO) is a multidimensional optimization method. As large-scale parameters are obstructed by small-scale parameters during co-optimization, we also propose a normalized conjugate gradient (NCG) algorithm to improve the convergence efficiency of the MPLCO. In order to reduce the complexity of the mask and source, a parametric mask and source are used in the MPLCO. The mask pattern is optimized by adding bias and changing the feature transmittance, while the source shape is optimized by only adjusting the sigma value in the MPLCO. As an example, we chose eight parameters in the MPLCO to optimize 45-nm node patterns. The MPLCO results show that the depth of focus (DOF) of the line-space pattern is 93.75% larger than only mask and source are optimized. At the same time, the complexity of the mask and source has been reduced compared with pixelated SMO in Ref. 3. In order to further improve the PW of the SMO work in Ref. 3, we optimize the process parameters of the SMO. The DOF of the SMO with optimized process parameters is 13.54% larger than the SMO for the line-space pattern but still smaller than that of MPLCO. The co-optimization results demonstrate that the large PW could be obtained only if the mask, process, and lithography-tool parameters are optimized simultaneously.

The remainder of the paper is organized as follows. In Sec. 2, the MPLCO is described in detail. The optimization results are presented in Sec. 3. Finally, the conclusions are summarized in Sec. 4.

## 2.

## Co-Optimization Method

The goal of MPLCO is to achieve high lithography performance in a large PW. To realize this goal, we first create an effective cost function to evaluate the lithography performance. The cost function is sufficient in promoting the optimization of the PW and benefits to the optimization procedure. Then, the parameters are optimized along the search direction iteratively until the largest PW is found.

## 2.1.

### Cost Function

The cost function is composed of the DOF, image contrast, and normalized image log-slope (NILS). Usually, the PW is measured under a designated metrology plane, where the metrology plane is a slice through the aerial image or photoresist pattern to make the measurement. The overlap of PW (common PW) through various metrology planes could evaluate the overall lithography performance. A common method for evaluating the overlap PW is to examine the DOF with specific exposure latitude (EL), and a larger DOF indicates better lithography performance.^{11}^{,}^{13} Thus, the DOF is chosen as the main objective function in the optimization. The DOF is a highly nonlinear function; it is usually been trapped at a local minimum when using a gradient-based algorithm.^{14}^{,}^{15} Some other objective functions should be used to assist in optimizing the DOF. Because the image contrast and NILS are strongly related to the DOF^{14}^{,}^{16}17.^{–}^{18} and converge much easier than DOF, they can serve as the assistant objective function in promoting the optimization of the DOF. Therefore, the cost function is formulated as a superposition of the objective functions mentioned above by assigning different weights.

## (1)

$$F=-\sum _{t=1}^{z}({w}_{1}\xb7|{P}_{\text{Contrast}}^{t}|+{w}_{2}\xb7|{P}_{\mathrm{NILS}}^{t}|)\phantom{\rule{0ex}{0ex}}-\sum _{j=1}^{m}{\epsilon}_{j}\xb7|{P}_{\mathrm{DOF}@\mathrm{EL}j}|,$$The cost function is beneficial to optimizing the PW, while the image contrast and NILS are used to promote the optimization of the DOF. Thus, the weight ${\epsilon}_{j}$ should be set much larger than ${w}_{1}$ and ${w}_{2}$ in optimization.

PROLITH software is used to simulate the DOF, contrast, and NILS owing to its superior ability in lithography simulation, where the mask, process, and lithography-tool models are accurately established.

## 2.2.

### Co-Optimization Flow

In this section, the parametric-based MPLCO is proposed and realized by an NCG algorithm. First, all parameters in the optimization are normalized in the same scale. Then, the parameters are optimized along the search direction iteratively until the best combination of mask, process, and lithography-tool parameters is found. Because the combination of mask, process, and lithography-tool parameters is a multidimensional point in the optimization space, we use point instead of combination and use ${x}_{n}$ to denote each parameter in the following sections.

The co-optimization flow can be generalized in the following steps, as shown in Fig. 1.

Step 1: Set the initial point and constraints of the parameters.

The constraints contain the range of parameters, the weight of the objective functions, the increment of parameters for derivative calculation, and the termination conditions. The termination conditions can be set as the number of iterations.

Step 2: Normalize the different scale parameters in the same range of [0, 1].

The large-scale parameters are likely to be obstructed by the small-scale parameters when optimizing different scale parameters. Normalization is carried out to eliminate the scale differences of the lithography parameters and to improve the convergence efficiency. The normalization is expressed as

where ${\overline{x}}_{n}$ is the normalized parameter, ${x}_{n}$ is the non-normalized original parameter, and ${x}_{\mathrm{max}}$ and ${x}_{\mathrm{min}}$ are the maximum and minimum values of ${x}_{n}$.Step 3: Calculate the search direction at the initial or iteration point.

A gradient-based algorithm is chosen to calculate the search direction owing to its fast convergence speed in solving lithography problems.

where ${\{{\overline{x}}_{n}\}}^{(0)}$ is the initial point, and ${\mathbf{d}}^{(0)}$ is the first search direction. For the $k$’th iteration, according to Fletcher–Reeves conjugate gradient method, the search direction is expressed as^{4}^{,}^{19}20.21.^{–}^{22}At the first iteration, the search direction is^{23}where ${\Vert \u2022\Vert}_{2}$ is the ${L}_{2}$ norm, and ${\{{\overline{x}}_{n}\}}^{(k)}$ represents the $k$’th iterative point.## (4)

$${\mathbf{d}}^{(k)}=-\nabla F({\{{\overline{x}}_{n}\}}^{(k)})+\frac{{\Vert \nabla F({\{{\overline{x}}_{n}\}}^{(k)})\Vert}_{2}^{2}}{{\Vert \nabla F({\{{\overline{x}}_{n}\}}^{(k-1)})\Vert}_{2}^{2}}\phantom{\rule{0ex}{0ex}}\xb7{\mathbf{d}}^{(k-1)},(k\ge 1),$$As there are no explicitly analytical relations between the cost function $F$ and the point $\{{x}_{n}\}$, the calculation of the derivative is intractable. We use a numerical approximation of the gradient instead of calculating the derivative of $F$. Hence, the approximate backward derivative is written as

where $\mathrm{\Delta}{\overline{x}}_{n}$ is a small increment or variation between the current iteration point and its neighborhood. For each parameter ${\overline{x}}_{n}$, $\mathrm{\Delta}{\overline{x}}_{n}$ needs to be set before calculating $\nabla F$. When ${x}_{n}$ is normalized to ${\overline{x}}_{n}$, $\mathrm{\Delta}{\overline{x}}_{n}$ and $\nabla F$ are the normalized values.## (5)

$$\nabla F({\{{\overline{x}}_{n}\}}^{(k)})=\frac{F({\{{\overline{x}}_{n}\}}^{(k)})-F({\{{\overline{x}}_{n}\}}^{(k)}-\{\mathrm{\Delta}{\overline{x}}_{n}\})}{\{\mathrm{\Delta}{\overline{x}}_{n}\}},$$Step 4: Calculate the local boundary along the search direction.

We define two types of boundaries: the global boundary and the local boundary. The global boundary is the range of parameters set in step 1, while the local boundary is the upper and lower limitations along the search direction. The local boundary falls within the range of the global boundary. The local boundary calculation ensures that the co-optimization of parameter is always within the valid search space. For the two-parameter local boundary calculation in Fig. 2, the points $A({a}_{1},{a}_{2})$ and $B({b}_{1},{b}_{2})$ are the global boundaries of the two parameters ${x}_{1}$ and ${x}_{2}$. $U({u}_{1},{u}_{2})$ and $V({v}_{1},{v}_{2})$ are the local boundaries along the direction $\mathbf{d}({d}_{1},{d}_{2})$ from the $k$’th iterative point $X({x}_{1}^{k},{x}_{2}^{k})$, which cannot exceed the range of $A({a}_{1},{a}_{2})$ and $B({b}_{1},{b}_{2})$. The line search will be carried out between point $X({x}_{1}^{k},{x}_{2}^{k})$ and $U({u}_{1},{u}_{2})$.

The local upper boundary point $U$ is calculated as follows:

where ${u}_{n}$ is the local upper boundary point along the direction $\mathbf{d}$, ${a}_{i}$ and ${b}_{i}$ are the global upper and lower boundary values, and $t$ is the dimension of the parameters. The function $\mathrm{min}\{\xb7\}$ is used to calculate the minimum common divisor of $({a}_{i}-{x}_{i})/{d}_{i}$.## (6)

$${u}_{n}=\{\begin{array}{cc}{x}_{n}+\mathrm{min}\left\{\frac{{a}_{i}-{x}_{i}}{{d}_{i}}\right\}\xb7{d}_{n}& (i=1,2,\cdots ,t;{d}_{n}\ne 0)\\ {a}_{n}& ({d}_{n}=0)\end{array},$$In Eq. (6), $\mathrm{min}\{({a}_{i}-{x}_{i})/{d}_{i}\}$ ensures that the local boundary point value ${u}_{k}$ is always less than or equal to the global boundary point value ${a}_{k}$. However, the calculation of $\mathrm{min}\{({a}_{i}-{x}_{i})/{d}_{i}\}$ brings an obstacle in optimizing different scale parameters because the local boundary of the large-scale parameter is usually restricted by the small-scale parameter. The local boundary of the large-scale parameter always falls within a small range along the search direction $\mathbf{d}$, which makes it difficult to optimize the large-scale parameter. For example, the scale for the mask bias value is usually $>20\text{\hspace{0.17em}}\text{\hspace{0.17em}}\mathrm{nm}$, while the scale for the illumination sigma value is 1. The mask bias value is large-scale parameter compared with the illumination sigma value. The local boundary of mask bias is likely restricted by the illumination sigma value when the two parameters are optimized together. However, there will be no difference in scales if the parameters are normalized to the same range in step 2, and the large-scale parameter will not be limited by the small-scale parameter.

Step 5: Apply a line search to find the optimal iteration point along the search direction.

After the local boundary is calculated, a line search is applied to find the optimum value in the segment between the iterative point $X$ and the upper local boundary $U$. The Hopfinger golden section search method is used to apply the line search owing to its ability in solving the problem in which the cost function is not unimodal.

^{24}We will not present the details of line search method here; interested readers may refer to Ref. 24 for more details.Step 6: If the termination condition is satisfied, go to step 7. Otherwise, go to step 3.

The MPLCO could find the best combination of mask, process, and lithography-tool parameters iteratively until the termination condition is satisfied.

Step 7: Output the optimized parameters of the mask, process, and lithography tool and the corresponding cost function $F$.

It is noted that a restitution operation (an inverse operation of normalization) is necessary for the simulation of $F$ when calculating the search direction (step 3) and applying the line search (step 5). All parameters need to be restituted to their original scales for the PROLITH simulation of DOF, contrast, and NILS.

Without the parameter normalization step (step 2) and the restitution operation, the optimization procedure can be denoted as a conventional conjugate gradient (CCG) method. The NCG method is developed on the basis of the CCG method and is an improvement of the CCG method. The normalizing operation could greatly improve the convergence when optimizing different-scale parameters, and the advantage is demonstrated in Sec. 3.2.

The MPLCO is a multidimensional optimization method; thus, all parameters related to the mask, process, and lithography tool could be optimized simultaneously. The convergence robustness of the MPLCO is guaranteed in three aspects. First, the cost function we built is easy to converge. As the DOF is easily trapped at the local minimum, we add the image contrast and NILS as assistant objective function. Second, the normalization in MPLCO guarantees the optimization of different scale parameters. Third, the Hopfinger golden section search method used in MPLCO is able to solve the nonunimodal cost function.

## 3.

## Co-Optimization Results

## 3.1.

### Simulation Conditions

Two typical geometries of a 45-nm node are used to demonstrate the validity of the MPLCO: the semidense line-space pattern and the dense contact hole. The duty ratios of these patterns are 1:2 and 1:1. The critical dimension (CD) of the line is 45 nm, while the CD of the contact hole is 60 nm.^{3} The PROLITH programming interface helps to communicate between the optimization algorithm and PROLITH.^{25} The PROLITH simulation condition is same as that in Ref. 3 except that the parameters have been optimized. We use a water (${n}_{\text{water}}=1.44$) immersion lithography system with the demagnification factor $R=4$. The source shape is annular with a fixed sigma width ${\sigma}_{\text{width}}=0.15$. The illumination is Y-polarized for the line-space pattern and TE-polarized for the contact hole.^{3} Since a tunable transmittance mask is available, the feature transmittance could be varied over a large range,^{26} and an attenuated phase-shift mask is used to enhance the lithography capability. The well-calibrated photoresist JSR ARX2895JN is used with a thickness of 120 nm for the line-space pattern and 102 nm for the contact hole.^{3} Dual bottom antireflective coatings (BARCs) are applied to reduce the substrate reflectivity and are optimized under the initial parameter settings. The thicknesses of BARCs are 33 and 43 nm for the line-space pattern and 24 and 43 nm for contact hole. The PW is constrained by the tolerances of $\mathrm{\Delta}\mathrm{CD}=\pm 10\%\mathrm{CD}$, an 80-deg sidewall angle, and a 10% photoresist loss to maintain pattern fidelity.^{27}

It is necessary to use a robust photoresist model to optimize the process parameters. In order to accurately simulate the process conditions, a full physical photoresist model, three-stage PEB model, and Mack development model are used in PROLITH.^{25} The full physical photoresist model is capable of simulating the reaction and diffusion of a chemically amplified photoresist accurately. The three-stage PEB model describes the temperature variation when the wafer is placed on a hotplate or chillplate and the wafer transition time between the two plates.^{6} The three-stage temperature profile is represented by nine parameters: the bake temperature, the bake duration, and the temperature rise (or fall) time for each stage. The initial temperature is 25°C. The temperatures for the hotplate stage, transfer stage, and chillplate stage are 110, 45, and 25°C, respectively. The rise (or fall) times for these three stages are 5, 50, and 5 s, respectively.^{25}

Eight parameters including the mask, process, and lithography-tool parameters are selected as an example to demonstrate the validity of the MPLCO. For the mask parameters, the mask bias and feature transmittance are optimized. For the process parameters, the hotplate duration, transition duration, chillplate duration, and photoresist development time are chosen for the optimization. For the lithography-tool parameters, the outer sigma ${\sigma}_{\text{out}}$ of annular illumination and the NA are selected for the optimization.

The DOF at $\mathrm{EL}=5\%$ is chosen to evaluate the PW, where $\mathrm{EL}=5\%$ is a common value for 45-nm lithography and is closer to the actual case.^{10} The goal of co-optimization is to extend the DOF at $\mathrm{EL}=5\%$ to be as large as possible. The DOF at $\mathrm{EL}=3\%$ is another objective function in the optimization. The weights for ${P}_{\text{Contrast}}$, ${P}_{\text{NILS}}$, ${P}_{\mathrm{DOF}@\mathrm{EL}=5\%}$, and ${P}_{\mathrm{DOF}@\mathrm{EL}=3\%}$ in $F$ are ${w}_{1}=0.2$, ${w}_{2}=0.1$, ${\epsilon}_{1}=10$, and ${\epsilon}_{2}=2$, respectively. Only one metrology plane is set for each pattern, and the metrology plane could cover a whole pitch. The PWs measured at the metrology planes are the overlap PWs for these two infinite and periodic patterns. The metrology plane for the line space is a cutline perpendicular to the line direction, while for the contact hole the cutline is set at the center of the hole. The optimization terminates after 20 iterations of the line search.

The unoptimized parameters for the line space and contact hole are set as the initial point ${\{x\}}^{(0)}$. The ranges of the parameters are set in a reasonable range, which ensures that the values of the parameters are realizable in lithography. The conditions for the initial value ${\{x\}}^{(0)}$, the minimum value (lower global boundary), the maximum value (upper global boundary), and the increment of the parameters $\{\mathrm{\Delta}x\}$ are listed in Table 1. Owing to the different properties of these parameters, different increments $\mathrm{\Delta}{x}_{n}$ and $\mathrm{\Delta}{\overline{x}}_{n}$ are set to calculate the gradient for the CCG and NCG methods. It is noted that $\mathrm{\Delta}{x}_{n}$ is the actual value of the parameter, while $\mathrm{\Delta}{\overline{x}}_{n}$ is the normalized value.

## Table 1

Co-optimization conditions and results.

Pattern | Parameters | Conditions | Optimized results | |||||
---|---|---|---|---|---|---|---|---|

Initial | Min. | Max. | CCG increment | NCG increment | MPLCO-CCG | MPLCO-NCG | ||

Line space | Mask bias (nm) | 10 | 0 | 40 | 5 | 0.1 | 10 | 39.98 |

Feature transmittance | 0.06 | 0.03 | 0.3 | 0.01 | 0.1 | 0.167 | 0.108 | |

Hotplate duration (s) | 60 | 10 | 100 | 5 | 0.1 | 60 | 10.03 | |

Transition duration (s) | 10 | 2 | 20 | 2 | 0.1 | 10 | 6.93 | |

Chillplate duration (s) | 30 | 10 | 60 | 5 | 0.1 | 30 | 13 | |

Development time (s) | 30 | 5 | 60 | 2 | 0.1 | 30 | 60 | |

σout | 0.78 | 0.65 | 0.99 | 0.02 | 0.1 | 0.86 | 0.84 | |

NA | 1.2 | 1 | 1.35 | 0.02 | 0.1 | 1 | 1 | |

Contact hole | Mask bias (nm) | 10 | 0 | 30 | 5 | 0.05 | 10 | 16.31 |

Feature transmittance | 0.06 | 0.03 | 0.3 | 0.02 | 0.05 | 0.06 | 0.17 | |

Hotplate duration (s) | 60 | 10 | 100 | 5 | 0.05 | 60 | 59.8 | |

Transition duration (s) | 10 | 2 | 20 | 2 | 0.05 | 10 | 9.96 | |

Chillplate duration (s) | 30 | 10 | 60 | 5 | 0.05 | 30 | 29.91 | |

Development time (s) | 12 | 5 | 60 | 2 | 0.05 | 12 | 24.37 | |

σout | 0.86 | 0.65 | 0.99 | 0.05 | 0.05 | 0.92 | 0.92 | |

NA | 1.2 | 1 | 1.35 | 0.05 | 0.05 | 1 | 1 |

Note: CCG, conventional conjugate gradient, NCG, normalized conjugate gradient, MPLCO, mask, process, and lithography-tool parameter co-optimization.

## 3.2.

### Co-Optimization Results

The results are demonstrated in two aspects. We first present the improvement in the PW by using the MPLCO method. The convergence efficiency of the MPLCO by using the CCG (MPLCO-CCG) method and the proposed NCG (MPLCO-NCG) method are also compared. Then, the mask and source complexities are compared between the MPLCO method and the pixelated SMO in Ref. 3.

The optimized parameters of the MPLCO-CCG and MPLCO-NCG methods are listed in the last two columns in Table 1. The MPLCO-CCG results show no changes for the mask bias, PEB duration, and develop time, although the small-scale parameters are well optimized. This is mainly due to the drawback of the local boundary calculation when optimizing the different scale parameters. The optimization of the large-scale parameters is limited by small-scale parameters in the optimization procedure. However, when all the parameters are normalized to the same scale in the MPLCO-NCG method, the optimization can move further. The normalized method could improve the convergence efficiency of the optimization. Figure 3 illustrates the convergence curve of the MPLCO-CCG and MPLCO-NCG methods, which show that the MPLCO-NCG method could effectively reduce the cost function.

The MPLCO-NCG method greatly decreases the hotplate and chillplate duration but increases the development time. The PEB mainly affects the reaction of chemically amplified photoresist, which thereby impacts the profile of the photoresist. The entire PEB durations are 29.96 and 99.67 s for the line-space pattern and contact hole, which are shorter than the initial setting of 100 s. The development time affects the CD of the photoresist, which has been optimized to 60 and 24.37 s for the line-space pattern and contact hole. The optimized process parameters ensure the exact chemical reaction of the photoresist with an acceptable profile in a large PW.

The EL versus DOF curves for the line-space pattern and contact hole are illustrated in Fig. 4. The curves denoted with triangles and stars represent the PW of the MPLCO-CCG and MPLCO-NCG methods, respectively. We use the dashed line to indicate the DOF at $\mathrm{EL}=5\%$ in the figure. The DOF at $\mathrm{EL}=5\%$ of the MPLCO-CCG and MPLCO-NCG methods are 0.349 and 0.558 *μ*m for the line-space pattern, respectively, and 0.344 and 0.356 *μ*m for the contact hole, respectively. The PW is greatly extended after co-optimization, especially when the mask, process, and lithography-tool parameters are well optimized by using the MPLCO-NCG method. The DOF at $\mathrm{EL}=5\%$ of the MPLCO-NCG method increases by 59.89 and 3.49% for the line-space pattern and contact hole compared with the MPLCO-CCG method. The reason is mainly because the large-scale parameters are not being optimized when using the MPLCO-CCG. The comparison between MPLCO-CCG and MPLCO-NCG indicates that the more parameters have been well optimized, the larger PW can be obtained. As an SMO work is done to extending the PW in Ref. 3, the EL versus DOF curve of the SMO denoted with square is represented in Fig. 4. The DOF at $\mathrm{EL}=5\%$ of the SMO is 0.288 *μ*m for line-space pattern and 0.172 *μ*m for contact hole, which are much smaller than the DOF of MPLCO-NCG. The comparison between the MPLCO and SMO could also prove that large PW can be obtained when more parameters are involved in optimization.

In order to demonstrate the merits of MPLCO and the impact of the process parameters on lithography, we only optimize the process parameters under the condition that the mask and source are optimized by the SMO in Ref. 3. The increments of NCG method are $\mathrm{\Delta}{\overline{x}}_{n}=0.01$ for the line-space pattern and $\mathrm{\Delta}{\overline{x}}_{n}=0.1$ for the contact hole. The parameter settings and optimized results are listed in Table 2. The EL versus DOF curves are shown in Fig. 4. The DOF at $\mathrm{EL}=5\%$ of the SMO with process optimization (PO) are 0.327 *μ*m for the line-space pattern and 0.178 *μ*m for the contact hole, which are 13.54 and 3.49% larger than that of SMO. This verifies that the PW could be extended by optimizing the process parameters. From Fig. 4, we also find that the DOF at $\mathrm{EL}=5\%$ for the MPLCO-NCG method is still larger than that of SMO with PO. Although the maximum EL of the MPLCO-NCG method is smaller than that of SMO and SMO with PO, the DOF at $\mathrm{EL}=5\%$ of the MPLCO-NCG method definitely increases. The results indicate that a large PW could be obtained only if the mask, process, and lithography-tool parameters are optimized simultaneously.

## Table 2

Co-optimization of the process parameters for source mask optimization.

Parameters | Conditions | Optimized results | |||
---|---|---|---|---|---|

Initial | Min. | Max. | Line space | Contact hole | |

Hotplate duration (s) | 60 | 10 | 100 | 23.01 | 59.92 |

Transition duration (s) | 10 | 2 | 20 | 8.43 | 9.99 |

Chillplate duration (s) | 30 | 10 | 60 | 12.62 | 29.98 |

Development time (s) | 12 | 5 | 60 | 34.23 | 6.72 |

The second advantage of the proposed MPLCO method is to reduce the mask and source complexity. A parametric mask and source are used in the MPLCO. For comparison, Fig. 5 shows the mask pattern and source shape of the SMO, MPLCO-CCG method, and MPLCO-NCG method. For the contact hole of the SMO, the mask pattern is much more complex compared with the MPLCO-CCG and MPLCO-NCG results. A complex mask is difficult to manufacture and will greatly increase the cost of mask fabrication. For the parametric mask, the mask pattern has only been biased rather than applying complex mask correction, which could reduce the complexity of the mask. The source shapes of the SMO results suffer from an intricate distribution with a nonuniform intensity. It is necessary to use a customized diffractive optical element or an expensive programmable illuminator to generate the pixelated source. However, one just needs to adjust the sigma value for the parametric source in which the source shape always has high uniformity and stability.

The mask error enhancement factor (MEEF) is compared. The MEEF of the SMO result, MPLCO-CCG result, and MPLCO-NCG result are 5.45, 4.74, and 4.47 for the line-space pattern, respectively, and 12.69, 15, and 15.39 for the contact hole, respectively. There are nearly no improvements for the MEEF when using MPLCO compared with SMO. The MEEF of MPLCO result is smaller than SMO result for line-space pattern, but larger than SMO result for contact hole. Contact hole suffers the largest MEEFs of any feature type because the change in mask size of a regular contact hole will lead to change in exposure dose.^{27} There are more clear areas at the corner of contact hole optimized by SMO; the change of mask size will lead to less change in exposure dose compared with the regular contact hole optimized by MPLCO. Therefore, the MEEF of regular contact hole optimized by MPLCO is larger than the contact hole optimized by SMO.

## 4.

## Conclusion and Discussion

This paper proposes an MPLCO method that can optimize the mask, process, and lithography-tool parameters simultaneously. An NCG algorithm is used in the MPLCO to improve the convergence efficiency and is successfully applied to optimize different scale parameters. As the process parameters have strong impact on lithography performance, it is necessary to use a full physical photoresist model to simulate the process behavior accurately when optimizing the process parameters. The PW of the SMO with optimized process is larger than the SMO, but still smaller than that of MPLCO. The results indicate that the more parameters have been optimized simultaneously, the larger PW could be obtained.

In addition, the complexity of the mask and source could be greatly reduced by using the MPLCO compared with the SMO. The mask pattern has only been biased rather than applying complex mask correction, while the source shape is controlled by adjusting the sigma value. The assumption and simplification for the mask is sufficient when optimizing the one-dimensional line-space pattern and two-dimensional contact hole. The cost of fabrication could be greatly reduced by using the parametric mask and source. In our future work, we will model the complex two-dimensional mask pattern in parametric form and apply the MPLCO to it.

## Acknowledgments

We gratefully acknowledge the financial support from the National Science and Technology Major Project, Key Program of National Natural Science Foundation of China under Grant No. 60938003, and National Natural Science Foundation of China under Grant No. 61204113. This work is also supported by the Key Laboratory of Photoelectronic Imaging Technology and System, Beijing Institute of Technology, Ministry of Education of China, under Grant No. 2013OEIOF05. We thank KLA-Tencor Corporation for providing academic use of PROLITH.

## References

A. K. Wong, Resolution Enhancement Techniques in Optical Lithography, SPIE, Bellingham, WA (2001).Google Scholar

X. MaG. R. Arce, Computational Lithography, 1st ed., John Wiley and Sons, New York (2010).Google Scholar

X. Maet al., “Hybrid source mask optimization for robust immersion lithography,” Appl. Opt. 52(18), 4200–4211 (2013).APOPAI0003-6935http://dx.doi.org/10.1364/AO.52.004200Google Scholar

J. LiY. ShenE. Y. Lam, “Hotspot-aware fast source and mask optimization,” Opt. Express 20(19), 21792–21804 (2012).OPEXFF1094-4087http://dx.doi.org/10.1364/OE.20.021792Google Scholar

X. MaG. R. Arce, “Pixel-based simultaneous source and mask optimization for resolution enhancement in optical lithography,” Opt. Express 17(7), 5783–5793 (2009).OPEXFF1094-4087http://dx.doi.org/10.1364/OE.17.005783Google Scholar

M. D. SmithC. MackJ. S. Pertersen, “Modeling the impact of thermal history during post exposure bake on the lithographic performance of chemically amplified resists,” Proc. SPIE 4345, 1013–1021 (2001).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.436826Google Scholar

B. M. Rathsacket al., “Optical lithography simulation and photoresist optimization for photomask fabrication,” Proc. SPIE 3678, 1215–1226 (1999).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.350173Google Scholar

Y. Shenet al., “Robust level-set-based inverse lithography,” Opt. Express 19(6), 5511–5521 (2011).OPEXFF1094-4087http://dx.doi.org/10.1364/OE.19.005511Google Scholar

P. YuS. X. ShiD. Z. Pan, “True process variation aware optical proximity correction with variational lithography modeling and model calibration,” J. Micro/Nanolith. MEMS MOEMS 6(3), 031004 (2007).JMMMGF1932-5134http://dx.doi.org/10.1117/1.2752814Google Scholar

A. V. Pretet al., “Combined mask and illumination scheme optimization for robust contact patterning on 45 nm technology node flash memory devices,” Proc. SPIE 6924, 69243B (2008).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.772676Google Scholar

J. V. Wingerdenet al., “Lithographic process optimization using process capability analysis,” Proc. SPIE 5040, 882–893 (2003).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.485520Google Scholar

Y. Liet al., “The cross talk of multi-errors impact on lithography performance and the method of its control,” Proc. SPIE 8418, 841802 (2012).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.978295Google Scholar

N. JiaE. Y. Lam, “Pixelated source mask optimization for process robustness in optical lithography,” Opt. Express 19(20), 19384–19398 (2011).OPEXFF1094-4087http://dx.doi.org/10.1364/OE.19.019384Google Scholar

Y. Granik, “Source optimization for image fidelity and throughput,” J. Microlithogr., Microfabr., Microsyst. 3(4), 509–522 (2004).http://dx.doi.org/10.1117/1.1794708Google Scholar

A. E. RosenbluthN. Seong, “Global optimization of the illumination distribution to maximize integrated process window,” Proc. SPIE 6154, 61540H (2006).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.656950Google Scholar

J. A. TorresY. GranikF. Schellenberg, “Contrast analysis and optimization for resolution enhancement technique,” J. Microlithogr., Microfabr., Microsyst. 2(2), 119–128 (2003).http://dx.doi.org/10.1117/1.1562931Google Scholar

J. YuP. Yu, “Impacts of cost functions on inverse lithography patterning,” Opt. Express 18(22), 23331–23342 (2010).OPEXFF1094-4087http://dx.doi.org/10.1364/OE.18.023331Google Scholar

M. C. SmaylingV. Axelrad, “32 nm and below logic patterning using optimized illumination and double patterning,” Proc. SPIE 7274, 72740K (2009).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.814258Google Scholar

J. YuP. YuH. Chao, “Fast source optimization involving quadratic line-contour objectives for the resist image,” Opt. Express 20(7), 8161–8174 (2012).OPEXFF1094-4087http://dx.doi.org/10.1364/OE.20.008161Google Scholar

A. PoonawalaP. Milanfar, “Mask design for optical microlithography—an inverse imaging problem,” IEEE Trans. Image Process. 16(3), 774–788 (2007).IIPRE41057-7149http://dx.doi.org/10.1109/TIP.2006.891332Google Scholar

Y. Granik, “Fast pixel-based mask optimization for inverse lithography,” J. Microlithogr., Microfabr., Microsyst. 5(4), 043002 (2006).http://dx.doi.org/10.1117/1.2399537Google Scholar

Y. Peng et al., “Gradient-based source and mask optimization in optical lithography,” IEEE Trans. Image Process. 20(10), 2856–2864 (2011).IIPRE41057-7149http://dx.doi.org/10.1109/TIP.2011.2131668Google Scholar

A. RavindranK. M. RagsdellG. V. Reklaitis, Engineering Optimization Methods and Applications, John Wiley and Sons, New Jersey (2006).Google Scholar

E. Hopfinger, “On the solution of the unidimensional local minimization problem,” J. Optim. Theory Appl. 18(3), 425–428 (1976).JOTABN0022-3239http://dx.doi.org/10.1007/BF00933821Google Scholar

KLA-Tencor Corporation, “PROLITH Manual,” http://www.K-T.com/current-software-solutions/computational-lithography.html.Google Scholar

B. S. Kasprowiczet al., “Tunable transmission phase mask options for $65/45\text{\hspace{0.17em}}\text{\hspace{0.17em}}\mathrm{nm}$ node gate and contact processing,” Proc. SPIE 5754, 1469–1477 (2005).PSISDG0277-786Xhttp://dx.doi.org/10.1117/12.601011Google Scholar

C. Mack, Fundamental Principles of Optical Lithography: The Science of Microfabrication, John Wiley and Sons, New York (2007).Google Scholar

## Biography

**Xuejia Guo** received his BS degree in optical information science and technology from University of Jinan in 2008. He is currently a PhD candidate directed by Professor Yanqiu Li in the School of Optoelectronics at Beijing Institute of Technology. His current interests involve lithography simulation and optimization.

**Yanqiu Li** received her MS and PhD degrees in optics from Harbin Institute of Technology of China. She is currently a professor of School of Optoelectronics at Beijing Institute of Technology, Beijing, China. She holds over 30 Chinese patents and has published numerous articles on lithographic science.

**Lisong Dong** received his BS degree in applied physics from Hefei University of Technology, China, in 2008. He is currently a PhD candidate directed by Professor Yanqiu Li at Beijing Institute of Technology, Beijing, China. His research interests include lithography simulation and resolution enhancement technology.

**Lihui Liu** received his BS and MS degrees in optics from Hebei University, respectively, in 1999 and 2002. He received his PhD from Institute of Modern Optics in NanKai University in 2005. He is currently an associate professor of School of Optoelectronics at Beijing Institute of Technology, Beijing, China. His research is focused on optical sensing, lithography simulation, and optimization.