Introduction
In wind observation for wind power development, the use of remote sensing equipment such as Doppler LiDAR is rapidly expanding as a way to obtain wind conditions at sites where observation masts are difficult to use. Guidelines and related documents 1)2)3)4) present statistics (the regression coefficient and the coefficient of determination), calculated with the wind direction and speed measured by an observation mast as the reference, as indices for quantitatively evaluating the accuracy of the wind direction and speed measured by remote sensing equipment. Statistics offers several definitions of the regression coefficient and the coefficient of determination. With reference to these guidelines 1)2)3)4), this article explains the statistics used in wind condition surveys, providing formulas and concrete examples.
Wind direction data correction
Wind direction, the direction from which the wind blows, takes values from 0° to 360°. If the data are used directly for accuracy verification, a scatter plot of the reference data 𝑥 (e.g., observation mast data) against the target data 𝑦 (e.g., Doppler LiDAR data) may contain points (raw data) that deviate significantly from the line 𝑦 = 𝑥, as shown in Figure 1, and the statistics will not be calculated correctly. For instance, a reference value of 359° paired with a target value of 1° differs by only 2° in direction, yet the point lies far from the line. Therefore, before evaluating accuracy, the following correction is applied so that the two data sets align with the line 𝑦 = 𝑥.

Here, the subscript 𝑖 (= 1, 2, 3, …, 𝑁) of 𝑥 and 𝑦 is the data index. As shown in Figure 1, the correction yields corrected data in place of the raw data, so that the wind direction statistics can be calculated correctly.

Figure 1. Wind direction data manipulation
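The correction equations themselves are not reproduced in this text. As a minimal sketch, assuming the usual wrap-around treatment (shift the target value by 360° whenever it differs from the reference by more than 180°), the correction could look like the following. The function name and the sample values are hypothetical and are not taken from the guidelines.

```python
def align_wind_direction(x_deg, y_deg):
    """Shift y by +/-360 degrees so that it lies within 180 degrees of x.

    Assumes both inputs are wind directions in the range 0 to 360 degrees.
    """
    diff = y_deg - x_deg
    if diff > 180.0:
        return y_deg - 360.0
    if diff < -180.0:
        return y_deg + 360.0
    return y_deg


# Hypothetical mast (reference) and LiDAR (target) wind directions in degrees.
mast = [358.0, 2.0, 180.0, 10.0]
lidar = [3.0, 357.0, 185.0, 12.0]

corrected = [align_wind_direction(x, y) for x, y in zip(mast, lidar)]
print(corrected)  # [363.0, -3.0, 185.0, 12.0]
```

After the shift, corrected values may fall outside 0° to 360°. In this sketch that is intentional, because the goal is for each pair to sit close to the line 𝑦 = 𝑥.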
Regression coefficients and coefficients of determination
The wind direction and speed measured by remote sensing equipment are evaluated by linear regression, with the observation mast data 𝑥ᵢ as the explanatory variable and the remote sensing data 𝑦ᵢ as the objective variable.
The regression line for wind direction is expressed as 𝑦 = 𝑎𝑥 + 𝑏, with an intercept. Following the definition in the Roadmap for the commercial acceptance of floating LIDAR technology 1), the intercept 𝑏 of the regression line is expressed as the mean difference between 𝑥ᵢ and 𝑦ᵢ:

$$ b = \bar{y} - \bar{x} \tag{3} $$

Here, x̄ is the mean of 𝑥ᵢ and ȳ is the mean of 𝑦ᵢ. Given the intercept 𝑏, the slope 𝑎 of the regression line is obtained by the least squares method from the following equation:

The regression line for wind speed is expressed as 𝑦 = 𝑎′𝑥, with no intercept. The slope 𝑎′ of the regression line is obtained by the least squares method as follows:

$$ a' = \frac{\sum_{i=1}^{N} x_i y_i}{\sum_{i=1}^{N} x_i^2} \tag{5} $$
As equations (4) and (5) show, the slope of the regression line is defined differently depending on whether the line has an intercept.
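The regression coefficients can be sketched in a few lines of Python. The intercept follows the mean-difference definition and the no-intercept slope follows the least squares definition described above. Equation (4) is not reproduced in this text, so the function `slope_fixed_intercept` is an assumption: it minimises the sum of squared residuals over 𝑎 while holding 𝑏 fixed, which gives 𝑎 = Σ𝑥ᵢ(𝑦ᵢ − 𝑏) / Σ𝑥ᵢ². All function names and sample values are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def intercept_mean_difference(x, y):
    """Intercept as the mean difference: b = mean(y) - mean(x)."""
    return mean(y) - mean(x)


def slope_fixed_intercept(x, y, b):
    """ASSUMED form of the slope with intercept: least squares with b held fixed."""
    return sum(xi * (yi - b) for xi, yi in zip(x, y)) / sum(xi * xi for xi in x)


def slope_through_origin(x, y):
    """Slope without intercept: a' = sum(x * y) / sum(x * x)."""
    return sum(xi * yi for xi, yi in zip(x, y)) / sum(xi * xi for xi in x)


# Hypothetical wind direction pairs (degrees), already corrected for wrap-around.
dir_x = [10.0, 100.0, 200.0, 300.0]
dir_y = [12.0, 103.0, 201.0, 304.0]
b = intercept_mean_difference(dir_x, dir_y)
a = slope_fixed_intercept(dir_x, dir_y, b)
print(f"wind direction: a = {a:.4f}, b = {b:.2f}")

# Hypothetical wind speed pairs (m/s).
speed_x = [4.0, 8.0, 12.0]
speed_y = [4.2, 8.1, 12.3]
a_prime = slope_through_origin(speed_x, speed_y)
print(f"wind speed: a' = {a_prime:.4f}")
```

With 𝑏 set to zero, the assumed slope formula reduces to the no-intercept slope, which is one reason it seems a plausible reading. It should still be checked against the guideline before use.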
The coefficient of determination R² indicates how well the regression line fits, that is, how well the explanatory variable 𝑥 explains the objective variable 𝑦. Tarald O. Kvalseth 5) lists eight definitions of R² (R²₁ to R²₈). This article uses R²₁, which appears in the most common textbooks 6) and is given by the following equation:

$$ R_1^2 = 1 - \frac{\sum_{i=1}^{N} (y_i - f_i)^2}{\sum_{i=1}^{N} (y_i - \bar{y})^2} \tag{6} $$
Here, 𝑓ᵢ is the value estimated from the regression line: 𝑓ᵢ = 𝑎𝑥ᵢ + 𝑏 for wind direction and 𝑓ᵢ = 𝑎′𝑥ᵢ for wind speed. R² normally takes values between 0 and 1. When R² = 1, 𝑥ᵢ fully explains 𝑦ᵢ, which means that when 𝑥ᵢ and 𝑦ᵢ are plotted on a graph, all the points fall on a straight line.
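As an illustration, definition R²₁ from Kvalseth's paper, 1 − Σ(𝑦ᵢ − 𝑓ᵢ)² / Σ(𝑦ᵢ − ȳ)², can be written directly as a small function. This is a minimal sketch. The function name and the sample wind speed values are hypothetical.

```python
def r_squared_1(y, f):
    """R^2_1 = 1 - sum((y - f)^2) / sum((y - mean(y))^2)."""
    y_mean = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, f))
    ss_tot = sum((yi - y_mean) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Hypothetical wind speed pairs (m/s) and a regression line through the origin.
x = [4.0, 8.0, 12.0]
y = [4.2, 8.1, 12.3]
a_prime = sum(xi * yi for xi, yi in zip(x, y)) / sum(xi * xi for xi in x)
f = [a_prime * xi for xi in x]

r2 = r_squared_1(y, f)
print(f"R^2_1 = {r2:.4f}")

# When every point lies exactly on the line, the residuals vanish and R^2_1 = 1.
r2_perfect = r_squared_1([2.0, 4.0, 6.0], [2.0, 4.0, 6.0])
```

Because the function takes the fitted values 𝑓ᵢ as input, the same code serves both the wind direction line (with intercept) and the wind speed line (without intercept).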
Warnings
When calculating statistics in practice, you probably have many opportunities to use spreadsheet software (such as Microsoft Excel) or programming languages (such as Python and R). The regression analysis functions implemented in these programs are very useful for calculating statistics, but you need to fully understand how the statistics derived from the functions are defined before using them.
For example, the intercept of a regression line 𝑦 = 𝑎𝑥 + 𝑏 obtained with a spreadsheet or programming language is defined by the following equation based on the least squares method, which does not match equation (3), defined by the difference in means.

$$ b = \bar{y} - a\bar{x} $$
It has also been pointed out that the definition of R² may differ from equation (6) depending on the spreadsheet software or programming language, and on whether the regression line has an intercept. In some cases R²₇ is calculated instead of R²₁ when R² is computed for a regression without an intercept, and it has been confirmed that its value is larger than that of R²₁ 7)8).
Therefore, when calculating statistics based on equations (4) to (6) using a spreadsheet or programming language, we recommend that you build a program that faithfully follows the definitions by using your own functions instead of regression analysis functions.
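The two pitfalls above can be reproduced with self-written functions. The sketch below compares R²₁ with Kvalseth's R²₇, which divides by the uncentred sum Σ𝑦ᵢ² instead of Σ(𝑦ᵢ − ȳ)², for a regression through the origin. It then compares the ordinary least squares intercept ȳ − 𝑎x̄ with the mean-difference intercept ȳ − x̄. The data and all function names are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def residual_sum_of_squares(y, f):
    return sum((yi - fi) ** 2 for yi, fi in zip(y, f))


def r_squared_1(y, f):
    """R^2_1: residuals relative to the spread of y around its mean."""
    y_mean = mean(y)
    return 1.0 - residual_sum_of_squares(y, f) / sum((yi - y_mean) ** 2 for yi in y)


def r_squared_7(y, f):
    """R^2_7: residuals relative to the uncentred sum of squares of y."""
    return 1.0 - residual_sum_of_squares(y, f) / sum(yi ** 2 for yi in y)


# Hypothetical wind speed pairs (m/s), regression through the origin.
speed_x = [4.0, 6.0, 8.0, 10.0, 12.0]
speed_y = [4.3, 5.8, 8.4, 9.9, 12.5]
a_prime = sum(x * y for x, y in zip(speed_x, speed_y)) / sum(x * x for x in speed_x)
fitted = [a_prime * x for x in speed_x]
r2_1 = r_squared_1(speed_y, fitted)
r2_7 = r_squared_7(speed_y, fitted)
print(f"R^2_1 = {r2_1:.4f}, R^2_7 = {r2_7:.4f}")

# Hypothetical wind direction pairs (degrees), already corrected for wrap-around.
dir_x = [10.0, 100.0, 200.0, 300.0]
dir_y = [12.0, 103.0, 201.0, 304.0]
x_mean, y_mean = mean(dir_x), mean(dir_y)
a_ols = (sum((x - x_mean) * (y - y_mean) for x, y in zip(dir_x, dir_y))
         / sum((x - x_mean) ** 2 for x in dir_x))
b_ols = y_mean - a_ols * x_mean   # intercept from ordinary least squares
b_mean_diff = y_mean - x_mean     # intercept as the difference in means
print(f"b (least squares) = {b_ols:.3f}, b (mean difference) = {b_mean_diff:.3f}")
```

For the same residuals, R²₇ is never smaller than R²₁, because Σ𝑦ᵢ² is at least as large as Σ(𝑦ᵢ − ȳ)². The two intercepts agree only when the least squares slope is exactly 1 or x̄ is 0.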
Example
As a concrete example of accuracy verification, this section introduces the requirements given in Wind Farm Certification: Onshore Wind Power Plant Edition 2) (hereinafter referred to as the NK Guidelines) and the Offshore Wind Observation Guidebook 4).
The NK Guidelines require that the correlation between observation data from observation masts and remote sensing instruments be verified for each of the following cases (1) and (2) 3).
(1) When observation data from an observation mast and remote sensing equipment are combined, treated as observation data at hub height, and used as input data for airflow analysis.
(2) When observation data from an observation mast are used as input data for airflow analysis, and observation data from remote sensing equipment are used only to verify the validity of the airflow analysis.
Table 1. Correlation between observation data from observation masts and remote sensing instruments3)

The Offshore Wind Observation Guidebook 4) defines KPIs (Key Performance Indicators) and acceptance criteria for the accuracy of wind direction and speed (Table 2). If the error in the wind direction and speed measured by remote sensing equipment is within the minimum criteria in the table, the measurements are deemed to be as accurate as those of a cup anemometer or a vane-type wind direction sensor on an observation mast and can be used. The wind speed range is set to 4 m/s or more and less than 16 m/s because, for the purpose of estimating power generation and assessing wind conditions, this range is considered to contribute most to power generation and loads.
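Restricting the data to the stated wind speed range is a simple filtering step that comes before the statistics are calculated. The sketch below assumes that the range is applied to the reference (mast) wind speed. That choice, the function name, and the sample pairs are assumptions rather than something stated in the guidebook text quoted here.

```python
def select_speed_range(pairs, lower=4.0, upper=16.0):
    """Keep (reference, target) pairs whose reference speed satisfies lower <= x < upper."""
    return [(x, y) for x, y in pairs if lower <= x < upper]


# Hypothetical (mast, LiDAR) wind speed pairs in m/s.
pairs = [(3.9, 4.1), (4.0, 4.2), (10.5, 10.4), (15.9, 16.1), (16.0, 15.8)]

selected = select_speed_range(pairs)
print(selected)  # [(4.0, 4.2), (10.5, 10.4), (15.9, 16.1)]
```

The lower bound is inclusive and the upper bound exclusive, matching "4 m/s or more and less than 16 m/s".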
Table 2. Accuracy KPIs and acceptance criteria

Conclusion
This article explained regression coefficients and coefficients of determination, which are used to evaluate wind condition data measured by remote sensing equipment. If you have any questions about the technical content, please feel free to contact RTI.
(Written by Takeyuki Misaki)
References
- Carbon Trust Offshore Wind Accelerator (OWA), Roadmap for the commercial acceptance of floating LIDAR technology, 2018, URL: https://www.carbontrust.com/our-work-and-impact/guides-reports-and-tools/roadmap-for-commercial-acceptance-of-floating-lidar, Accessed: 2023/
- ClassNK, Wind Farm Certification for Onshore Wind Power Plants, 2023, URL: https://www.classnk.or.jp/hp/pdf/authentication/renewableenergy/ja/windfarm/NKRE-GL-WFC01_March2023_Jpn_20230331.pdf, Accessed: 2023/08/07
- Bureau Veritas Japan, Registered Conformity Assessment Body Technical Standards for Conformity Assessment, 2023, URL: https://www.bureauveritas.jp/sites/g/files/zypfnx696/files/media/document/windfirm_gijutsukijun_Rev0_0.pdf, Accessed: 2023/08/07
- New Energy and Industrial Technology Development Organization, Offshore Wind Observation Guidebook, 2023, URL: https://www.nedo.go.jp/content/100948288.pdf, Accessed: 2023/08/07
- Tarald O. Kvalseth, Cautionary Note about R², The American Statistician, Vol. 39, No. 4, Part 1 (Nov., 1985), pp. 279-285.
- Department of Statistics, Faculty of Liberal Arts, University of Tokyo, Introduction to Statistics (Basic Statistics I), University of Tokyo Press.
- Memo log, I plotted the difference in the coefficient of determination R^2 of R's lm depending on the definition formula, URL: https://indenkun.hatenablog.com/entry/2020/03/11/180921, Accessed: 2023/08/07
- Iguchi Laboratory, Institute of Biological Sciences, Coefficient of Determination R² Differences: Using Excel, OpenOffice, LibreOffice and the statistical analysis software R, URL: https://biolab.sakura.ne.jp/r-squared.html, Accessed: 2023/08/07