Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 5034 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 761 |
Duplicate rows (%) | 15.1% |
Total size in memory | 250.8 KiB |
Average record size in memory | 51.0 B |
Variable types
Numeric | 3 |
---|---|
Text | 2 |
DateTime | 1 |
Dataset
Description | 일반병해충 참나무시들음병 고사목 상세정보 데이터를 제공합니다.- 지역X좌표, 지역Y좌표, 국가지점번호, 법정동코드, PNU코드, 조사일자 |
---|---|
Author | 산림청 |
URL | https://www.data.go.kr/data/15120578/fileData.do |
Dataset has 761 (15.1%) duplicate rows | Duplicates |
지역X좌표 is highly overall correlated with PNU코드 | High correlation |
PNU코드 is highly overall correlated with 지역X좌표 | High correlation |
Reproduction
Analysis started | 2023-12-12 20:31:59.529238 |
---|---|
Analysis finished | 2023-12-12 20:32:01.138774 |
Duration | 1.61 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
지역X좌표
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 2718 |
---|---|
Distinct (%) | 54.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 214738.59 |
Minimum | 179302 |
---|---|
Maximum | 2015722 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 44.4 KiB |
Quantile statistics
Minimum | 179302 |
---|---|
5-th percentile | 179816.7 |
Q1 | 193730 |
median | 205793.5 |
Q3 | 223623.75 |
95-th percentile | 281184.4 |
Maximum | 2015722 |
Range | 1836420 |
Interquartile range (IQR) | 29893.75 |
Descriptive statistics
Standard deviation | 42472.054 |
---|---|
Coefficient of variation (CV) | 0.19778492 |
Kurtosis | 644.69716 |
Mean | 214738.59 |
Median Absolute Deviation (MAD) | 16353 |
Skewness | 16.398076 |
Sum | 1.0809941 × 109 |
Variance | 1.8038753 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
193753 | 12 | 0.2% |
235109 | 10 | 0.2% |
219538 | 10 | 0.2% |
220125 | 10 | 0.2% |
235479 | 10 | 0.2% |
235458 | 9 | 0.2% |
219534 | 8 | 0.2% |
193737 | 8 | 0.2% |
235478 | 8 | 0.2% |
235456 | 8 | 0.2% |
Other values (2708) | 4941 |
Value | Count | Frequency (%) |
179302 | 1 | < 0.1% |
179304 | 3 | |
179306 | 2 | |
179309 | 2 | |
179310 | 1 | < 0.1% |
179311 | 2 | |
179312 | 1 | < 0.1% |
179313 | 4 | |
179315 | 4 | |
179316 | 1 | < 0.1% |
Value | Count | Frequency (%) |
2015722 | 1 | |
404781 | 1 | |
404753 | 1 | |
401416 | 1 | |
401126 | 1 | |
401099 | 1 | |
400983 | 1 | |
400963 | 1 | |
400945 | 1 | |
400939 | 1 |
지역Y좌표
Real number (ℝ)
Distinct | 3074 |
---|---|
Distinct (%) | 61.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 539527.65 |
Minimum | 258801 |
---|---|
Maximum | 627144 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 44.4 KiB |
Quantile statistics
Minimum | 258801 |
---|---|
5-th percentile | 507028 |
Q1 | 521113.5 |
median | 540581 |
Q3 | 565535 |
95-th percentile | 578138.35 |
Maximum | 627144 |
Range | 368343 |
Interquartile range (IQR) | 44421.5 |
Descriptive statistics
Standard deviation | 37928.863 |
---|---|
Coefficient of variation (CV) | 0.070300129 |
Kurtosis | 20.950014 |
Mean | 539527.65 |
Median Absolute Deviation (MAD) | 19601.5 |
Skewness | -3.5706326 |
Sum | 2.7159822 × 109 |
Variance | 1.4385987 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
565532 | 12 | 0.2% |
565737 | 12 | 0.2% |
565615 | 12 | 0.2% |
565717 | 10 | 0.2% |
551278 | 10 | 0.2% |
565647 | 10 | 0.2% |
565848 | 10 | 0.2% |
551281 | 10 | 0.2% |
565713 | 8 | 0.2% |
541078 | 8 | 0.2% |
Other values (3064) | 4932 |
Value | Count | Frequency (%) |
258801 | 1 | |
286306 | 1 | |
286307 | 1 | |
286308 | 1 | |
286309 | 1 | |
286310 | 1 | |
286311 | 1 | |
286312 | 1 | |
286313 | 1 | |
286314 | 1 |
Value | Count | Frequency (%) |
627144 | 1 | |
627140 | 1 | |
627079 | 1 | |
627043 | 1 | |
626967 | 1 | |
626966 | 1 | |
626953 | 1 | |
626813 | 1 | |
626165 | 1 | |
626163 | 1 |
국가지점번호
Text
Distinct | 2707 |
---|---|
Distinct (%) | 53.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.5 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 50340 |
---|---|
Distinct characters | 15 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 1470 ? |
---|---|
Unique (%) | 29.2% |
Sample
1st row | 라사34058610 |
---|---|
2nd row | 라사34058610 |
3rd row | 라사34058610 |
4th row | 라사34058610 |
5th row | 라사31588267 |
Value | Count | Frequency (%) |
다라33948700 | 21 | 0.4% |
다사75486555 | 20 | 0.4% |
다사76056572 | 18 | 0.4% |
다사76046576 | 14 | 0.3% |
다사75446574 | 14 | 0.3% |
다사90925143 | 13 | 0.3% |
다사75366556 | 12 | 0.2% |
다사90975132 | 11 | 0.2% |
다사90915142 | 10 | 0.2% |
다사90935136 | 10 | 0.2% |
Other values (2697) | 4891 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 5346 | |
4 | 5137 | |
사 | 4886 | |
다 | 4681 | |
1 | 4450 | |
6 | 4230 | |
9 | 3965 | |
7 | 3956 | |
3 | 3801 | |
2 | 3267 | |
Other values (5) | 6621 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 40272 | |
Other Letter | 10068 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 5346 | |
4 | 5137 | |
1 | 4450 | |
6 | 4230 | |
9 | 3965 | |
7 | 3956 | |
3 | 3801 | |
2 | 3267 | |
0 | 3240 | |
8 | 2880 |
Other Letter
Value | Count | Frequency (%) |
사 | 4886 | |
다 | 4681 | |
라 | 337 | 3.3% |
마 | 108 | 1.1% |
아 | 56 | 0.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40272 | |
Hangul | 10068 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 5346 | |
4 | 5137 | |
1 | 4450 | |
6 | 4230 | |
9 | 3965 | |
7 | 3956 | |
3 | 3801 | |
2 | 3267 | |
0 | 3240 | |
8 | 2880 |
Hangul
Value | Count | Frequency (%) |
사 | 4886 | |
다 | 4681 | |
라 | 337 | 3.3% |
마 | 108 | 1.1% |
아 | 56 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40272 | |
Hangul | 10068 | 20.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 5346 | |
4 | 5137 | |
1 | 4450 | |
6 | 4230 | |
9 | 3965 | |
7 | 3956 | |
3 | 3801 | |
2 | 3267 | |
0 | 3240 | |
8 | 2880 |
Hangul
Value | Count | Frequency (%) |
사 | 4886 | |
다 | 4681 | |
라 | 337 | 3.3% |
마 | 108 | 1.1% |
아 | 56 | 0.6% |
법정동코드
Text
Distinct | 132 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.5 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9992054 |
Min length | 6 |
Characters and Unicode
Total characters | 50336 |
---|---|
Distinct characters | 13 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 21 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 5111031027 |
---|---|
2nd row | 5111031027 |
3rd row | 5111031027 |
4th row | 5111031027 |
5th row | 5111031030 |
Value | Count | Frequency (%) |
4183033024 | 727 | |
2820010400 | 480 | 9.5% |
4136026223 | 412 | 8.2% |
4163010800 | 351 | 7.0% |
4121010400 | 327 | 6.5% |
4111314100 | 307 | 6.1% |
4111313400 | 280 | 5.6% |
4121010600 | 270 | 5.4% |
4146125627 | 226 | 4.5% |
4111113900 | 150 | 3.0% |
Other values (122) | 1504 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 13825 | |
1 | 11904 | |
4 | 6881 | |
3 | 5755 | |
2 | 5226 | 10.4% |
6 | 2558 | 5.1% |
8 | 1819 | 3.6% |
5 | 1226 | 2.4% |
7 | 611 | 1.2% |
9 | 527 | 1.0% |
Other values (3) | 4 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 50332 | |
Lowercase Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 13825 | |
1 | 11904 | |
4 | 6881 | |
3 | 5755 | |
2 | 5226 | 10.4% |
6 | 2558 | 5.1% |
8 | 1819 | 3.6% |
5 | 1226 | 2.4% |
7 | 611 | 1.2% |
9 | 527 | 1.0% |
Lowercase Letter
Value | Count | Frequency (%) |
l | 2 | |
n | 1 | |
u | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 50332 | |
Latin | 4 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 13825 | |
1 | 11904 | |
4 | 6881 | |
3 | 5755 | |
2 | 5226 | 10.4% |
6 | 2558 | 5.1% |
8 | 1819 | 3.6% |
5 | 1226 | 2.4% |
7 | 611 | 1.2% |
9 | 527 | 1.0% |
Latin
Value | Count | Frequency (%) |
l | 2 | |
n | 1 | |
u | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 50336 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 13825 | |
1 | 11904 | |
4 | 6881 | |
3 | 5755 | |
2 | 5226 | 10.4% |
6 | 2558 | 5.1% |
8 | 1819 | 3.6% |
5 | 1226 | 2.4% |
7 | 611 | 1.2% |
9 | 527 | 1.0% |
Other values (3) | 4 | < 0.1% |
PNU코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 392 |
---|---|
Distinct (%) | 7.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.0587638 × 1018 |
Minimum | 2.6290106 × 1018 |
---|---|
Maximum | 5.181035 × 1018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 44.4 KiB |
Quantile statistics
Minimum | 2.6290106 × 1018 |
---|---|
5-th percentile | 2.8200104 × 1018 |
Q1 | 4.1113141 × 1018 |
median | 4.1360262 × 1018 |
Q3 | 4.1630108 × 1018 |
95-th percentile | 5.1401802 × 1018 |
Maximum | 5.181035 × 1018 |
Range | 2.5520244 × 1018 |
Interquartile range (IQR) | 5.16967 × 1016 |
Descriptive statistics
Standard deviation | 4.8878039 × 1017 |
---|---|
Coefficient of variation (CV) | 0.12042593 |
Kurtosis | 2.8565961 |
Mean | 4.0587638 × 1018 |
Median Absolute Deviation (MAD) | 2.4712823 × 1016 |
Skewness | -1.0627132 |
Sum | -7.1753251 × 1018 |
Variance | 2.3890627 × 1035 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4183033024200510000 | 724 | 14.4% |
4136026223200500000 | 409 | 8.1% |
2820010400200010001 | 234 | 4.6% |
4111313400200230000 | 192 | 3.8% |
2820010400200040002 | 179 | 3.6% |
4111314100200530000 | 154 | 3.1% |
4146125627200740008 | 146 | 2.9% |
4121010600200310006 | 104 | 2.1% |
4163010800200480000 | 103 | 2.0% |
4159013200200800000 | 100 | 2.0% |
Other values (382) | 2689 |
Value | Count | Frequency (%) |
2629010600200530001 | 5 | 0.1% |
2629010600200530019 | 2 | < 0.1% |
2629010600200530032 | 4 | 0.1% |
2632010300105780000 | 1 | < 0.1% |
2632010400201290009 | 1 | < 0.1% |
2632010500200120000 | 2 | < 0.1% |
2820010400100010000 | 1 | < 0.1% |
2820010400100760000 | 2 | < 0.1% |
2820010400200010001 | 234 | |
2820010400200010002 | 4 | 0.1% |
Value | Count | Frequency (%) |
5181035023200010000 | 2 | < 0.1% |
5181035022201560001 | 2 | < 0.1% |
5181035021203050017 | 1 | < 0.1% |
5181035021203050000 | 2 | < 0.1% |
5181033025200710125 | 1 | < 0.1% |
5181033024202040000 | 4 | |
5181033023203190001 | 8 | |
5181032024202620012 | 6 | |
5181032024202590000 | 1 | < 0.1% |
5181032024200120062 | 1 | < 0.1% |
조사일자
Date
Distinct | 90 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.5 KiB |
Minimum | 2017-05-22 00:00:00 |
---|---|
Maximum | 2021-09-02 00:00:00 |
지역X좌표 | 지역Y좌표 | PNU코드 | 조사일자 | |
---|---|---|---|---|
지역X좌표 | 1.000 | 0.688 | 0.731 | 0.912 |
지역Y좌표 | 0.688 | 1.000 | 0.887 | 0.983 |
PNU코드 | 0.731 | 0.887 | 1.000 | 0.984 |
조사일자 | 0.912 | 0.983 | 0.984 | 1.000 |
지역X좌표 | 지역Y좌표 | PNU코드 | |
---|---|---|---|
지역X좌표 | 1.000 | 0.261 | 0.803 |
지역Y좌표 | 0.261 | 1.000 | 0.463 |
PNU코드 | 0.803 | 0.463 | 1.000 |
지역X좌표 | 지역Y좌표 | 국가지점번호 | 법정동코드 | PNU코드 | 조사일자 | |
---|---|---|---|---|---|---|
0 | 278061 | 586400 | 라사34058610 | 5111031027 | 5111031027200010021 | 2017-05-22 |
1 | 278061 | 586400 | 라사34058610 | 5111031027 | 5111031027200010021 | 2017-05-22 |
2 | 278061 | 586400 | 라사34058610 | 5111031027 | 5111031027200010021 | 2017-05-22 |
3 | 278061 | 586400 | 라사34058610 | 5111031027 | 5111031027200010021 | 2017-05-22 |
4 | 275605 | 582957 | 라사31588267 | 5111031030 | 5111031030200530000 | 2017-05-22 |
5 | 254513 | 568326 | 라사10426816 | 5111034027 | 5111034027200040000 | 2017-05-22 |
6 | 254513 | 568326 | 라사10426816 | 5111034027 | 5111034027200040000 | 2017-05-22 |
7 | 254513 | 568326 | 라사10426816 | 5111034027 | 5111034027200040000 | 2017-05-22 |
8 | 254513 | 568326 | 라사10426816 | 5111034027 | 5111034027200040000 | 2017-05-22 |
9 | 254513 | 568326 | 라사10426816 | 5111034027 | 5111034027200040000 | 2017-05-22 |
지역X좌표 | 지역Y좌표 | 국가지점번호 | 법정동코드 | PNU코드 | 조사일자 | |
---|---|---|---|---|---|---|
5024 | 333935 | 535965 | 라사89633539 | 5177025028 | 5177025028200020001 | 2021-08-30 |
5025 | 334045 | 535748 | 라사89743518 | 5177025028 | 5177025028200020001 | 2021-08-31 |
5026 | 391682 | 517126 | 마사47251626 | 5123033036 | 5123033036201040000 | 2021-09-01 |
5027 | 391682 | 517126 | 마사47251626 | 5123033036 | 5123033036201040000 | 2021-09-01 |
5028 | 338177 | 535401 | 라사93863481 | 5177025028 | 5177025028200020001 | 2021-09-01 |
5029 | 336980 | 535164 | 라사92673458 | 5177025028 | 5177025028200020001 | 2021-09-01 |
5030 | 337745 | 535032 | 라사93433444 | 5177025029 | 5177025029204000000 | 2021-09-01 |
5031 | 337743 | 535041 | 라사93433445 | 5177025029 | 5177025029204000000 | 2021-09-01 |
5032 | 338642 | 536254 | 라사94333566 | 5177025028 | 5177025028200020001 | 2021-09-02 |
5033 | 342176 | 538215 | 라사97883760 | 5177025029 | 5177025029200010000 | 2021-09-02 |
Most frequently occurring
지역X좌표 | 지역Y좌표 | 국가지점번호 | 법정동코드 | PNU코드 | 조사일자 | # duplicates | |
---|---|---|---|---|---|---|---|
26 | 179463 | 286665 | 다라33948700 | 2920010600 | 2920010600200670021 | 2018-07-13 | 7 |
27 | 179464 | 286666 | 다라33948700 | 2920010600 | 2920010600200670021 | 2018-07-13 | 7 |
28 | 179465 | 286667 | 다라33948700 | 2920010600 | 2920010600200670021 | 2018-07-13 | 7 |
29 | 179466 | 286668 | 다라33948701 | 2920010600 | 2920010600200670021 | 2018-07-13 | 7 |
717 | 254513 | 568326 | 라사10426816 | 5111034027 | 5111034027200040000 | 2017-05-22 | 5 |
268 | 211870 | 512149 | 다사67491223 | 4159013200 | 4159013200200800000 | 2018-07-17 | 4 |
285 | 212003 | 528807 | 다사67712888 | 4113510200 | 4113510200200660000 | 2018-07-30 | 4 |
510 | 220144 | 565700 | 다사76056572 | 4136026222 | 4136026222201160000 | 2018-09-13 | 4 |
626 | 222694 | 506346 | 다사78280638 | 4146125627 | 4146125627200740008 | 2018-02-08 | 4 |
647 | 224185 | 507647 | 다사79780767 | 4146125627 | 4146125627200660001 | 2018-05-20 | 4 |