Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory70.3 B

Variable types

Categorical3
Text1
Numeric4

Alerts

시군명 has constant value ""Constant
최소값 has constant value ""Constant
금년(이번년) is highly overall correlated with 평균값High correlation
평균값 is highly overall correlated with 금년(이번년) and 1 other fieldsHigh correlation
최고값 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
시도명 is highly overall correlated with 최고값High correlation
금년(이번년) has unique valuesUnique
평균값 has unique valuesUnique
최고값 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:49:50.622579
Analysis finished2023-12-10 12:49:53.083036
Duration2.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
경기도
31 
경상북도
23 
강원도
18 
경상남도
18 
전라남도
10 

Length

Max length4
Median length4
Mean length3.51
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row강원도
3rd row강원도
4th row강원도
5th row강원도

Common Values

ValueCountFrequency (%)
경기도 31
31.0%
경상북도 23
23.0%
강원도 18
18.0%
경상남도 18
18.0%
전라남도 10
 
10.0%

Length

2023-12-10T21:49:53.184502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:49:53.335113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 31
31.0%
경상북도 23
23.0%
강원도 18
18.0%
경상남도 18
18.0%
전라남도 10
 
10.0%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2023-12-10T21:49:53.458586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:49:53.561297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

코드
Text

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T21:49:53.859503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.03
Min length3

Characters and Unicode

Total characters303
Distinct characters85
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st row정선군
2nd row평창군
3rd row영월군
4th row횡성군
5th row홍천군
ValueCountFrequency (%)
고성군 2
 
2.0%
경산시 1
 
1.0%
김해시 1
 
1.0%
영천시 1
 
1.0%
영주시 1
 
1.0%
구미시 1
 
1.0%
안동시 1
 
1.0%
경주시 1
 
1.0%
포항시 1
 
1.0%
김천시 1
 
1.0%
Other values (89) 89
89.0%
2023-12-10T21:49:54.398735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 303
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 303
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 303
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

금년(이번년)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114.60206
Minimum60.5
Maximum221.92951
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:49:54.591301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum60.5
5-th percentile74.703373
Q194.885744
median110.36784
Q3126.6613
95-th percentile184.34941
Maximum221.92951
Range161.42951
Interquartile range (IQR)31.775557

Descriptive statistics

Standard deviation30.939534
Coefficient of variation (CV)0.26997363
Kurtosis2.190977
Mean114.60206
Median Absolute Deviation (MAD)15.595953
Skewness1.3012839
Sum11460.206
Variance957.25473
MonotonicityNot monotonic
2023-12-10T21:49:54.781901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
140.927993275819 1
 
1.0%
140.692099357819 1
 
1.0%
109.53568206826 1
 
1.0%
103.646954622156 1
 
1.0%
105.758612307171 1
 
1.0%
90.7699885599637 1
 
1.0%
105.13054883384 1
 
1.0%
114.591355131574 1
 
1.0%
120.631930258131 1
 
1.0%
132.119046429705 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
60.5 1
1.0%
64.9652205154471 1
1.0%
69.2288750384212 1
1.0%
71.9 1
1.0%
72.9 1
1.0%
74.7982877654823 1
1.0%
75.0887971388493 1
1.0%
77.031695934195 1
1.0%
77.3381396212352 1
1.0%
80.2741899128098 1
1.0%
ValueCountFrequency (%)
221.929511979665 1
1.0%
207.992648963842 1
1.0%
205.886169725912 1
1.0%
196.472710770453 1
1.0%
189.088282188608 1
1.0%
184.1 1
1.0%
168.063605774522 1
1.0%
167.7 1
1.0%
165.500095262587 1
1.0%
156.583987565542 1
1.0%

평균값
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.593294
Minimum16.753105
Maximum72.432812
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:49:54.973690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16.753105
5-th percentile19.152437
Q122.010307
median24.951727
Q334.923521
95-th percentile42.114619
Maximum72.432812
Range55.679708
Interquartile range (IQR)12.913213

Descriptive statistics

Standard deviation8.8960746
Coefficient of variation (CV)0.3111245
Kurtosis4.7502529
Mean28.593294
Median Absolute Deviation (MAD)4.2984816
Skewness1.6353815
Sum2859.3294
Variance79.140143
MonotonicityNot monotonic
2023-12-10T21:49:55.159464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
24.6737001353741 1
 
1.0%
40.6929159119169 1
 
1.0%
28.2462642031572 1
 
1.0%
26.7603212221868 1
 
1.0%
24.0499661424469 1
 
1.0%
24.4407324173442 1
 
1.0%
21.5880557504269 1
 
1.0%
30.560377483777 1
 
1.0%
32.2164898873449 1
 
1.0%
29.0007451856539 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
16.7531046458196 1
1.0%
17.4275787598661 1
1.0%
18.5013990311373 1
1.0%
18.7000555182564 1
1.0%
18.72965675796 1
1.0%
19.1746881421417 1
1.0%
19.5240263966081 1
1.0%
19.589333145239 1
1.0%
19.9959659925185 1
1.0%
20.3850887055296 1
1.0%
ValueCountFrequency (%)
72.4328125 1
1.0%
50.9113240554982 1
1.0%
47.58432739286 1
1.0%
44.4186640404003 1
1.0%
43.9638656555776 1
1.0%
42.0172904718447 1
1.0%
40.8657642484189 1
1.0%
40.8460273704926 1
1.0%
40.768721765079 1
1.0%
40.6929159119169 1
1.0%

최고값
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44861.7
Minimum41110
Maximum48890
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:49:55.351839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum41110
5-th percentile41209
Q141625
median46795
Q347822.5
95-th percentile48840.5
Maximum48890
Range7780
Interquartile range (IQR)6197.5

Descriptive statistics

Standard deviation3052.4053
Coefficient of variation (CV)0.068040339
Kurtosis-1.8616898
Mean44861.7
Median Absolute Deviation (MAD)2090
Skewness-0.012394933
Sum4486170
Variance9317177.9
MonotonicityNot monotonic
2023-12-10T21:49:55.518277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
42770 1
 
1.0%
48240 1
 
1.0%
47250 1
 
1.0%
47230 1
 
1.0%
47210 1
 
1.0%
47190 1
 
1.0%
47170 1
 
1.0%
47130 1
 
1.0%
47110 1
 
1.0%
47150 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
41110 1
1.0%
41130 1
1.0%
41150 1
1.0%
41170 1
1.0%
41190 1
1.0%
41210 1
1.0%
41220 1
1.0%
41250 1
1.0%
41270 1
1.0%
41280 1
1.0%
ValueCountFrequency (%)
48890 1
1.0%
48880 1
1.0%
48870 1
1.0%
48860 1
1.0%
48850 1
1.0%
48840 1
1.0%
48820 1
1.0%
48740 1
1.0%
48730 1
1.0%
48720 1
1.0%

최소값
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 100
100.0%

Length

2023-12-10T21:49:55.661792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:49:55.772421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
100
100.0%

재현기간
Real number (ℝ)

Distinct87
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65.962
Minimum42.8
Maximum104.7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:49:55.916510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum42.8
5-th percentile51.795
Q159.325
median63.7
Q370.525
95-th percentile85.68
Maximum104.7
Range61.9
Interquartile range (IQR)11.2

Descriptive statistics

Standard deviation11.009204
Coefficient of variation (CV)0.16690222
Kurtosis2.5892729
Mean65.962
Median Absolute Deviation (MAD)5.2
Skewness1.2815962
Sum6596.2
Variance121.20258
MonotonicityNot monotonic
2023-12-10T21:49:56.389767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60.5 6
 
6.0%
63.7 3
 
3.0%
57.3 2
 
2.0%
67.5 2
 
2.0%
73.4 2
 
2.0%
67.7 2
 
2.0%
59.0 2
 
2.0%
63.1 2
 
2.0%
60.9 1
 
1.0%
54.0 1
 
1.0%
Other values (77) 77
77.0%
ValueCountFrequency (%)
42.8 1
1.0%
47.9 1
1.0%
49.2 1
1.0%
50.8 1
1.0%
51.7 1
1.0%
51.8 1
1.0%
53.1 1
1.0%
53.6 1
1.0%
54.0 1
1.0%
54.1 1
1.0%
ValueCountFrequency (%)
104.7 1
1.0%
103.0 1
1.0%
99.4 1
1.0%
94.9 1
1.0%
91.0 1
1.0%
85.4 1
1.0%
84.8 1
1.0%
81.6 1
1.0%
81.1 1
1.0%
78.4 1
1.0%

Interactions

2023-12-10T21:49:52.322265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:50.925909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.429467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.862690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:52.447646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.066595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.556395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.968611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:52.554307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.193488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.659651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:52.073381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:52.667759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.315461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:51.775629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:49:52.200044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:49:56.504875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명코드금년(이번년)평균값최고값재현기간
시도명1.0000.8370.7600.6670.9320.530
코드0.8371.0000.0000.9010.8321.000
금년(이번년)0.7600.0001.0000.8340.6140.627
평균값0.6670.9010.8341.0000.5850.733
최고값0.9320.8320.6140.5851.0000.203
재현기간0.5301.0000.6270.7330.2031.000
2023-12-10T21:49:56.628460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
금년(이번년)평균값최고값재현기간시도명
금년(이번년)1.0000.8540.4800.1690.404
평균값0.8541.0000.6800.2620.483
최고값0.4800.6801.000-0.0220.896
재현기간0.1690.262-0.0221.0000.239
시도명0.4040.4830.8960.2391.000

Missing values

2023-12-10T21:49:52.818598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:49:53.002534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군명코드금년(이번년)평균값최고값최소값재현기간
0강원도0정선군140.92799324.673742770-66.7
1강원도0평창군113.32874127.33032942760-64.7
2강원도0영월군94.8838721.75158742750-66.6
3강원도0횡성군91.06576123.17904842730-69.8
4강원도0홍천군84.39787321.60508642720-73.4
5강원도0삼척시127.14520131.13009642230-60.2
6강원도0양양군189.08828237.663642830-59.5
7강원도0고성군205.8861734.88074442820-75.3
8강원도0인제군93.46116721.78403842810-61.1
9강원도0양구군77.03169616.75310542800-63.7
시도명시군명코드금년(이번년)평균값최고값최소값재현기간
90전라남도0화순군121.434.37401746790-63.7
91전라남도0장흥군127.6891840.84602746800-85.4
92전라남도0강진군128.42321139.80647446810-81.1
93전라남도0해남군120.65629637.7599646820-66.0
94전라남도0영암군111.236.51920246830-67.3
95전라남도0무안군124.91407436.10027246840-58.0
96전라남도0함평군124.236.3832646860-74.8
97전라남도0영광군112.935.53350446870-67.7
98전라남도0장성군124.537.79385846880-73.1
99전라남도0완도군165.50009544.41866446890-71.7