Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory70.3 B

Variable types

Categorical3
Text1
Numeric4

Alerts

시군명 has constant value ""Constant
금년(이번년) is highly overall correlated with 평균값High correlation
평균값 is highly overall correlated with 금년(이번년) and 1 other fieldsHigh correlation
최고값 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
시도명 is highly overall correlated with 최고값High correlation
최소값 is highly imbalanced (71.4%)Imbalance
금년(이번년) has unique valuesUnique
평균값 has unique valuesUnique
최고값 has unique valuesUnique

Reproduction

Analysis started2024-04-21 09:31:02.637033
Analysis finished2024-04-21 09:31:07.969479
Duration5.33 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
경기도
31 
경상북도
23 
강원도
18 
경상남도
18 
전라남도
10 

Length

Max length4
Median length4
Mean length3.51
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row강원도
3rd row강원도
4th row강원도
5th row강원도

Common Values

ValueCountFrequency (%)
경기도 31
31.0%
경상북도 23
23.0%
강원도 18
18.0%
경상남도 18
18.0%
전라남도 10
 
10.0%

Length

2024-04-21T18:31:08.183622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T18:31:08.509794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 31
31.0%
경상북도 23
23.0%
강원도 18
18.0%
경상남도 18
18.0%
전라남도 10
 
10.0%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2024-04-21T18:31:08.878479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T18:31:09.164518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

코드
Text

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
2024-04-21T18:31:10.174062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.03
Min length3

Characters and Unicode

Total characters303
Distinct characters85
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st row정선군
2nd row평창군
3rd row영월군
4th row횡성군
5th row홍천군
ValueCountFrequency (%)
고성군 2
 
2.0%
경산시 1
 
1.0%
김해시 1
 
1.0%
영천시 1
 
1.0%
영주시 1
 
1.0%
구미시 1
 
1.0%
안동시 1
 
1.0%
경주시 1
 
1.0%
포항시 1
 
1.0%
김천시 1
 
1.0%
Other values (89) 89
89.0%
2024-04-21T18:31:11.672194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 303
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 303
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 303
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
17.8%
49
 
16.2%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (75) 130
42.9%

금년(이번년)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean567.75896
Minimum394.16753
Maximum1057.5334
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T18:31:12.078579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum394.16753
5-th percentile447.92013
Q1490.56751
median538.48592
Q3612.15522
95-th percentile753.904
Maximum1057.5334
Range663.36588
Interquartile range (IQR)121.5877

Descriptive statistics

Standard deviation114.8673
Coefficient of variation (CV)0.20231701
Kurtosis3.9725842
Mean567.75896
Median Absolute Deviation (MAD)60.036674
Skewness1.6864931
Sum56775.896
Variance13194.496
MonotonicityNot monotonic
2024-04-21T18:31:12.513549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
436.909544501153 1
 
1.0%
738.402991104716 1
 
1.0%
561.185647034184 1
 
1.0%
526.656718381288 1
 
1.0%
496.080554954141 1
 
1.0%
451.89410276433 1
 
1.0%
473.052962901171 1
 
1.0%
611.388216499358 1
 
1.0%
542.545990005092 1
 
1.0%
549.580369736142 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
394.167529031218 1
1.0%
399.455213451457 1
1.0%
424.000876936962 1
1.0%
436.909544501153 1
1.0%
442.602683254561 1
1.0%
448.2 1
1.0%
450.8 1
1.0%
451.89410276433 1
1.0%
453.429372303462 1
1.0%
454.516240996681 1
1.0%
ValueCountFrequency (%)
1057.53341258812 1
1.0%
974.1 1
1.0%
908.553244981868 1
1.0%
794.355994883617 1
1.0%
778.314634533799 1
1.0%
752.619231158211 1
1.0%
747.1 1
1.0%
738.402991104716 1
1.0%
731.72032836931 1
1.0%
723.118526530247 1
1.0%

평균값
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean249.36763
Minimum184.30518
Maximum462.53973
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T18:31:12.917515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum184.30518
5-th percentile200.14504
Q1210.69976
median228.44762
Q3270.0229
95-th percentile372.39134
Maximum462.53973
Range278.23455
Interquartile range (IQR)59.323139

Descriptive statistics

Standard deviation57.166014
Coefficient of variation (CV)0.22924392
Kurtosis3.0430164
Mean249.36763
Median Absolute Deviation (MAD)21.716372
Skewness1.7657391
Sum24936.763
Variance3267.9531
MonotonicityNot monotonic
2024-04-21T18:31:13.342669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
213.158688832852 1
 
1.0%
361.448027686256 1
 
1.0%
244.09909261962 1
 
1.0%
229.046361521852 1
 
1.0%
235.584891469085 1
 
1.0%
204.250464409322 1
 
1.0%
200.199183579118 1
 
1.0%
254.81011457411 1
 
1.0%
250.786064621752 1
 
1.0%
224.703860291813 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
184.305181400191 1
1.0%
193.946299656482 1
1.0%
198.278125 1
1.0%
198.928512682242 1
1.0%
199.116380181653 1
1.0%
200.199183579118 1
1.0%
200.643856219311 1
1.0%
201.921154216219 1
1.0%
203.751479286353 1
1.0%
204.250464409322 1
1.0%
ValueCountFrequency (%)
462.5397314445 1
1.0%
460.682541737958 1
1.0%
394.119373846713 1
1.0%
381.6796875 1
1.0%
376.724169801442 1
1.0%
372.163297268256 1
1.0%
366.553724983478 1
1.0%
361.448027686256 1
1.0%
344.97767309272 1
1.0%
342.06215431531 1
1.0%

최고값
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44861.7
Minimum41110
Maximum48890
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T18:31:13.751428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum41110
5-th percentile41209
Q141625
median46795
Q347822.5
95-th percentile48840.5
Maximum48890
Range7780
Interquartile range (IQR)6197.5

Descriptive statistics

Standard deviation3052.4053
Coefficient of variation (CV)0.068040339
Kurtosis-1.8616898
Mean44861.7
Median Absolute Deviation (MAD)2090
Skewness-0.012394933
Sum4486170
Variance9317177.9
MonotonicityNot monotonic
2024-04-21T18:31:14.200957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
42770 1
 
1.0%
48240 1
 
1.0%
47250 1
 
1.0%
47230 1
 
1.0%
47210 1
 
1.0%
47190 1
 
1.0%
47170 1
 
1.0%
47130 1
 
1.0%
47110 1
 
1.0%
47150 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
41110 1
1.0%
41130 1
1.0%
41150 1
1.0%
41170 1
1.0%
41190 1
1.0%
41210 1
1.0%
41220 1
1.0%
41250 1
1.0%
41270 1
1.0%
41280 1
1.0%
ValueCountFrequency (%)
48890 1
1.0%
48880 1
1.0%
48870 1
1.0%
48860 1
1.0%
48850 1
1.0%
48840 1
1.0%
48820 1
1.0%
48740 1
1.0%
48730 1
1.0%
48720 1
1.0%

최소값
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
-
95 
5
 
5

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 95
95.0%
5 5
 
5.0%

Length

2024-04-21T18:31:14.620582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T18:31:14.911586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
95
95.0%
5 5
 
5.0%

재현기간
Real number (ℝ)

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean364.833
Minimum264.7
Maximum519.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T18:31:15.236560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum264.7
5-th percentile279.645
Q1328.4
median371.6
Q3400.65
95-th percentile444.33
Maximum519.3
Range254.6
Interquartile range (IQR)72.25

Descriptive statistics

Standard deviation53.570588
Coefficient of variation (CV)0.14683592
Kurtosis-0.092912765
Mean364.833
Median Absolute Deviation (MAD)36.65
Skewness0.19623719
Sum36483.3
Variance2869.8079
MonotonicityNot monotonic
2024-04-21T18:31:15.667580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
401.1 2
 
2.0%
389.5 1
 
1.0%
352.3 1
 
1.0%
333.4 1
 
1.0%
306.2 1
 
1.0%
359.9 1
 
1.0%
276.7 1
 
1.0%
345.1 1
 
1.0%
308.4 1
 
1.0%
319.4 1
 
1.0%
Other values (89) 89
89.0%
ValueCountFrequency (%)
264.7 1
1.0%
271.0 1
1.0%
272.1 1
1.0%
275.4 1
1.0%
276.7 1
1.0%
279.8 1
1.0%
280.2 1
1.0%
282.6 1
1.0%
287.4 1
1.0%
289.1 1
1.0%
ValueCountFrequency (%)
519.3 1
1.0%
506.7 1
1.0%
472.8 1
1.0%
470.9 1
1.0%
446.8 1
1.0%
444.2 1
1.0%
441.1 1
1.0%
440.7 1
1.0%
432.2 1
1.0%
419.1 1
1.0%

Interactions

2024-04-21T18:31:06.330974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:03.134201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:04.315720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:05.295837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:06.573022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:03.567775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:04.561992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:05.554652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:06.811793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:03.810925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:04.801568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:05.812092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:07.068663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:04.077056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:05.061859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:31:06.082671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T18:31:15.931326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명코드금년(이번년)평균값최고값최소값재현기간
시도명1.0000.8370.6440.6700.9320.2600.569
코드0.8371.0000.9510.7450.8321.0000.824
금년(이번년)0.6440.9511.0000.8060.4980.3190.832
평균값0.6700.7450.8061.0000.5930.4180.817
최고값0.9320.8320.4980.5931.0000.2980.433
최소값0.2601.0000.3190.4180.2981.0000.251
재현기간0.5690.8240.8320.8170.4330.2511.000
2024-04-21T18:31:16.409305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
최소값시도명
최소값1.0000.312
시도명0.3121.000
2024-04-21T18:31:16.644686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
금년(이번년)평균값최고값재현기간시도명최소값
금년(이번년)1.0000.6160.2330.4100.3160.236
평균값0.6161.0000.6530.1230.4550.404
최고값0.2330.6531.000-0.1710.8960.211
재현기간0.4100.123-0.1711.0000.2710.180
시도명0.3160.4550.8960.2711.0000.312
최소값0.2360.4040.2110.1800.3121.000

Missing values

2024-04-21T18:31:07.396302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T18:31:07.813881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군명코드금년(이번년)평균값최고값최소값재현기간
0강원도0정선군436.909545213.15868942770-389.5
1강원도0평창군466.7235.94597742760-379.7
2강원도0영월군394.167529208.26195742750-380.6
3강원도0횡성군493.410556222.2227842730-380.8
4강원도0홍천군468.1215.59177942720-401.1
5강원도0삼척시478.358313234.47043742230-351.6
6강원도0양양군584.637449263.67542830-407.7
7강원도0고성군589.7239.82969642820-441.1
8강원도0인제군453.429372207.01435542810-416.1
9강원도0양구군399.455213184.30518142800-380.9
시도명시군명코드금년(이번년)평균값최고값최소값재현기간
90전라남도0화순군539.022193279.6841246790-332.6
91전라남도0장흥군634.916861344.97767346800-444.2
92전라남도0강진군635.61817334.84219646810-410.0
93전라남도0해남군571.899521301.50073446820-369.5
94전라남도0영암군552.665697280.8143646830-349.3
95전라남도0무안군587.634945265.7762146840-308.2
96전라남도0함평군602.504068275.30220246860-323.9
97전라남도0영광군644.967011278.188162468705298.5
98전라남도0장성군614.456214283.966811468805303.0
99전라남도0완도군715.900488376.7241746890-432.2