Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.7 KiB
Average record size in memory79.3 B

Variable types

Numeric4
Text1
Categorical4

Alerts

시도코드 has constant value ""Constant
시도명 has constant value ""Constant
총인구 is highly overall correlated with 취약인구 and 1 other fieldsHigh correlation
취약인구 is highly overall correlated with 총인구 and 1 other fieldsHigh correlation
생활 is highly overall correlated with 총인구 and 1 other fieldsHigh correlation
시군구코드 is highly imbalanced (91.9%)Imbalance
시군구명 is highly imbalanced (91.9%)Imbalance
아이디 has unique valuesUnique
격자번호 has unique valuesUnique
취약인구 has 3 (3.0%) zerosZeros

Reproduction

Analysis started2023-12-10 11:37:50.743732
Analysis finished2023-12-10 11:37:53.605594
Duration2.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

아이디
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:37:53.702932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T20:37:53.904840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

격자번호
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:37:54.381740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters600
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row라사9797
2nd row라사5134
3rd row라사4218
4th row라사3314
5th row라사5235
ValueCountFrequency (%)
라사9797 1
 
1.0%
라사3416 1
 
1.0%
라사4832 1
 
1.0%
라사5022 1
 
1.0%
라사3007 1
 
1.0%
라사5135 1
 
1.0%
라사5321 1
 
1.0%
라사3110 1
 
1.0%
라사4532 1
 
1.0%
라사6020 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T20:37:55.028624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
100
16.7%
100
16.7%
3 78
13.0%
1 62
10.3%
2 61
10.2%
4 42
7.0%
5 37
 
6.2%
8 30
 
5.0%
0 29
 
4.8%
9 23
 
3.8%
Other values (2) 38
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 400
66.7%
Other Letter 200
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%
Other Letter
ValueCountFrequency (%)
100
50.0%
100
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 400
66.7%
Hangul 200
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%
Hangul
ValueCountFrequency (%)
100
50.0%
100
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 400
66.7%
Hangul 200
33.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
100
50.0%
100
50.0%
ASCII
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%

시도코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
42
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row42
2nd row42
3rd row42
4th row42
5th row42

Common Values

ValueCountFrequency (%)
42 100
100.0%

Length

2023-12-10T20:37:55.261464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:55.410606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
42 100
100.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
강원
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원
2nd row강원
3rd row강원
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
강원 100
100.0%

Length

2023-12-10T20:37:55.545895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:55.690719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강원 100
100.0%

시군구코드
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
42130
99 
42830
 
1

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row42830
2nd row42130
3rd row42130
4th row42130
5th row42130

Common Values

ValueCountFrequency (%)
42130 99
99.0%
42830 1
 
1.0%

Length

2023-12-10T20:37:55.840975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:55.997406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
42130 99
99.0%
42830 1
 
1.0%

시군구명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
원주시
99 
양양군
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row양양군
2nd row원주시
3rd row원주시
4th row원주시
5th row원주시

Common Values

ValueCountFrequency (%)
원주시 99
99.0%
양양군 1
 
1.0%

Length

2023-12-10T20:37:56.158720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:56.313446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
원주시 99
99.0%
양양군 1
 
1.0%

총인구
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-75.28
Minimum-99
Maximum226
Zeros0
Zeros (%)0.0%
Negative85
Negative (%)85.0%
Memory size1.0 KiB
2023-12-10T20:37:56.459953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-99
5-th percentile-99
Q1-99
median-99
Q3-99
95-th percentile54.15
Maximum226
Range325
Interquartile range (IQR)0

Descriptive statistics

Standard deviation60.89104
Coefficient of variation (CV)-0.80886079
Kurtosis7.4396073
Mean-75.28
Median Absolute Deviation (MAD)0
Skewness2.7120854
Sum-7528
Variance3707.7188
MonotonicityNot monotonic
2023-12-10T20:37:57.012408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
-99 85
85.0%
23 2
 
2.0%
47 1
 
1.0%
128 1
 
1.0%
57 1
 
1.0%
118 1
 
1.0%
54 1
 
1.0%
72 1
 
1.0%
9 1
 
1.0%
50 1
 
1.0%
Other values (5) 5
 
5.0%
ValueCountFrequency (%)
-99 85
85.0%
9 1
 
1.0%
10 1
 
1.0%
14 1
 
1.0%
17 1
 
1.0%
23 2
 
2.0%
39 1
 
1.0%
47 1
 
1.0%
50 1
 
1.0%
54 1
 
1.0%
ValueCountFrequency (%)
226 1
1.0%
128 1
1.0%
118 1
1.0%
72 1
1.0%
57 1
1.0%
54 1
1.0%
50 1
1.0%
47 1
1.0%
39 1
1.0%
23 2
2.0%

취약인구
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct12
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-81.01
Minimum-99
Maximum58
Zeros3
Zeros (%)3.0%
Negative85
Negative (%)85.0%
Memory size1.0 KiB
2023-12-10T20:37:57.190457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-99
5-th percentile-99
Q1-99
median-99
Q3-99
95-th percentile24.1
Maximum58
Range157
Interquartile range (IQR)0

Descriptive statistics

Standard deviation43.609955
Coefficient of variation (CV)-0.53832805
Kurtosis2.7388855
Mean-81.01
Median Absolute Deviation (MAD)0
Skewness2.1075749
Sum-8101
Variance1901.8282
MonotonicityNot monotonic
2023-12-10T20:37:57.390937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
-99 85
85.0%
0 3
 
3.0%
26 2
 
2.0%
14 2
 
2.0%
45 1
 
1.0%
24 1
 
1.0%
58 1
 
1.0%
21 1
 
1.0%
16 1
 
1.0%
54 1
 
1.0%
Other values (2) 2
 
2.0%
ValueCountFrequency (%)
-99 85
85.0%
0 3
 
3.0%
6 1
 
1.0%
10 1
 
1.0%
14 2
 
2.0%
16 1
 
1.0%
21 1
 
1.0%
24 1
 
1.0%
26 2
 
2.0%
45 1
 
1.0%
ValueCountFrequency (%)
58 1
1.0%
54 1
1.0%
45 1
1.0%
26 2
2.0%
24 1
1.0%
21 1
1.0%
16 1
1.0%
14 2
2.0%
10 1
1.0%
6 1
1.0%

생활
Real number (ℝ)

HIGH CORRELATION 

Distinct16
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-72.14
Minimum-99
Maximum280
Zeros0
Zeros (%)0.0%
Negative85
Negative (%)85.0%
Memory size1.0 KiB
2023-12-10T20:37:57.587206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-99
5-th percentile-99
Q1-99
median-99
Q3-99
95-th percentile80.05
Maximum280
Range379
Interquartile range (IQR)0

Descriptive statistics

Standard deviation70.304679
Coefficient of variation (CV)-0.9745589
Kurtosis8.4101658
Mean-72.14
Median Absolute Deviation (MAD)0
Skewness2.8566526
Sum-7214
Variance4942.7479
MonotonicityNot monotonic
2023-12-10T20:37:57.786671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
-99 85
85.0%
64 1
 
1.0%
53 1
 
1.0%
33 1
 
1.0%
23 1
 
1.0%
14 1
 
1.0%
10 1
 
1.0%
280 1
 
1.0%
39 1
 
1.0%
73 1
 
1.0%
Other values (6) 6
 
6.0%
ValueCountFrequency (%)
-99 85
85.0%
9 1
 
1.0%
10 1
 
1.0%
14 1
 
1.0%
23 1
 
1.0%
33 1
 
1.0%
39 1
 
1.0%
53 1
 
1.0%
64 1
 
1.0%
73 1
 
1.0%
ValueCountFrequency (%)
280 1
1.0%
176 1
1.0%
173 1
1.0%
93 1
1.0%
81 1
1.0%
80 1
1.0%
73 1
1.0%
64 1
1.0%
53 1
1.0%
39 1
1.0%

Interactions

2023-12-10T20:37:52.626818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.091478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.614795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:52.110493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:52.774052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.214069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.735080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:52.241839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:52.917436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.346347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.870753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:52.372134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:53.084026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.473931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:51.997810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:52.495577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:37:57.946791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디격자번호시군구코드시군구명총인구취약인구생활
아이디1.0001.0000.0410.0410.5240.6490.456
격자번호1.0001.0001.0001.0001.0001.0001.000
시군구코드0.0411.0001.0000.6930.0000.0000.000
시군구명0.0411.0000.6931.0000.0000.0000.000
총인구0.5241.0000.0000.0001.0000.9941.000
취약인구0.6491.0000.0000.0000.9941.0000.957
생활0.4561.0000.0000.0001.0000.9571.000
2023-12-10T20:37:58.126394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구코드시군구명
시군구코드1.0000.487
시군구명0.4871.000
2023-12-10T20:37:58.275616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디총인구취약인구생활시군구코드시군구명
아이디1.000-0.013-0.014-0.0140.0000.000
총인구-0.0131.0000.9991.0000.0000.000
취약인구-0.0140.9991.0001.0000.0000.000
생활-0.0141.0001.0001.0000.0000.000
시군구코드0.0000.0000.0000.0001.0000.487
시군구명0.0000.0000.0000.0000.4871.000

Missing values

2023-12-10T20:37:53.273638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:37:53.502372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이디격자번호시도코드시도명시군구코드시군구명총인구취약인구생활
01라사979742강원42830양양군-99-99-99
12라사513442강원42130원주시-99-99-99
23라사421842강원42130원주시-99-99-99
34라사331442강원42130원주시-99-99-99
45라사523542강원42130원주시-99-99-99
56라사293642강원42130원주시-99-99-99
67라사252942강원42130원주시-99-99-99
78라사461742강원42130원주시-99-99-99
89라사260642강원42130원주시-99-99-99
910라사301242강원42130원주시-99-99-99
아이디격자번호시도코드시도명시군구코드시군구명총인구취약인구생활
9091라사311942강원42130원주시-99-99-99
9192라사281542강원42130원주시-99-99-99
9293라사482342강원42130원주시-99-99-99
9394라사483142강원42130원주시-99-99-99
9495라사463042강원42130원주시-99-99-99
9596라사310842강원42130원주시-99-99-99
9697라사551942강원42130원주시-99-99-99
9798라사513042강원42130원주시-99-99-99
9899라사411842강원42130원주시-99-99-99
99100라사283542강원42130원주시-99-99-99