Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.0 KiB
Average record size in memory61.3 B

Variable types

Numeric1
Text1
Categorical5

Alerts

시도코드 has constant value ""Constant
시도명 has constant value ""Constant
시군구코드 is highly imbalanced (91.9%)Imbalance
시군구명 is highly imbalanced (91.9%)Imbalance
의료복지 is highly imbalanced (91.9%)Imbalance
아이디 has unique valuesUnique
격자번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 11:37:39.345022
Analysis finished2023-12-10 11:37:40.021426
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

아이디
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:37:40.163935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T20:37:40.389267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

격자번호
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:37:40.827334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters600
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row라사9797
2nd row라사5134
3rd row라사4218
4th row라사3314
5th row라사5235
ValueCountFrequency (%)
라사9797 1
 
1.0%
라사3416 1
 
1.0%
라사4832 1
 
1.0%
라사5022 1
 
1.0%
라사3007 1
 
1.0%
라사5135 1
 
1.0%
라사5321 1
 
1.0%
라사3110 1
 
1.0%
라사4532 1
 
1.0%
라사6020 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T20:37:41.459528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
100
16.7%
100
16.7%
3 78
13.0%
1 62
10.3%
2 61
10.2%
4 42
7.0%
5 37
 
6.2%
8 30
 
5.0%
0 29
 
4.8%
9 23
 
3.8%
Other values (2) 38
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 400
66.7%
Other Letter 200
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%
Other Letter
ValueCountFrequency (%)
100
50.0%
100
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 400
66.7%
Hangul 200
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%
Hangul
ValueCountFrequency (%)
100
50.0%
100
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 400
66.7%
Hangul 200
33.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
100
50.0%
100
50.0%
ASCII
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%

시도코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
42
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row42
2nd row42
3rd row42
4th row42
5th row42

Common Values

ValueCountFrequency (%)
42 100
100.0%

Length

2023-12-10T20:37:41.690016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:41.843660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
42 100
100.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
강원
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원
2nd row강원
3rd row강원
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
강원 100
100.0%

Length

2023-12-10T20:37:41.999997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:42.155397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강원 100
100.0%

시군구코드
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
42130
99 
42830
 
1

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row42830
2nd row42130
3rd row42130
4th row42130
5th row42130

Common Values

ValueCountFrequency (%)
42130 99
99.0%
42830 1
 
1.0%

Length

2023-12-10T20:37:42.298562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:42.473415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
42130 99
99.0%
42830 1
 
1.0%

시군구명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
원주시
99 
양양군
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row양양군
2nd row원주시
3rd row원주시
4th row원주시
5th row원주시

Common Values

ValueCountFrequency (%)
원주시 99
99.0%
양양군 1
 
1.0%

Length

2023-12-10T20:37:42.640167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:42.765785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
원주시 99
99.0%
양양군 1
 
1.0%

의료복지
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-99
99 
1
 
1

Length

Max length3
Median length3
Mean length2.98
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row-99
2nd row-99
3rd row-99
4th row-99
5th row-99

Common Values

ValueCountFrequency (%)
-99 99
99.0%
1 1
 
1.0%

Length

2023-12-10T20:37:42.900073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:37:43.050096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
99 99
99.0%
1 1
 
1.0%

Interactions

2023-12-10T20:37:39.620754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:37:43.149539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디격자번호시군구코드시군구명의료복지
아이디1.0001.0000.0410.0410.041
격자번호1.0001.0001.0001.0001.000
시군구코드0.0411.0001.0000.6930.000
시군구명0.0411.0000.6931.0000.000
의료복지0.0411.0000.0000.0001.000
2023-12-10T20:37:43.317430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구코드시군구명의료복지
시군구코드1.0000.4870.000
시군구명0.4871.0000.000
의료복지0.0000.0001.000
2023-12-10T20:37:43.444005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디시군구코드시군구명의료복지
아이디1.0000.0000.0000.000
시군구코드0.0001.0000.4870.000
시군구명0.0000.4871.0000.000
의료복지0.0000.0000.0001.000

Missing values

2023-12-10T20:37:39.777060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:37:39.959299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이디격자번호시도코드시도명시군구코드시군구명의료복지
01라사979742강원42830양양군-99
12라사513442강원42130원주시-99
23라사421842강원42130원주시-99
34라사331442강원42130원주시-99
45라사523542강원42130원주시-99
56라사293642강원42130원주시-99
67라사252942강원42130원주시-99
78라사461742강원42130원주시-99
89라사260642강원42130원주시-99
910라사301242강원42130원주시-99
아이디격자번호시도코드시도명시군구코드시군구명의료복지
9091라사311942강원42130원주시-99
9192라사281542강원42130원주시-99
9293라사482342강원42130원주시-99
9394라사483142강원42130원주시-99
9495라사463042강원42130원주시-99
9596라사310842강원42130원주시-99
9697라사551942강원42130원주시-99
9798라사513042강원42130원주시-99
9899라사411842강원42130원주시-99
99100라사283542강원42130원주시-99