Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.7 KiB
Average record size in memory79.3 B

Variable types

Numeric3
Text1
Categorical5

Alerts

시도코드 has constant value ""Constant
시도명 has constant value ""Constant
지하차도/교량/터널 has constant value ""Constant
일반도로/고속도로 is highly overall correlated with 도로High correlation
도로 is highly overall correlated with 일반도로/고속도로High correlation
시군구코드 is highly imbalanced (91.9%)Imbalance
시군구명 is highly imbalanced (91.9%)Imbalance
아이디 has unique valuesUnique
격자번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 11:37:59.454794
Analysis finished2023-12-10 11:38:01.459001
Duration2 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

아이디
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:38:01.582697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T20:38:01.827539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

격자번호
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:38:02.301526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters600
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row라사9797
2nd row라사5134
3rd row라사4218
4th row라사3314
5th row라사5235
ValueCountFrequency (%)
라사9797 1
 
1.0%
라사3416 1
 
1.0%
라사4832 1
 
1.0%
라사5022 1
 
1.0%
라사3007 1
 
1.0%
라사5135 1
 
1.0%
라사5321 1
 
1.0%
라사3110 1
 
1.0%
라사4532 1
 
1.0%
라사6020 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T20:38:02.922420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
100
16.7%
100
16.7%
3 78
13.0%
1 62
10.3%
2 61
10.2%
4 42
7.0%
5 37
 
6.2%
8 30
 
5.0%
0 29
 
4.8%
9 23
 
3.8%
Other values (2) 38
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 400
66.7%
Other Letter 200
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%
Other Letter
ValueCountFrequency (%)
100
50.0%
100
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 400
66.7%
Hangul 200
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%
Hangul
ValueCountFrequency (%)
100
50.0%
100
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 400
66.7%
Hangul 200
33.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
100
50.0%
100
50.0%
ASCII
ValueCountFrequency (%)
3 78
19.5%
1 62
15.5%
2 61
15.2%
4 42
10.5%
5 37
9.2%
8 30
 
7.5%
0 29
 
7.2%
9 23
 
5.8%
6 22
 
5.5%
7 16
 
4.0%

시도코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
42
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row42
2nd row42
3rd row42
4th row42
5th row42

Common Values

ValueCountFrequency (%)
42 100
100.0%

Length

2023-12-10T20:38:03.099552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:38:03.228260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
42 100
100.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
강원
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원
2nd row강원
3rd row강원
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
강원 100
100.0%

Length

2023-12-10T20:38:03.372383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:38:03.520062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강원 100
100.0%

시군구코드
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
42130
99 
42830
 
1

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row42830
2nd row42130
3rd row42130
4th row42130
5th row42130

Common Values

ValueCountFrequency (%)
42130 99
99.0%
42830 1
 
1.0%

Length

2023-12-10T20:38:03.654925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:38:03.791601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
42130 99
99.0%
42830 1
 
1.0%

시군구명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
원주시
99 
양양군
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row양양군
2nd row원주시
3rd row원주시
4th row원주시
5th row원주시

Common Values

ValueCountFrequency (%)
원주시 99
99.0%
양양군 1
 
1.0%

Length

2023-12-10T20:38:03.944935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:38:04.090192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
원주시 99
99.0%
양양군 1
 
1.0%

일반도로/고속도로
Real number (ℝ)

HIGH CORRELATION 

Distinct19
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-80.807499
Minimum-99
Maximum5.9174027
Zeros0
Zeros (%)0.0%
Negative82
Negative (%)82.0%
Memory size1.0 KiB
2023-12-10T20:38:04.240979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-99
5-th percentile-99
Q1-99
median-99
Q3-99
95-th percentile2.6478427
Maximum5.9174027
Range104.9174
Interquartile range (IQR)0

Descriptive statistics

Standard deviation39.029468
Coefficient of variation (CV)-0.48299315
Kurtosis0.88233523
Mean-80.807499
Median Absolute Deviation (MAD)0
Skewness1.6921978
Sum-8080.7499
Variance1523.2994
MonotonicityNot monotonic
2023-12-10T20:38:04.435683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
-99.0 82
82.0%
0.690722156 1
 
1.0%
3.093091592 1
 
1.0%
1.346806039 1
 
1.0%
0.732729086 1
 
1.0%
1.441748977 1
 
1.0%
1.67792465 1
 
1.0%
5.917402691 1
 
1.0%
3.405978753 1
 
1.0%
1.205961474 1
 
1.0%
Other values (9) 9
 
9.0%
ValueCountFrequency (%)
-99.0 82
82.0%
0.073570782 1
 
1.0%
0.513459507 1
 
1.0%
0.690722156 1
 
1.0%
0.732729086 1
 
1.0%
1.161543622 1
 
1.0%
1.205961474 1
 
1.0%
1.346806039 1
 
1.0%
1.441748977 1
 
1.0%
1.67792465 1
 
1.0%
ValueCountFrequency (%)
5.917402691 1
1.0%
3.405978753 1
1.0%
3.093091592 1
1.0%
3.081880243 1
1.0%
2.688848183 1
1.0%
2.645684537 1
1.0%
2.579619779 1
1.0%
2.514509196 1
1.0%
2.47863014 1
1.0%
1.67792465 1
1.0%

지하차도/교량/터널
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-99
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-99
2nd row-99
3rd row-99
4th row-99
5th row-99

Common Values

ValueCountFrequency (%)
-99 100
100.0%

Length

2023-12-10T20:38:04.616357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:38:04.725528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
99 100
100.0%

도로
Real number (ℝ)

HIGH CORRELATION 

Distinct19
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-80.807499
Minimum-99
Maximum5.9174027
Zeros0
Zeros (%)0.0%
Negative82
Negative (%)82.0%
Memory size1.0 KiB
2023-12-10T20:38:04.858743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-99
5-th percentile-99
Q1-99
median-99
Q3-99
95-th percentile2.6478427
Maximum5.9174027
Range104.9174
Interquartile range (IQR)0

Descriptive statistics

Standard deviation39.029468
Coefficient of variation (CV)-0.48299315
Kurtosis0.88233523
Mean-80.807499
Median Absolute Deviation (MAD)0
Skewness1.6921978
Sum-8080.7499
Variance1523.2994
MonotonicityNot monotonic
2023-12-10T20:38:05.029195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
-99.0 82
82.0%
0.690722156 1
 
1.0%
3.093091592 1
 
1.0%
1.346806039 1
 
1.0%
0.732729086 1
 
1.0%
1.441748977 1
 
1.0%
1.67792465 1
 
1.0%
5.917402691 1
 
1.0%
3.405978753 1
 
1.0%
1.205961474 1
 
1.0%
Other values (9) 9
 
9.0%
ValueCountFrequency (%)
-99.0 82
82.0%
0.073570782 1
 
1.0%
0.513459507 1
 
1.0%
0.690722156 1
 
1.0%
0.732729086 1
 
1.0%
1.161543622 1
 
1.0%
1.205961474 1
 
1.0%
1.346806039 1
 
1.0%
1.441748977 1
 
1.0%
1.67792465 1
 
1.0%
ValueCountFrequency (%)
5.917402691 1
1.0%
3.405978753 1
1.0%
3.093091592 1
1.0%
3.081880243 1
1.0%
2.688848183 1
1.0%
2.645684537 1
1.0%
2.579619779 1
1.0%
2.514509196 1
1.0%
2.47863014 1
1.0%
1.67792465 1
1.0%

Interactions

2023-12-10T20:38:00.637961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:59.798764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:38:00.241766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:38:00.776604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:37:59.929902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:38:00.367526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:38:00.920519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:38:00.080652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:38:00.502840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:38:05.158460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디격자번호시군구코드시군구명일반도로/고속도로도로
아이디1.0001.0000.0410.0410.6850.685
격자번호1.0001.0001.0001.0001.0001.000
시군구코드0.0411.0001.0000.6930.0000.000
시군구명0.0411.0000.6931.0000.0000.000
일반도로/고속도로0.6851.0000.0000.0001.0000.999
도로0.6851.0000.0000.0000.9991.000
2023-12-10T20:38:05.311716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구코드시군구명
시군구코드1.0000.487
시군구명0.4871.000
2023-12-10T20:38:05.444093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디일반도로/고속도로도로시군구코드시군구명
아이디1.0000.0250.0250.0000.000
일반도로/고속도로0.0251.0001.0000.0000.000
도로0.0251.0001.0000.0000.000
시군구코드0.0000.0000.0001.0000.487
시군구명0.0000.0000.0000.4871.000

Missing values

2023-12-10T20:38:01.117335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:38:01.360859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이디격자번호시도코드시도명시군구코드시군구명일반도로/고속도로지하차도/교량/터널도로
01라사979742강원42830양양군-99.0-99-99.0
12라사513442강원42130원주시-99.0-99-99.0
23라사421842강원42130원주시-99.0-99-99.0
34라사331442강원42130원주시-99.0-99-99.0
45라사523542강원42130원주시-99.0-99-99.0
56라사293642강원42130원주시-99.0-99-99.0
67라사252942강원42130원주시-99.0-99-99.0
78라사461742강원42130원주시-99.0-99-99.0
89라사260642강원42130원주시-99.0-99-99.0
910라사301242강원42130원주시-99.0-99-99.0
아이디격자번호시도코드시도명시군구코드시군구명일반도로/고속도로지하차도/교량/터널도로
9091라사311942강원42130원주시-99.0-99-99.0
9192라사281542강원42130원주시-99.0-99-99.0
9293라사482342강원42130원주시-99.0-99-99.0
9394라사483142강원42130원주시-99.0-99-99.0
9495라사463042강원42130원주시-99.0-99-99.0
9596라사310842강원42130원주시-99.0-99-99.0
9697라사551942강원42130원주시-99.0-99-99.0
9798라사513042강원42130원주시-99.0-99-99.0
9899라사411842강원42130원주시-99.0-99-99.0
99100라사283542강원42130원주시3.093092-993.093092