Overview

Dataset statistics

Number of variables12
Number of observations8381
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory835.0 KiB
Average record size in memory102.0 B

Variable types

Numeric6
Text4
Categorical2

Dataset

Description충청북도내 교통약자 이용 시설과 해당 이용 시설 주변의 사고다발지역 정보 교통약자 이용시설과 가까운 사고다발지역 정보 확인 사고다발지역의 GPS 위도/경도 확인 및 시설과 사고다발지역간의 거리 정보 확인
URLhttps://www.data.go.kr/data/15097136/fileData.do

Alerts

사고 종류 is highly overall correlated with 대상High correlation
대상 is highly overall correlated with 사고 종류High correlation
시설정보 에프아이디 is highly overall correlated with 위도High correlation
다발지역 에프아이디 is highly overall correlated with 다발지역 아이디High correlation
다발지역 아이디 is highly overall correlated with 다발지역 에프아이디High correlation
위도 is highly overall correlated with 시설정보 에프아이디High correlation

Reproduction

Analysis started2023-12-12 07:25:09.873342
Analysis finished2023-12-12 07:25:16.455409
Duration6.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설정보 에프아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct105
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43231056
Minimum43111101
Maximum43800253
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size73.8 KiB
2023-12-12T16:25:16.537788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum43111101
5-th percentile43111120
Q143112106
median43113320
Q343150101
95-th percentile43760250
Maximum43800253
Range689152
Interquartile range (IQR)37995

Descriptive statistics

Standard deviation242363.51
Coefficient of variation (CV)0.0056062363
Kurtosis0.86610631
Mean43231056
Median Absolute Deviation (MAD)2196
Skewness1.6866087
Sum3.6231948 × 1011
Variance5.8740072 × 1010
MonotonicityIncreasing
2023-12-12T16:25:16.710935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
43113114 525
 
6.3%
43111124 392
 
4.7%
43113113 370
 
4.4%
43112106 340
 
4.1%
43114103 300
 
3.6%
43111120 295
 
3.5%
43130105 255
 
3.0%
43114253 249
 
3.0%
43113115 225
 
2.7%
43130118 219
 
2.6%
Other values (95) 5211
62.2%
ValueCountFrequency (%)
43111101 30
 
0.4%
43111103 15
 
0.2%
43111108 20
 
0.2%
43111109 5
 
0.1%
43111110 10
 
0.1%
43111111 15
 
0.2%
43111112 50
0.6%
43111117 50
0.6%
43111118 25
 
0.3%
43111119 100
1.2%
ValueCountFrequency (%)
43800253 15
 
0.2%
43800250 99
1.2%
43770370 11
 
0.1%
43770340 95
1.1%
43770330 29
 
0.3%
43770253 141
1.7%
43770250 21
 
0.3%
43760350 6
 
0.1%
43760250 91
1.1%
43750370 75
0.9%
Distinct1930
Distinct (%)23.0%
Missing0
Missing (%)0.0%
Memory size65.6 KiB
2023-12-12T16:25:16.945916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length13
Mean length13
Min length13

Characters and Unicode

Total characters108953
Distinct characters19
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)1.8%

Sample

1st rowZID11e2422474
2nd rowZID11e2422474
3rd rowZID11e2422474
4th rowZID11e2422474
5th rowZID11e2422474
ValueCountFrequency (%)
zid11e2422474 5
 
0.1%
zidd35058e75f 5
 
0.1%
zidbe8fd66bfc 5
 
0.1%
zidbc50a087e4 5
 
0.1%
zidbc0f788434 5
 
0.1%
zidb90e7841d2 5
 
0.1%
zid9752c6b599 5
 
0.1%
zid959474755d 5
 
0.1%
zid8f2c71dbba 5
 
0.1%
zid8ed0b93792 5
 
0.1%
Other values (1920) 8331
99.4%
2023-12-12T16:25:17.284557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
Z 8381
 
7.7%
I 8381
 
7.7%
D 8381
 
7.7%
2 5452
 
5.0%
8 5416
 
5.0%
b 5401
 
5.0%
5 5329
 
4.9%
a 5312
 
4.9%
9 5294
 
4.9%
c 5290
 
4.9%
Other values (9) 46316
42.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 52389
48.1%
Lowercase Letter 31421
28.8%
Uppercase Letter 25143
23.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 5452
10.4%
8 5416
10.3%
5 5329
10.2%
9 5294
10.1%
0 5257
10.0%
6 5191
9.9%
3 5165
9.9%
7 5159
9.8%
4 5098
9.7%
1 5028
9.6%
Lowercase Letter
ValueCountFrequency (%)
b 5401
17.2%
a 5312
16.9%
c 5290
16.8%
e 5200
16.5%
f 5171
16.5%
d 5047
16.1%
Uppercase Letter
ValueCountFrequency (%)
Z 8381
33.3%
I 8381
33.3%
D 8381
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 56564
51.9%
Common 52389
48.1%

Most frequent character per script

Common
ValueCountFrequency (%)
2 5452
10.4%
8 5416
10.3%
5 5329
10.2%
9 5294
10.1%
0 5257
10.0%
6 5191
9.9%
3 5165
9.9%
7 5159
9.8%
4 5098
9.7%
1 5028
9.6%
Latin
ValueCountFrequency (%)
Z 8381
14.8%
I 8381
14.8%
D 8381
14.8%
b 5401
9.5%
a 5312
9.4%
c 5290
9.4%
e 5200
9.2%
f 5171
9.1%
d 5047
8.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 108953
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
Z 8381
 
7.7%
I 8381
 
7.7%
D 8381
 
7.7%
2 5452
 
5.0%
8 5416
 
5.0%
b 5401
 
5.0%
5 5329
 
4.9%
a 5312
 
4.9%
9 5294
 
4.9%
c 5290
 
4.9%
Other values (9) 46316
42.5%
Distinct1771
Distinct (%)21.1%
Missing0
Missing (%)0.0%
Memory size65.6 KiB
2023-12-12T16:25:17.546175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length7.8472736
Min length1

Characters and Unicode

Total characters65768
Distinct characters500
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)1.5%

Sample

1st row청주중학교
2nd row청주중학교
3rd row청주중학교
4th row청주중학교
5th row청주중학교
ValueCountFrequency (%)
어린이집 126
 
1.4%
꿈나무어린이집 26
 
0.3%
아이사랑어린이집 23
 
0.3%
아이뜰어린이집 18
 
0.2%
자연어린이집 17
 
0.2%
다솜어린이집 16
 
0.2%
샛별어린이집 15
 
0.2%
충청노인요양원 15
 
0.2%
음성군재가노인지원서비스센터 15
 
0.2%
수어통역센터 15
 
0.2%
Other values (1791) 8408
96.7%
2023-12-12T16:25:17.972931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4758
 
7.2%
4245
 
6.5%
4180
 
6.4%
4144
 
6.3%
1655
 
2.5%
1643
 
2.5%
1608
 
2.4%
1328
 
2.0%
1269
 
1.9%
1229
 
1.9%
Other values (490) 39709
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 64959
98.8%
Space Separator 313
 
0.5%
Uppercase Letter 229
 
0.3%
Decimal Number 123
 
0.2%
Close Punctuation 65
 
0.1%
Open Punctuation 55
 
0.1%
Other Symbol 7
 
< 0.1%
Lowercase Letter 7
 
< 0.1%
Letter Number 5
 
< 0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4758
 
7.3%
4245
 
6.5%
4180
 
6.4%
4144
 
6.4%
1655
 
2.5%
1643
 
2.5%
1608
 
2.5%
1328
 
2.0%
1269
 
2.0%
1229
 
1.9%
Other values (458) 38900
59.9%
Uppercase Letter
ValueCountFrequency (%)
Y 30
13.1%
A 25
10.9%
C 24
10.5%
S 20
8.7%
B 20
8.7%
L 15
6.6%
W 15
6.6%
E 15
6.6%
M 15
6.6%
K 13
 
5.7%
Other values (6) 37
16.2%
Decimal Number
ValueCountFrequency (%)
2 61
49.6%
1 21
 
17.1%
5 12
 
9.8%
3 11
 
8.9%
6 9
 
7.3%
4 5
 
4.1%
7 2
 
1.6%
8 2
 
1.6%
Lowercase Letter
ValueCountFrequency (%)
i 6
85.7%
s 1
 
14.3%
Space Separator
ValueCountFrequency (%)
313
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%
Open Punctuation
ValueCountFrequency (%)
( 55
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Letter Number
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
& 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 64966
98.8%
Common 561
 
0.9%
Latin 241
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4758
 
7.3%
4245
 
6.5%
4180
 
6.4%
4144
 
6.4%
1655
 
2.5%
1643
 
2.5%
1608
 
2.5%
1328
 
2.0%
1269
 
2.0%
1229
 
1.9%
Other values (459) 38907
59.9%
Latin
ValueCountFrequency (%)
Y 30
12.4%
A 25
10.4%
C 24
10.0%
S 20
8.3%
B 20
8.3%
L 15
 
6.2%
W 15
 
6.2%
E 15
 
6.2%
M 15
 
6.2%
K 13
 
5.4%
Other values (9) 49
20.3%
Common
ValueCountFrequency (%)
313
55.8%
) 65
 
11.6%
2 61
 
10.9%
( 55
 
9.8%
1 21
 
3.7%
5 12
 
2.1%
3 11
 
2.0%
6 9
 
1.6%
& 5
 
0.9%
4 5
 
0.9%
Other values (2) 4
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 64959
98.8%
ASCII 797
 
1.2%
None 7
 
< 0.1%
Number Forms 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4758
 
7.3%
4245
 
6.5%
4180
 
6.4%
4144
 
6.4%
1655
 
2.5%
1643
 
2.5%
1608
 
2.5%
1328
 
2.0%
1269
 
2.0%
1229
 
1.9%
Other values (458) 38900
59.9%
ASCII
ValueCountFrequency (%)
313
39.3%
) 65
 
8.2%
2 61
 
7.7%
( 55
 
6.9%
Y 30
 
3.8%
A 25
 
3.1%
C 24
 
3.0%
1 21
 
2.6%
S 20
 
2.5%
B 20
 
2.5%
Other values (20) 163
20.5%
None
ValueCountFrequency (%)
7
100.0%
Number Forms
ValueCountFrequency (%)
5
100.0%
Distinct1760
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size65.6 KiB
2023-12-12T16:25:18.292358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length45
Mean length29.537406
Min length14

Characters and Unicode

Total characters247553
Distinct characters390
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique113 ?
Unique (%)1.3%

Sample

1st row충청북도 청주시 상당구 사직대로 361번길 125
2nd row충청북도 청주시 상당구 사직대로 361번길 125
3rd row충청북도 청주시 상당구 사직대로 361번길 125
4th row충청북도 청주시 상당구 사직대로 361번길 125
5th row충청북도 청주시 상당구 사직대로 361번길 125
ValueCountFrequency (%)
충청북도 8381
 
16.6%
청주시 5193
 
10.3%
흥덕구 1531
 
3.0%
서원구 1400
 
2.8%
상당구 1262
 
2.5%
충주시 1097
 
2.2%
청원구 987
 
2.0%
제천시 611
 
1.2%
진천군 314
 
0.6%
음성군 297
 
0.6%
Other values (2462) 29316
58.2%
2023-12-12T16:25:19.159196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42057
 
17.0%
15024
 
6.1%
1 11042
 
4.5%
9717
 
3.9%
8561
 
3.5%
8437
 
3.4%
7309
 
3.0%
7051
 
2.8%
6923
 
2.8%
6903
 
2.8%
Other values (380) 124529
50.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 151819
61.3%
Space Separator 42057
 
17.0%
Decimal Number 40693
 
16.4%
Open Punctuation 4366
 
1.8%
Close Punctuation 4362
 
1.8%
Other Punctuation 2469
 
1.0%
Dash Punctuation 1593
 
0.6%
Uppercase Letter 152
 
0.1%
Lowercase Letter 40
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15024
 
9.9%
9717
 
6.4%
8561
 
5.6%
8437
 
5.6%
7309
 
4.8%
7051
 
4.6%
6923
 
4.6%
6903
 
4.5%
5415
 
3.6%
4133
 
2.7%
Other values (347) 72346
47.7%
Uppercase Letter
ValueCountFrequency (%)
L 32
21.1%
H 32
21.1%
A 23
15.1%
F 14
9.2%
E 10
 
6.6%
N 10
 
6.6%
S 8
 
5.3%
M 5
 
3.3%
C 5
 
3.3%
Y 5
 
3.3%
Other values (2) 8
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 11042
27.1%
2 5616
13.8%
0 5093
12.5%
3 4297
 
10.6%
4 3205
 
7.9%
5 2646
 
6.5%
6 2568
 
6.3%
7 2430
 
6.0%
8 2023
 
5.0%
9 1773
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
i 10
25.0%
e 10
25.0%
t 10
25.0%
p 10
25.0%
Other Punctuation
ValueCountFrequency (%)
, 2426
98.3%
. 43
 
1.7%
Space Separator
ValueCountFrequency (%)
42057
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4366
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4362
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1593
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 151819
61.3%
Common 95542
38.6%
Latin 192
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15024
 
9.9%
9717
 
6.4%
8561
 
5.6%
8437
 
5.6%
7309
 
4.8%
7051
 
4.6%
6923
 
4.6%
6903
 
4.5%
5415
 
3.6%
4133
 
2.7%
Other values (347) 72346
47.7%
Common
ValueCountFrequency (%)
42057
44.0%
1 11042
 
11.6%
2 5616
 
5.9%
0 5093
 
5.3%
( 4366
 
4.6%
) 4362
 
4.6%
3 4297
 
4.5%
4 3205
 
3.4%
5 2646
 
2.8%
6 2568
 
2.7%
Other values (7) 10290
 
10.8%
Latin
ValueCountFrequency (%)
L 32
16.7%
H 32
16.7%
A 23
12.0%
F 14
7.3%
i 10
 
5.2%
E 10
 
5.2%
N 10
 
5.2%
e 10
 
5.2%
t 10
 
5.2%
p 10
 
5.2%
Other values (6) 31
16.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 151819
61.3%
ASCII 95734
38.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42057
43.9%
1 11042
 
11.5%
2 5616
 
5.9%
0 5093
 
5.3%
( 4366
 
4.6%
) 4362
 
4.6%
3 4297
 
4.5%
4 3205
 
3.3%
5 2646
 
2.8%
6 2568
 
2.7%
Other values (23) 10482
 
10.9%
Hangul
ValueCountFrequency (%)
15024
 
9.9%
9717
 
6.4%
8561
 
5.6%
8437
 
5.6%
7309
 
4.8%
7051
 
4.6%
6923
 
4.6%
6903
 
4.5%
5415
 
3.6%
4133
 
2.7%
Other values (347) 72346
47.7%

다발지역 에프아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct632
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5453365.2
Minimum138639
Maximum6715935
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size73.8 KiB
2023-12-12T16:25:19.319227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum138639
5-th percentile160973
Q16101149
median6430780
Q36598070
95-th percentile6713469
Maximum6715935
Range6577296
Interquartile range (IQR)496921

Descriptive statistics

Standard deviation2280472
Coefficient of variation (CV)0.41817701
Kurtosis1.508034
Mean5453365.2
Median Absolute Deviation (MAD)208329
Skewness-1.8502127
Sum4.5704654 × 1010
Variance5.2005525 × 1012
MonotonicityNot monotonic
2023-12-12T16:25:19.507339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6536156 61
 
0.7%
6713004 59
 
0.7%
6536987 57
 
0.7%
6607783 55
 
0.7%
143387 54
 
0.6%
6536697 53
 
0.6%
6598069 53
 
0.6%
6503365 51
 
0.6%
6101147 51
 
0.6%
6713047 51
 
0.6%
Other values (622) 7836
93.5%
ValueCountFrequency (%)
138639 20
 
0.2%
138992 18
 
0.2%
139194 1
 
< 0.1%
139213 6
 
0.1%
139266 22
0.3%
140481 16
 
0.2%
143385 32
0.4%
143387 54
0.6%
143388 32
0.4%
143389 28
0.3%
ValueCountFrequency (%)
6715935 13
0.2%
6715934 6
 
0.1%
6715933 5
 
0.1%
6715930 7
 
0.1%
6715929 7
 
0.1%
6715928 9
 
0.1%
6715925 21
0.3%
6715924 10
0.1%
6715923 24
0.3%
6715921 5
 
0.1%

다발지역 아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct58
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017642
Minimum2013097
Maximum2021056
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size73.8 KiB
2023-12-12T16:25:19.659016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2013097
5-th percentile2013099
Q12015049
median2018045
Q32020016
95-th percentile2021054
Maximum2021056
Range7959
Interquartile range (IQR)4967

Descriptive statistics

Standard deviation2552.869
Coefficient of variation (CV)0.0012652735
Kurtosis-1.0330723
Mean2017642
Median Absolute Deviation (MAD)2001
Skewness-0.43727261
Sum1.6909857 × 1010
Variance6517140
MonotonicityNot monotonic
2023-12-12T16:25:19.823298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2019048 338
 
4.0%
2021053 334
 
4.0%
2021056 312
 
3.7%
2019056 298
 
3.6%
2019049 293
 
3.5%
2017029 287
 
3.4%
2018074 285
 
3.4%
2020087 269
 
3.2%
2013099 241
 
2.9%
2021054 239
 
2.9%
Other values (48) 5485
65.4%
ValueCountFrequency (%)
2013097 231
2.8%
2013098 177
2.1%
2013099 241
2.9%
2013114 227
2.7%
2014095 45
 
0.5%
2014105 209
2.5%
2014109 197
2.4%
2014110 105
1.3%
2014117 135
1.6%
2015042 47
 
0.6%
ValueCountFrequency (%)
2021056 312
3.7%
2021054 239
2.9%
2021053 334
4.0%
2021029 8
 
0.1%
2021028 43
 
0.5%
2021024 146
1.7%
2021018 11
 
0.1%
2020087 269
3.2%
2020083 215
2.6%
2020081 196
2.3%

사고 종류
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size65.6 KiB
VIOLT
1936 
OLDMAN
1573 
BICYCLE
1545 
LG
1164 
CHILD
1037 
Other values (3)
1126 

Length

Max length10
Median length8
Mean length5.7964443
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowBICYCLE
2nd rowJAYWALKING
3rd rowVIOLT
4th rowBICYCLE
5th rowBICYCLE

Common Values

ValueCountFrequency (%)
VIOLT 1936
23.1%
OLDMAN 1573
18.8%
BICYCLE 1545
18.4%
LG 1164
13.9%
CHILD 1037
12.4%
JAYWALKING 669
 
8.0%
SCHOOLZONE 394
 
4.7%
FREEZING 63
 
0.8%

Length

2023-12-12T16:25:19.999797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:25:20.134633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
violt 1936
23.1%
oldman 1573
18.8%
bicycle 1545
18.4%
lg 1164
13.9%
child 1037
12.4%
jaywalking 669
 
8.0%
schoolzone 394
 
4.7%
freezing 63
 
0.8%
Distinct521
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size65.6 KiB
2023-12-12T16:25:20.388705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length34
Mean length26.216919
Min length12

Characters and Unicode

Total characters219724
Distinct characters394
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)0.4%

Sample

1st row충청북도 청주시 상당구 우암동(우암교회 부근)
2nd row충청북도 청주시 상당구 수동(우암오거리 부근)
3rd row충청북도 청주시 상당구 영동(방아다리길삼충로 부근)
4th row충청북도 청주시 상당구 북문로2가동(태웅건설 부근)
5th row충청북도 청주시 상당구 북문로3가동(신쭈꾸곱창의전설 부근)
ValueCountFrequency (%)
충청북도 8361
21.5%
부근 7002
18.0%
청주시 5090
13.1%
흥덕구 1718
 
4.4%
상당구 1428
 
3.7%
인근 1360
 
3.5%
서원구 1125
 
2.9%
충주시 1085
 
2.8%
청원구 819
 
2.1%
제천시 611
 
1.6%
Other values (522) 10229
26.3%
2023-12-12T16:25:20.819594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30447
 
13.9%
14926
 
6.8%
9756
 
4.4%
) 9618
 
4.4%
( 9618
 
4.4%
8611
 
3.9%
8486
 
3.9%
8362
 
3.8%
8348
 
3.8%
7366
 
3.4%
Other values (384) 104186
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 164996
75.1%
Space Separator 30447
 
13.9%
Close Punctuation 9618
 
4.4%
Open Punctuation 9618
 
4.4%
Decimal Number 4527
 
2.1%
Uppercase Letter 245
 
0.1%
Dash Punctuation 124
 
0.1%
Lowercase Letter 118
 
0.1%
Other Punctuation 31
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14926
 
9.0%
9756
 
5.9%
8611
 
5.2%
8486
 
5.1%
8362
 
5.1%
8348
 
5.1%
7366
 
4.5%
7084
 
4.3%
7055
 
4.3%
5515
 
3.3%
Other values (352) 79487
48.2%
Uppercase Letter
ValueCountFrequency (%)
C 30
12.2%
H 27
11.0%
N 27
11.0%
B 25
10.2%
G 23
9.4%
L 22
9.0%
T 20
8.2%
U 19
7.8%
I 18
7.3%
K 16
6.5%
Other values (4) 18
7.3%
Decimal Number
ValueCountFrequency (%)
1 1084
23.9%
2 795
17.6%
0 603
13.3%
3 573
12.7%
5 362
 
8.0%
9 291
 
6.4%
4 277
 
6.1%
8 238
 
5.3%
6 225
 
5.0%
7 79
 
1.7%
Lowercase Letter
ValueCountFrequency (%)
e 108
91.5%
o 5
 
4.2%
i 5
 
4.2%
Space Separator
ValueCountFrequency (%)
30447
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9618
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9618
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 124
100.0%
Other Punctuation
ValueCountFrequency (%)
. 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 164996
75.1%
Common 54365
 
24.7%
Latin 363
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14926
 
9.0%
9756
 
5.9%
8611
 
5.2%
8486
 
5.1%
8362
 
5.1%
8348
 
5.1%
7366
 
4.5%
7084
 
4.3%
7055
 
4.3%
5515
 
3.3%
Other values (352) 79487
48.2%
Latin
ValueCountFrequency (%)
e 108
29.8%
C 30
 
8.3%
H 27
 
7.4%
N 27
 
7.4%
B 25
 
6.9%
G 23
 
6.3%
L 22
 
6.1%
T 20
 
5.5%
U 19
 
5.2%
I 18
 
5.0%
Other values (7) 44
12.1%
Common
ValueCountFrequency (%)
30447
56.0%
) 9618
 
17.7%
( 9618
 
17.7%
1 1084
 
2.0%
2 795
 
1.5%
0 603
 
1.1%
3 573
 
1.1%
5 362
 
0.7%
9 291
 
0.5%
4 277
 
0.5%
Other values (5) 697
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 164996
75.1%
ASCII 54728
 
24.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30447
55.6%
) 9618
 
17.6%
( 9618
 
17.6%
1 1084
 
2.0%
2 795
 
1.5%
0 603
 
1.1%
3 573
 
1.0%
5 362
 
0.7%
9 291
 
0.5%
4 277
 
0.5%
Other values (22) 1060
 
1.9%
Hangul
ValueCountFrequency (%)
14926
 
9.0%
9756
 
5.9%
8611
 
5.2%
8486
 
5.1%
8362
 
5.1%
8348
 
5.1%
7366
 
4.5%
7084
 
4.3%
7055
 
4.3%
5515
 
3.3%
Other values (352) 79487
48.2%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct593
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.732242
Minimum36.164694
Maximum37.531705
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size73.8 KiB
2023-12-12T16:25:21.002772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.164694
5-th percentile36.485169
Q136.620745
median36.642895
Q336.961497
95-th percentile37.134156
Maximum37.531705
Range1.367011
Interquartile range (IQR)0.34075215

Descriptive statistics

Standard deviation0.20338745
Coefficient of variation (CV)0.005537028
Kurtosis0.27599777
Mean36.732242
Median Absolute Deviation (MAD)0.02997285
Skewness0.29962658
Sum307852.92
Variance0.041366456
MonotonicityNot monotonic
2023-12-12T16:25:21.171064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36.61912431 61
 
0.7%
36.61753816 59
 
0.7%
36.71173501 57
 
0.7%
36.63076316 55
 
0.7%
36.62053973 54
 
0.6%
36.62982115 53
 
0.6%
36.61923049 53
 
0.6%
36.48516871 53
 
0.6%
36.99352324 52
 
0.6%
36.62069048 51
 
0.6%
Other values (583) 7833
93.5%
ValueCountFrequency (%)
36.16469415 4
 
< 0.1%
36.17081868 14
0.2%
36.17116437 15
0.2%
36.17252728 14
0.2%
36.1736618 11
 
0.1%
36.17370424 22
0.3%
36.17395362 7
 
0.1%
36.17507298 10
 
0.1%
36.17519463 1
 
< 0.1%
36.17536756 30
0.4%
ValueCountFrequency (%)
37.53170516 1
 
< 0.1%
37.5314386 1
 
< 0.1%
37.53108269 1
 
< 0.1%
37.53103838 1
 
< 0.1%
37.53034958 1
 
< 0.1%
37.18080059 2
 
< 0.1%
37.1729552 2
 
< 0.1%
37.16532049 7
 
0.1%
37.15893794 14
0.2%
37.1553387 25
0.3%

경도
Real number (ℝ)

Distinct591
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.61666
Minimum127.09236
Maximum128.37129
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size73.8 KiB
2023-12-12T16:25:21.350258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127.09236
5-th percentile127.42939
Q1127.46245
median127.4927
Q3127.72171
95-th percentile128.21146
Maximum128.37129
Range1.2789333
Interquartile range (IQR)0.2592646

Descriptive statistics

Standard deviation0.24937262
Coefficient of variation (CV)0.0019540758
Kurtosis0.88323942
Mean127.61666
Median Absolute Deviation (MAD)0.04704
Skewness1.4618333
Sum1069555.2
Variance0.062186704
MonotonicityNot monotonic
2023-12-12T16:25:21.507246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.5952586 66
 
0.8%
127.5123913 61
 
0.7%
127.505983 59
 
0.7%
127.4191699 57
 
0.7%
127.50604 55
 
0.7%
127.4457038 54
 
0.6%
127.5126257 53
 
0.6%
127.7217097 53
 
0.6%
127.5049001 53
 
0.6%
127.4473974 51
 
0.6%
Other values (581) 7819
93.3%
ValueCountFrequency (%)
127.092357 1
 
< 0.1%
127.1273596 1
 
< 0.1%
127.1283531 1
 
< 0.1%
127.1284418 1
 
< 0.1%
127.1302553 1
 
< 0.1%
127.1307068 1
 
< 0.1%
127.3592479 16
0.2%
127.3723417 13
0.2%
127.373831 12
0.1%
127.3756014 11
0.1%
ValueCountFrequency (%)
128.3712903 6
 
0.1%
128.3700236 7
 
0.1%
128.3698531 1
 
< 0.1%
128.3692858 14
0.2%
128.3691241 7
 
0.1%
128.3690681 9
0.1%
128.3690349 22
0.3%
128.3687516 19
0.2%
128.3667619 7
 
0.1%
128.3555812 7
 
0.1%

대상
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size65.6 KiB
기타
5377 
노약자
1573 
어린이
1431 

Length

Max length3
Median length2
Mean length2.3584298
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row기타
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
기타 5377
64.2%
노약자 1573
 
18.8%
어린이 1431
 
17.1%

Length

2023-12-12T16:25:21.650167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:25:21.776852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 5377
64.2%
노약자 1573
 
18.8%
어린이 1431
 
17.1%

거리(미터)
Real number (ℝ)

Distinct959
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean430.88486
Minimum8
Maximum999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size73.8 KiB
2023-12-12T16:25:21.912075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile117
Q1259
median401
Q3570
95-th percentile869
Maximum999
Range991
Interquartile range (IQR)311

Descriptive statistics

Standard deviation222.68063
Coefficient of variation (CV)0.51679845
Kurtosis-0.38606461
Mean430.88486
Median Absolute Deviation (MAD)152
Skewness0.54107557
Sum3611246
Variance49586.662
MonotonicityNot monotonic
2023-12-12T16:25:22.055183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
430 29
 
0.3%
331 28
 
0.3%
357 27
 
0.3%
268 25
 
0.3%
355 25
 
0.3%
271 24
 
0.3%
220 24
 
0.3%
258 24
 
0.3%
282 24
 
0.3%
233 23
 
0.3%
Other values (949) 8128
97.0%
ValueCountFrequency (%)
8 1
 
< 0.1%
12 2
< 0.1%
23 3
< 0.1%
24 2
< 0.1%
25 1
 
< 0.1%
27 2
< 0.1%
29 2
< 0.1%
30 1
 
< 0.1%
31 1
 
< 0.1%
32 2
< 0.1%
ValueCountFrequency (%)
999 1
 
< 0.1%
997 3
< 0.1%
996 3
< 0.1%
995 2
 
< 0.1%
994 1
 
< 0.1%
993 5
0.1%
992 1
 
< 0.1%
991 1
 
< 0.1%
990 2
 
< 0.1%
989 7
0.1%

Interactions

2023-12-12T16:25:15.386122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:11.638780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.448098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.163623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.884785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.664185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.515048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:11.773220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.554661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.286885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.029948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.800697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.641262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:11.931726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.669069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.409278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.148001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.933309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.776052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.066764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.799559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.525923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.265682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.046210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.874422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.203160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.930585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.654394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.440431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.180463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.999355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:12.324051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.047616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:13.779395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:14.550029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:25:15.288286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:25:22.150493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설정보 에프아이디다발지역 에프아이디다발지역 아이디사고 종류위도경도대상거리(미터)
시설정보 에프아이디1.0000.0850.2440.5660.9690.6680.3410.150
다발지역 에프아이디0.0851.0000.8120.6130.2150.2840.2190.059
다발지역 아이디0.2440.8121.0000.5850.2420.2160.4560.147
사고 종류0.5660.6130.5851.0000.4160.5211.0000.127
위도0.9690.2150.2420.4161.0000.8980.3550.195
경도0.6680.2840.2160.5210.8981.0000.1900.127
대상0.3410.2190.4561.0000.3550.1901.0000.102
거리(미터)0.1500.0590.1470.1270.1950.1270.1021.000
2023-12-12T16:25:22.283550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사고 종류대상
사고 종류1.0001.000
대상1.0001.000
2023-12-12T16:25:22.366328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설정보 에프아이디다발지역 에프아이디다발지역 아이디위도경도거리(미터)사고 종류대상
시설정보 에프아이디1.0000.1880.1810.6310.4040.0890.4320.118
다발지역 에프아이디0.1881.0000.9990.1200.1400.0820.3110.210
다발지역 아이디0.1810.9991.0000.1140.1370.0800.3200.291
위도0.6310.1200.1141.0000.4160.0660.2190.168
경도0.4040.1400.1370.4161.0000.0250.1970.122
거리(미터)0.0890.0820.0800.0660.0251.0000.0610.060
사고 종류0.4320.3110.3200.2190.1970.0611.0001.000
대상0.1180.2100.2910.1680.1220.0601.0001.000

Missing values

2023-12-12T16:25:16.185868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:25:16.369250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설정보 에프아이디시설정보 아이디시설명시설 주소다발지역 에프아이디다발지역 아이디사고 종류지점명위도경도대상거리(미터)
043111101ZID11e2422474청주중학교충청북도 청주시 상당구 사직대로 361번길 1251609722013099BICYCLE충청북도 청주시 상당구 우암동(우암교회 부근)36.647618127.489132기타546
143111101ZID11e2422474청주중학교충청북도 청주시 상당구 사직대로 361번길 1251670712013114JAYWALKING충청북도 청주시 상당구 수동(우암오거리 부근)36.646458127.489394기타457
243111101ZID11e2422474청주중학교충청북도 청주시 상당구 사직대로 361번길 12564268692018040VIOLT충청북도 청주시 상당구 영동(방아다리길삼충로 부근)36.644977127.48594기타171
343111101ZID11e2422474청주중학교충청북도 청주시 상당구 사직대로 361번길 12565193732019038BICYCLE충청북도 청주시 상당구 북문로2가동(태웅건설 부근)36.64094127.489388기타415
443111101ZID11e2422474청주중학교충청북도 청주시 상당구 사직대로 361번길 12565193742019038BICYCLE충청북도 청주시 상당구 북문로3가동(신쭈꾸곱창의전설 부근)36.645061127.486778기타196
543111101ZID51b94336bf파랑새지역아동센터충청북도 청주시 상당구 중앙로61번길 39-12, 2층 (영동)1671792013114JAYWALKING충청북도 청주시 상당구 문화동(중앙시장 부근)36.639255127.490089기타452
643111101ZID51b94336bf파랑새지역아동센터충청북도 청주시 상당구 중앙로61번길 39-12, 2층 (영동)62225382016146OLDMAN충청북도 청주시 상당구 수동((청수빌딩) 부근)36.640409127.491957노약자540
743111101ZID51b94336bf파랑새지역아동센터충청북도 청주시 상당구 중앙로61번길 39-12, 2층 (영동)64268692018040VIOLT충청북도 청주시 상당구 영동(방아다리길삼충로 부근)36.644977127.48594기타350
843111101ZID51b94336bf파랑새지역아동센터충청북도 청주시 상당구 중앙로61번길 39-12, 2층 (영동)65193732019038BICYCLE충청북도 청주시 상당구 북문로2가동(태웅건설 부근)36.64094127.489388기타304
943111101ZID51b94336bf파랑새지역아동센터충청북도 청주시 상당구 중앙로61번길 39-12, 2층 (영동)65193742019038BICYCLE충청북도 청주시 상당구 북문로3가동(신쭈꾸곱창의전설 부근)36.645061127.486778기타363
시설정보 에프아이디시설정보 아이디시설명시설 주소다발지역 에프아이디다발지역 아이디사고 종류지점명위도경도대상거리(미터)
837143800253ZID42b8adb6e6매포중학교충청북도 단양군 매포읍 평동1로 1366498892020087LG충청북도 단양군 매포읍(단양로1923 인근)37.033572128.303091기타504
837243800253ZID54a9ac5b2c매포초등학교충청북도 단양군 매포읍 단양로 192966498892020087LG충청북도 단양군 매포읍(단양로1923 인근)37.033572128.303091기타164
837343800253ZID5814247c9f매포초등학교병설유치원충청북도 단양군 매포읍 단양로 192966498892020087LG충청북도 단양군 매포읍(단양로1923 인근)37.033572128.303091기타92
837443800253ZID91c4a0cf56단양카리스요양원충청북도 단양군 매포읍 단양로 185466498882020087LG충청북도 단양군 매포읍(평동1길8018 인근)37.02624128.309684기타398
837543800253ZID91c4a0cf56단양카리스요양원충청북도 단양군 매포읍 단양로 185466498892020087LG충청북도 단양군 매포읍(단양로1923 인근)37.033572128.303091기타673
837643800253ZID964c1ca471매포교회어린이집충청북도 단양군 매포읍 평동3길 12 매포교회어린이집66498882020087LG충청북도 단양군 매포읍(평동1길8018 인근)37.02624128.309684기타747
837743800253ZID964c1ca471매포교회어린이집충청북도 단양군 매포읍 평동3길 12 매포교회어린이집66498892020087LG충청북도 단양군 매포읍(단양로1923 인근)37.033572128.303091기타276
837843800253ZIDd7f9666486충주댐효나눔복지센터충청북도 단양군 매포읍 평동38길 8-466498892020087LG충청북도 단양군 매포읍(단양로1923 인근)37.033572128.303091기타685
837943800253ZIDdb12d8cd66혜모어린이집충청북도 단양군 매포읍 평동5길 23-9 혜모어린이집66498882020087LG충청북도 단양군 매포읍(평동1길8018 인근)37.02624128.309684기타382
838043800253ZIDdb12d8cd66혜모어린이집충청북도 단양군 매포읍 평동5길 23-9 혜모어린이집66498892020087LG충청북도 단양군 매포읍(단양로1923 인근)37.033572128.303091기타631