Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory410.2 KiB
Average record size in memory42.0 B

Variable types

Text2
Numeric2

Dataset

Description도로표지 속성정보(도로종류, 노선번호, 표지종류), 위치정보(좌표), 안내지명정보, POI정보
Author국토교통부
URLhttps://www.data.go.kr/data/3049886/fileData.do

Reproduction

Analysis started2023-12-12 15:11:41.681935
Analysis finished2023-12-12 15:11:42.684880
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9993
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:11:42.872255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length32
Mean length17.9026
Min length7

Characters and Unicode

Total characters179026
Distinct characters468
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9986 ?
Unique (%)99.9%

Sample

1st rowRR-15[동서대로]-상-64
2nd rowUR(경기도 의정부시)-[동일로]-상-1
3rd rowNR-4[영남대로]-하-282
4th rowRR-619[예당로]-상-51
5th rowUR(경기도 화성시)-3-상-8
ValueCountFrequency (%)
ur(경기도 538
 
4.7%
ur(충청남도 91
 
0.8%
ur(경상북도 89
 
0.8%
ur(전라북도 87
 
0.8%
ur(경상남도 79
 
0.7%
wr(서울특별시 68
 
0.6%
gr(충청북도 49
 
0.4%
wr(인천광역시 46
 
0.4%
ur(강원도 41
 
0.4%
gr(경상북도 40
 
0.4%
Other values (10012) 10289
90.1%
2023-12-13T00:11:43.296825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 30052
 
16.8%
R 12130
 
6.8%
[ 9883
 
5.5%
] 9883
 
5.5%
9598
 
5.4%
1 8105
 
4.5%
2 5815
 
3.2%
3 5665
 
3.2%
5334
 
3.0%
5084
 
2.8%
Other values (458) 77477
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60281
33.7%
Decimal Number 43546
24.3%
Dash Punctuation 30052
16.8%
Uppercase Letter 20004
 
11.2%
Open Punctuation 11854
 
6.6%
Close Punctuation 11854
 
6.6%
Space Separator 1417
 
0.8%
Other Punctuation 14
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9598
 
15.9%
5334
 
8.8%
5084
 
8.4%
2143
 
3.6%
2043
 
3.4%
1793
 
3.0%
1335
 
2.2%
1321
 
2.2%
1041
 
1.7%
986
 
1.6%
Other values (429) 29603
49.1%
Decimal Number
ValueCountFrequency (%)
1 8105
18.6%
2 5815
13.4%
3 5665
13.0%
4 4313
9.9%
5 3960
9.1%
7 3590
8.2%
6 3264
7.5%
0 3264
7.5%
9 2850
 
6.5%
8 2720
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
R 12130
60.6%
N 5070
25.3%
U 1571
 
7.9%
E 768
 
3.8%
G 320
 
1.6%
W 142
 
0.7%
C 1
 
< 0.1%
P 1
 
< 0.1%
A 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 8
57.1%
? 5
35.7%
· 1
 
7.1%
Open Punctuation
ValueCountFrequency (%)
[ 9883
83.4%
( 1971
 
16.6%
Close Punctuation
ValueCountFrequency (%)
] 9883
83.4%
) 1971
 
16.6%
Dash Punctuation
ValueCountFrequency (%)
- 30052
100.0%
Space Separator
ValueCountFrequency (%)
1417
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 98741
55.2%
Hangul 60281
33.7%
Latin 20004
 
11.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9598
 
15.9%
5334
 
8.8%
5084
 
8.4%
2143
 
3.6%
2043
 
3.4%
1793
 
3.0%
1335
 
2.2%
1321
 
2.2%
1041
 
1.7%
986
 
1.6%
Other values (429) 29603
49.1%
Common
ValueCountFrequency (%)
- 30052
30.4%
[ 9883
 
10.0%
] 9883
 
10.0%
1 8105
 
8.2%
2 5815
 
5.9%
3 5665
 
5.7%
4 4313
 
4.4%
5 3960
 
4.0%
7 3590
 
3.6%
6 3264
 
3.3%
Other values (10) 14211
14.4%
Latin
ValueCountFrequency (%)
R 12130
60.6%
N 5070
25.3%
U 1571
 
7.9%
E 768
 
3.8%
G 320
 
1.6%
W 142
 
0.7%
C 1
 
< 0.1%
P 1
 
< 0.1%
A 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 118744
66.3%
Hangul 60281
33.7%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 30052
25.3%
R 12130
10.2%
[ 9883
 
8.3%
] 9883
 
8.3%
1 8105
 
6.8%
2 5815
 
4.9%
3 5665
 
4.8%
N 5070
 
4.3%
4 4313
 
3.6%
5 3960
 
3.3%
Other values (18) 23868
20.1%
Hangul
ValueCountFrequency (%)
9598
 
15.9%
5334
 
8.8%
5084
 
8.4%
2143
 
3.6%
2043
 
3.4%
1793
 
3.0%
1335
 
2.2%
1321
 
2.2%
1041
 
1.7%
986
 
1.6%
Other values (429) 29603
49.1%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct92
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:11:43.572697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length6.4895
Min length2

Characters and Unicode

Total characters64895
Distinct characters120
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)0.1%

Sample

1st row2방향예고표지
2nd row도계표지
3rd row3방향표지
4th row3방향예고표지
5th row읍/면/동계표지
ValueCountFrequency (%)
2방향표지 2032
19.4%
3방향표지 1346
12.9%
2방향예고표지 1199
11.5%
3방향예고표지 749
 
7.2%
단일노선표지 635
 
6.1%
분기점표지 558
 
5.3%
2지명이정표지 528
 
5.1%
1지명방향표지 459
 
4.4%
읍/면/동계표지 296
 
2.8%
3지명이정표지 196
 
1.9%
Other values (88) 2453
23.5%
2023-12-13T00:11:44.009394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12072
18.6%
10217
15.7%
6445
9.9%
6445
9.9%
2 4334
 
6.7%
3 2464
 
3.8%
2439
 
3.8%
2410
 
3.7%
1571
 
2.4%
1 1027
 
1.6%
Other values (110) 15471
23.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54215
83.5%
Decimal Number 8231
 
12.7%
Other Punctuation 762
 
1.2%
Space Separator 451
 
0.7%
Close Punctuation 424
 
0.7%
Open Punctuation 424
 
0.7%
Lowercase Letter 215
 
0.3%
Connector Punctuation 102
 
0.2%
Uppercase Letter 54
 
0.1%
Dash Punctuation 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12072
22.3%
10217
18.8%
6445
11.9%
6445
11.9%
2439
 
4.5%
2410
 
4.4%
1571
 
2.9%
900
 
1.7%
828
 
1.5%
709
 
1.3%
Other values (95) 10179
18.8%
Decimal Number
ValueCountFrequency (%)
2 4334
52.7%
3 2464
29.9%
1 1027
 
12.5%
0 316
 
3.8%
5 45
 
0.5%
4 31
 
0.4%
9 14
 
0.2%
Other Punctuation
ValueCountFrequency (%)
/ 762
100.0%
Space Separator
ValueCountFrequency (%)
451
100.0%
Close Punctuation
ValueCountFrequency (%)
) 424
100.0%
Open Punctuation
ValueCountFrequency (%)
( 424
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 215
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 102
100.0%
Uppercase Letter
ValueCountFrequency (%)
K 54
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54212
83.5%
Common 10411
 
16.0%
Latin 269
 
0.4%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12072
22.3%
10217
18.8%
6445
11.9%
6445
11.9%
2439
 
4.5%
2410
 
4.4%
1571
 
2.9%
900
 
1.7%
828
 
1.5%
709
 
1.3%
Other values (93) 10176
18.8%
Common
ValueCountFrequency (%)
2 4334
41.6%
3 2464
23.7%
1 1027
 
9.9%
/ 762
 
7.3%
451
 
4.3%
) 424
 
4.1%
( 424
 
4.1%
0 316
 
3.0%
_ 102
 
1.0%
5 45
 
0.4%
Other values (3) 62
 
0.6%
Latin
ValueCountFrequency (%)
m 215
79.9%
K 54
 
20.1%
Han
ValueCountFrequency (%)
2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54210
83.5%
ASCII 10680
 
16.5%
CJK 3
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12072
22.3%
10217
18.8%
6445
11.9%
6445
11.9%
2439
 
4.5%
2410
 
4.4%
1571
 
2.9%
900
 
1.7%
828
 
1.5%
709
 
1.3%
Other values (92) 10174
18.8%
ASCII
ValueCountFrequency (%)
2 4334
40.6%
3 2464
23.1%
1 1027
 
9.6%
/ 762
 
7.1%
451
 
4.2%
) 424
 
4.0%
( 424
 
4.0%
0 316
 
3.0%
m 215
 
2.0%
_ 102
 
1.0%
Other values (5) 161
 
1.5%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
2
66.7%
1
33.3%

좌표(x)
Real number (ℝ)

Distinct9954
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean255512.78
Minimum-5980.16
Maximum536224.32
Zeros5
Zeros (%)< 0.1%
Negative1
Negative (%)< 0.1%
Memory size166.0 KiB
2023-12-13T00:11:44.190052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-5980.16
5-th percentile160477.81
Q1193726.96
median235671.07
Q3317220.25
95-th percentile394004.29
Maximum536224.32
Range542204.48
Interquartile range (IQR)123493.29

Descriptive statistics

Standard deviation75565.996
Coefficient of variation (CV)0.29574253
Kurtosis-0.80203534
Mean255512.78
Median Absolute Deviation (MAD)51947.77
Skewness0.50815813
Sum2.5551278 × 109
Variance5.7102197 × 109
MonotonicityNot monotonic
2023-12-13T00:11:44.370713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 5
 
0.1%
233020.02 2
 
< 0.1%
255825.56 2
 
< 0.1%
285383.33 2
 
< 0.1%
195387.3 2
 
< 0.1%
341927.77 2
 
< 0.1%
410758.01 2
 
< 0.1%
158737.62 2
 
< 0.1%
190082.86 2
 
< 0.1%
392656.0 2
 
< 0.1%
Other values (9944) 9977
99.8%
ValueCountFrequency (%)
-5980.16 1
 
< 0.1%
0.0 5
0.1%
102252.25 1
 
< 0.1%
102407.51 1
 
< 0.1%
111512.7 1
 
< 0.1%
114553.06 1
 
< 0.1%
115116.73 1
 
< 0.1%
116367.75 1
 
< 0.1%
116634.37 1
 
< 0.1%
118690.68 1
 
< 0.1%
ValueCountFrequency (%)
536224.3161 1
< 0.1%
432262.41 1
< 0.1%
432251.45 1
< 0.1%
432241.49 1
< 0.1%
432036.91 1
< 0.1%
431956.83 1
< 0.1%
431735.83 1
< 0.1%
430920.34 1
< 0.1%
430899.22 1
< 0.1%
429713.21 1
< 0.1%

좌표(y)
Real number (ℝ)

Distinct9955
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean315198.33
Minimum-31778
Maximum562919.68
Zeros5
Zeros (%)< 0.1%
Negative172
Negative (%)1.7%
Memory size166.0 KiB
2023-12-13T00:11:44.532483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-31778
5-th percentile142985.59
Q1222505.98
median319965.43
Q3413544.26
95-th percentile481113.08
Maximum562919.68
Range594697.68
Interquartile range (IQR)191038.27

Descriptive statistics

Standard deviation115290.16
Coefficient of variation (CV)0.36577022
Kurtosis-0.31733173
Mean315198.33
Median Absolute Deviation (MAD)95233.2
Skewness-0.36684279
Sum3.1519833 × 109
Variance1.3291822 × 1010
MonotonicityNot monotonic
2023-12-13T00:11:44.665234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 5
 
0.1%
313806.78 2
 
< 0.1%
470544.66 2
 
< 0.1%
457816.98 2
 
< 0.1%
422549.31 2
 
< 0.1%
188562.79 2
 
< 0.1%
261204.08 2
 
< 0.1%
455305.85 2
 
< 0.1%
242037.92 2
 
< 0.1%
359657.0 2
 
< 0.1%
Other values (9945) 9977
99.8%
ValueCountFrequency (%)
-31778.0 1
< 0.1%
-29717.47 1
< 0.1%
-28726.83 1
< 0.1%
-28014.11 1
< 0.1%
-27521.28 1
< 0.1%
-27234.54 1
< 0.1%
-27077.33 1
< 0.1%
-27053.43 1
< 0.1%
-26941.78 1
< 0.1%
-26935.3 1
< 0.1%
ValueCountFrequency (%)
562919.68 1
< 0.1%
560359.17 1
< 0.1%
556046.06 1
< 0.1%
552442.38 1
< 0.1%
551790.07 1
< 0.1%
551170.31 1
< 0.1%
550528.67 1
< 0.1%
550451.86 1
< 0.1%
550123.88 1
< 0.1%
550086.11 1
< 0.1%

Interactions

2023-12-13T00:11:42.328981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:11:42.161265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:11:42.403977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:11:42.250600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:11:44.764345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표지종별좌표(x)좌표(y)
표지종별1.0000.2150.272
좌표(x)0.2151.0000.424
좌표(y)0.2720.4241.000
2023-12-13T00:11:44.857523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
좌표(x)좌표(y)
좌표(x)1.000-0.092
좌표(y)-0.0921.000

Missing values

2023-12-13T00:11:42.557123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:11:42.646536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

표지일련번호표지종별좌표(x)좌표(y)
38197RR-15[동서대로]-상-642방향예고표지172874.21216420.9
2808UR(경기도 의정부시)-[동일로]-상-1도계표지204780.8465264.96
52178NR-4[영남대로]-하-2823방향표지293198.51293155.78
86369RR-619[예당로]-상-513방향예고표지179308.23350572.49
12668UR(경기도 화성시)-3-상-8읍/면/동계표지187346.4845412460.6059
30149NR-37[율곡로]-하-12641지명방향표지183237.21487262.66
44787NR-30[여용로]-상-7322방향예고표지259291.97270845.98
28359NR-24[배내로]-상-9483방향예고표지392244.02233733.2
93700RR-592[질마로]-하-212방향예고표지257134.91362968.27
26518ER-30[당진영덕고속도로]-상-1802지명이정표지166938.11370632.34
표지일련번호표지종별좌표(x)좌표(y)
77257NR-31[청송로]-상-431단일노선표지385247.78325208.35
50499NR-24[산내로]-하-228도계표지382941.03236219.54
38519RR-15[녹두로]-상-772방향표지166186.64215692.83
29340ER-25[논산천안고속도로]-상-578매표소표지213155.79358532.22
40289NR-15[우주로]-상-195읍/면/동계표지242981.6111538.47
65453NR-3[남상주로]-상-3142방향표지297932.15309521.69
87443UR(충청북도 충주시)-05[예성로]-하-13방향표지283236.53386644.09
15383NR-29[죽향문화로]-하-13방향표지198952.78203709.69
35566ER-1[경부고속도로]-하-6811차출구예고표지(2방향)_2Km예고346776.26269732.61
29934NR-38[강원남부로]-상-880분기점표지381103.81409708.3