Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 390.6 KiB |
Average record size in memory | 40.0 B |
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Dataset
Description | 경상남도 도로대장전산화 시스템 데이터의 중장기개방계획에 따른 데이터입니다. 시스템 상에서의 각 도로별 구조물 정보를 가지고 있으며, 도로대장의 구조물도면 정보 데이터를 포함하고있습니다. |
---|---|
Author | 경상남도 |
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15091937 |
Reproduction
Analysis started | 2023-12-10 23:25:32.011855 |
---|---|
Analysis finished | 2023-12-10 23:25:32.890991 |
Duration | 0.88 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
구조물코드
Text
Distinct | 530 |
---|---|
Distinct (%) | 5.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
0058b570 | 473 | 4.7% |
0058b520 | 387 | 3.9% |
0058b300 | 356 | 3.6% |
0058b490 | 246 | 2.5% |
1020t010 | 218 | 2.2% |
1047t010 | 189 | 1.9% |
1020b090 | 183 | 1.8% |
0058b320 | 174 | 1.7% |
0030b040 | 155 | 1.6% |
0058b380 | 154 | 1.5% |
Other values (520) | 7465 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 32879 | |
B | 9547 | 11.9% |
1 | 8956 | 11.2% |
5 | 6351 | 7.9% |
8 | 5290 | 6.6% |
2 | 3644 | 4.6% |
4 | 3198 | 4.0% |
3 | 3159 | 3.9% |
7 | 2364 | 3.0% |
9 | 2190 | 2.7% |
Other values (4) | 2422 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 70000 | |
Uppercase Letter | 10000 | 12.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 32879 | |
1 | 8956 | 12.8% |
5 | 6351 | 9.1% |
8 | 5290 | 7.6% |
2 | 3644 | 5.2% |
4 | 3198 | 4.6% |
3 | 3159 | 4.5% |
7 | 2364 | 3.4% |
9 | 2190 | 3.1% |
6 | 1969 | 2.8% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 9547 | |
T | 427 | 4.3% |
S | 19 | 0.2% |
P | 7 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 70000 | |
Latin | 10000 | 12.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 32879 | |
1 | 8956 | 12.8% |
5 | 6351 | 9.1% |
8 | 5290 | 7.6% |
2 | 3644 | 5.2% |
4 | 3198 | 4.6% |
3 | 3159 | 4.5% |
7 | 2364 | 3.4% |
9 | 2190 | 3.1% |
6 | 1969 | 2.8% |
Latin
Value | Count | Frequency (%) |
B | 9547 | |
T | 427 | 4.3% |
S | 19 | 0.2% |
P | 7 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 80000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 32879 | |
B | 9547 | 11.9% |
1 | 8956 | 11.2% |
5 | 6351 | 7.9% |
8 | 5290 | 6.6% |
2 | 3644 | 4.6% |
4 | 3198 | 4.0% |
3 | 3159 | 3.9% |
7 | 2364 | 3.0% |
9 | 2190 | 2.7% |
Other values (4) | 2422 | 3.0% |
파일명
Text
Distinct | 9865 |
---|---|
Distinct (%) | 98.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 13 |
---|---|
Median length | 13 |
Mean length | 12.9992 |
Min length | 11 |
Characters and Unicode
Total characters | 129992 |
---|---|
Distinct characters | 14 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 9741 ? |
---|---|
Unique (%) | 97.4% |
Sample
1st row | 100305B040013 |
---|---|
2nd row | 005807B520021 |
3rd row | 103703B100080 |
4th row | 108903B040010 |
5th row | 006007B190004 |
Value | Count | Frequency (%) |
102113b110001 | 3 | < 0.1% |
100101b010001 | 3 | < 0.1% |
100116b320001 | 3 | < 0.1% |
100111b200001 | 3 | < 0.1% |
100114b280001 | 3 | < 0.1% |
108911b150001 | 3 | < 0.1% |
104202b040001 | 3 | < 0.1% |
100113b220001 | 3 | < 0.1% |
100102b110001 | 3 | < 0.1% |
108410b200001 | 3 | < 0.1% |
Other values (9855) | 9970 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 52827 | |
1 | 15210 | 11.7% |
B | 9547 | 7.3% |
5 | 8708 | 6.7% |
7 | 8011 | 6.2% |
2 | 7476 | 5.8% |
8 | 7243 | 5.6% |
3 | 6940 | 5.3% |
4 | 5695 | 4.4% |
9 | 4114 | 3.2% |
Other values (4) | 4221 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 119992 | |
Uppercase Letter | 10000 | 7.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 52827 | |
1 | 15210 | 12.7% |
5 | 8708 | 7.3% |
7 | 8011 | 6.7% |
2 | 7476 | 6.2% |
8 | 7243 | 6.0% |
3 | 6940 | 5.8% |
4 | 5695 | 4.7% |
9 | 4114 | 3.4% |
6 | 3768 | 3.1% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 9547 | |
T | 427 | 4.3% |
S | 19 | 0.2% |
P | 7 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 119992 | |
Latin | 10000 | 7.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 52827 | |
1 | 15210 | 12.7% |
5 | 8708 | 7.3% |
7 | 8011 | 6.7% |
2 | 7476 | 6.2% |
8 | 7243 | 6.0% |
3 | 6940 | 5.8% |
4 | 5695 | 4.7% |
9 | 4114 | 3.4% |
6 | 3768 | 3.1% |
Latin
Value | Count | Frequency (%) |
B | 9547 | |
T | 427 | 4.3% |
S | 19 | 0.2% |
P | 7 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 129992 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 52827 | |
1 | 15210 | 11.7% |
B | 9547 | 7.3% |
5 | 8708 | 6.7% |
7 | 8011 | 6.2% |
2 | 7476 | 5.8% |
8 | 7243 | 5.6% |
3 | 6940 | 5.3% |
4 | 5695 | 4.4% |
9 | 4114 | 3.2% |
Other values (4) | 4221 | 3.2% |
도면명
Text
Distinct | 7674 |
---|---|
Distinct (%) | 76.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 60 |
---|---|
Median length | 36 |
Mean length | 11.0638 |
Min length | 2 |
Characters and Unicode
Total characters | 110638 |
---|---|
Distinct characters | 503 |
Distinct categories | 12 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 6853 ? |
---|---|
Unique (%) | 68.5% |
Sample
1st row | 편구배도 |
---|---|
2nd row | 케이슨횡방향벽체철근배치도,TYPE1 |
3rd row | 장박교LAUNCHING NOSE상세도(5) |
4th row | 도탄교 난간 상세도 |
5th row | 덕진교교명판 |
Value | Count | Frequency (%) |
교명주 | 232 | 1.6% |
교대 | 230 | 1.6% |
위치도 | 224 | 1.5% |
정면 | 203 | 1.4% |
교명판 | 196 | 1.3% |
교각 | 193 | 1.3% |
일반도 | 177 | 1.2% |
측면 | 164 | 1.1% |
상세도 | 164 | 1.1% |
135 | 0.9% | |
Other values (6455) | 12816 |
Most occurring characters
Value | Count | Frequency (%) |
도 | 7305 | 6.6% |
( | 6215 | 5.6% |
) | 6145 | 5.6% |
교 | 5670 | 5.1% |
4747 | 4.3% | |
1 | 2916 | 2.6% |
상 | 2855 | 2.6% |
2 | 2388 | 2.2% |
배 | 2197 | 2.0% |
근 | 2057 | 1.9% |
Other values (493) | 68143 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 69232 | |
Uppercase Letter | 11338 | 10.2% |
Decimal Number | 10063 | 9.1% |
Open Punctuation | 6215 | 5.6% |
Close Punctuation | 6145 | 5.6% |
Space Separator | 4747 | 4.3% |
Other Punctuation | 1259 | 1.1% |
Dash Punctuation | 997 | 0.9% |
Lowercase Letter | 356 | 0.3% |
Math Symbol | 265 | 0.2% |
Other values (2) | 21 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
도 | 7305 | 10.6% |
교 | 5670 | 8.2% |
상 | 2855 | 4.1% |
배 | 2197 | 3.2% |
근 | 2057 | 3.0% |
세 | 2030 | 2.9% |
면 | 1387 | 2.0% |
강 | 1292 | 1.9% |
치 | 1188 | 1.7% |
일 | 1181 | 1.7% |
Other values (415) | 42070 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 1450 | |
P | 1169 | 10.3% |
T | 844 | 7.4% |
S | 832 | 7.3% |
A | 825 | 7.3% |
N | 781 | 6.9% |
R | 689 | 6.1% |
G | 560 | 4.9% |
M | 512 | 4.5% |
O | 512 | 4.5% |
Other values (16) | 3164 |
Lowercase Letter
Value | Count | Frequency (%) |
w | 94 | |
g | 88 | |
d | 88 | |
m | 25 | 7.0% |
e | 15 | 4.2% |
o | 6 | 1.7% |
r | 5 | 1.4% |
t | 5 | 1.4% |
l | 5 | 1.4% |
p | 5 | 1.4% |
Other values (9) | 20 | 5.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 2916 | |
2 | 2388 | |
3 | 1356 | |
4 | 743 | 7.4% |
5 | 662 | 6.6% |
6 | 470 | 4.7% |
7 | 437 | 4.3% |
0 | 420 | 4.2% |
8 | 380 | 3.8% |
9 | 291 | 2.9% |
Other Punctuation
Value | Count | Frequency (%) |
, | 759 | |
. | 433 | |
' | 23 | 1.8% |
: | 21 | 1.7% |
/ | 12 | 1.0% |
* | 5 | 0.4% |
@ | 2 | 0.2% |
" | 2 | 0.2% |
& | 2 | 0.2% |
Letter Number
Value | Count | Frequency (%) |
Ⅲ | 5 | |
Ⅳ | 3 | |
Ⅴ | 3 | |
Ⅵ | 3 | |
Ⅱ | 1 | 6.2% |
Ⅰ | 1 | 6.2% |
Math Symbol
Value | Count | Frequency (%) |
~ | 187 | |
= | 60 | 22.6% |
+ | 18 | 6.8% |
Open Punctuation
Value | Count | Frequency (%) |
( | 6215 |
Close Punctuation
Value | Count | Frequency (%) |
) | 6145 |
Space Separator
Value | Count | Frequency (%) |
4747 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 997 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 69232 | |
Common | 29696 | |
Latin | 11710 | 10.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
도 | 7305 | 10.6% |
교 | 5670 | 8.2% |
상 | 2855 | 4.1% |
배 | 2197 | 3.2% |
근 | 2057 | 3.0% |
세 | 2030 | 2.9% |
면 | 1387 | 2.0% |
강 | 1292 | 1.9% |
치 | 1188 | 1.7% |
일 | 1181 | 1.7% |
Other values (415) | 42070 |
Latin
Value | Count | Frequency (%) |
E | 1450 | |
P | 1169 | 10.0% |
T | 844 | 7.2% |
S | 832 | 7.1% |
A | 825 | 7.0% |
N | 781 | 6.7% |
R | 689 | 5.9% |
G | 560 | 4.8% |
M | 512 | 4.4% |
O | 512 | 4.4% |
Other values (41) | 3536 |
Common
Value | Count | Frequency (%) |
( | 6215 | |
) | 6145 | |
4747 | ||
1 | 2916 | |
2 | 2388 | 8.0% |
3 | 1356 | 4.6% |
- | 997 | 3.4% |
, | 759 | 2.6% |
4 | 743 | 2.5% |
5 | 662 | 2.2% |
Other values (17) | 2768 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 69232 | |
ASCII | 41390 | |
Number Forms | 16 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
도 | 7305 | 10.6% |
교 | 5670 | 8.2% |
상 | 2855 | 4.1% |
배 | 2197 | 3.2% |
근 | 2057 | 3.0% |
세 | 2030 | 2.9% |
면 | 1387 | 2.0% |
강 | 1292 | 1.9% |
치 | 1188 | 1.7% |
일 | 1181 | 1.7% |
Other values (415) | 42070 |
ASCII
Value | Count | Frequency (%) |
( | 6215 | |
) | 6145 | |
4747 | 11.5% | |
1 | 2916 | 7.0% |
2 | 2388 | 5.8% |
E | 1450 | 3.5% |
3 | 1356 | 3.3% |
P | 1169 | 2.8% |
- | 997 | 2.4% |
T | 844 | 2.0% |
Other values (62) | 13163 |
Number Forms
Value | Count | Frequency (%) |
Ⅲ | 5 | |
Ⅳ | 3 | |
Ⅴ | 3 | |
Ⅵ | 3 | |
Ⅱ | 1 | 6.2% |
Ⅰ | 1 | 6.2% |
입력방식
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
R | |
---|---|
P | |
V | |
p | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | R |
---|---|
2nd row | R |
3rd row | R |
4th row | R |
5th row | P |
Common Values
Value | Count | Frequency (%) |
R | 7082 | |
P | 2031 | 20.3% |
V | 886 | 8.9% |
p | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
r | 7082 | |
p | 2032 | 20.3% |
v | 886 | 8.9% |
구조물코드 | 파일명 | 도면명 | 입력방식 | |
---|---|---|---|---|
13496 | 1003B040 | 100305B040013 | 편구배도 | R |
3774 | 0058B520 | 005807B520021 | 케이슨횡방향벽체철근배치도,TYPE1 | R |
9348 | 1037B100 | 103703B100080 | 장박교LAUNCHING NOSE상세도(5) | R |
15932 | 1089B040 | 108903B040010 | 도탄교 난간 상세도 | R |
11269 | 0060B190 | 006007B190004 | 덕진교교명판 | P |
15471 | 1080B070 | 108001B070003 | 대지교 교명주 | P |
12755 | 1010B100 | 101003B100004 | 주철근조립도 및 배근도 | V |
5226 | 0058B570 | 005807B570189 | P7-1파일기초,파일철근배치도(2) | R |
13171 | 1004B100 | 100403B100022 | 주형상세도(15) | R |
1997 | 0058B370 | 005807B370069 | 시.종점부정착구보강상세 | R |
구조물코드 | 파일명 | 도면명 | 입력방식 | |
---|---|---|---|---|
15736 | 1084B110 | 108403B110003 | 춘전교 교명주 | P |
7025 | 0069B610 | 006908B610219 | 교대2구조도(1) | R |
13801 | 0067B010 | 006701B010117 | 상부슬라브수평브레이싱(1) | R |
15871 | 1084B190 | 108406B190010 | ARCHRIB배근도(1) | V |
7115 | 0069B350 | 006906B350019 | 배근도(시점부) | R |
3167 | 0058B490 | 005807B490222 | 맨홀상세도 | R |
8103 | 1007B210 | 100705B210002 | 교명주 | P |
10338 | 1077B090 | 107703B090010 | 어영교일반도 | V |
264 | 0058B270 | 005807B270102 | 강재재료표(11) | R |
7123 | 0069B350 | 006906B350027 | 차도용난간받침상세도 | R |