Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 79 |
Missing cells | 323 |
Missing cells (%) | 45.4% |
Duplicate rows | 1 |
Duplicate rows (%) | 1.3% |
Total size in memory | 5.7 KiB |
Average record size in memory | 73.7 B |
Variable types
Unsupported | 3 |
---|---|
Text | 4 |
Categorical | 1 |
Boolean | 1 |
Dataset
Description | 교통문화지수(운전행태/교통안전/교통환경 조사분석한 수치)의 영역별(운전/교통/보행/문화...) 정보 |
---|---|
Author | 교통안전공단 |
URL | https://www.vworld.kr/dtmk/dtmk_ntads_s002.do?dsId=30035 |
Unnamed: 5 has constant value "" | Constant |
Unnamed: 8 has constant value "" | Constant |
Dataset has 1 (1.3%) duplicate rows | Duplicates |
Unnamed: 3 is highly imbalanced (61.1%) | Imbalance |
테이블정의서 has 1 (1.3%) missing values | Missing |
Unnamed: 1 has 6 (7.6%) missing values | Missing |
Unnamed: 2 has 5 (6.3%) missing values | Missing |
Unnamed: 4 has 6 (7.6%) missing values | Missing |
Unnamed: 5 has 77 (97.5%) missing values | Missing |
Unnamed: 6 has 74 (93.7%) missing values | Missing |
Unnamed: 7 has 76 (96.2%) missing values | Missing |
Unnamed: 8 has 78 (98.7%) missing values | Missing |
테이블정의서 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-18 00:13:06.716370 |
---|---|
Analysis finished | 2024-04-18 00:13:08.276232 |
Duration | 1.56 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
테이블정의서
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 1 |
---|---|
Missing (%) | 1.3% |
Memory size | 764.0 B |
Unnamed: 1
Text
MISSING
 
Distinct | 73 |
---|---|
Distinct (%) | 100.0% |
Missing | 6 |
Missing (%) | 7.6% |
Memory size | 764.0 B |
Length
Max length | 37 |
---|---|
Median length | 29 |
Mean length | 19.342466 |
Min length | 4 |
Characters and Unicode
Total characters | 1412 |
---|---|
Distinct characters | 27 |
Distinct categories | 3 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 73 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 컬럼ID |
---|---|
2nd row | YEAR_CD |
3rd row | JIJACE_CD |
4th row | DRV_BHV_GRD |
5th row | TRF_SAF_GRD |
Value | Count | Frequency (%) |
saf_blt_rank | 1 | 1.4% |
local_safty_perform_rank | 1 | 1.4% |
sgn_cnfm_rat_avg | 1 | 1.4% |
drct_sgl_rgtrat_avg | 1 | 1.4% |
crswk_stp_ln_cnfm_rat_avg | 1 | 1.4% |
not_crsw_smart_userat_rank | 1 | 1.4% |
not_crsw_smart_userat_avg | 1 | 1.4% |
busi_car_cnt_road_acc_death_rank | 1 | 1.4% |
busi_car_cnt_road_acc_death_cnt | 1 | 1.4% |
people_road_perdestrian_death_rank | 1 | 1.4% |
Other values (63) | 63 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 221 | |
A | 155 | |
R | 139 | 9.8% |
T | 95 | 6.7% |
S | 93 | 6.6% |
E | 82 | 5.8% |
C | 70 | 5.0% |
D | 59 | 4.2% |
G | 57 | 4.0% |
N | 56 | 4.0% |
Other values (17) | 385 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 1189 | |
Connector Punctuation | 221 | 15.7% |
Other Letter | 2 | 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 155 | |
R | 139 | |
T | 95 | 8.0% |
S | 93 | 7.8% |
E | 82 | 6.9% |
C | 70 | 5.9% |
D | 59 | 5.0% |
G | 57 | 4.8% |
N | 56 | 4.7% |
V | 40 | 3.4% |
Other values (14) | 343 |
Other Letter
Value | Count | Frequency (%) |
컬 | 1 | |
럼 | 1 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 221 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1189 | |
Common | 221 | 15.7% |
Hangul | 2 | 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 155 | |
R | 139 | |
T | 95 | 8.0% |
S | 93 | 7.8% |
E | 82 | 6.9% |
C | 70 | 5.9% |
D | 59 | 5.0% |
G | 57 | 4.8% |
N | 56 | 4.7% |
V | 40 | 3.4% |
Other values (14) | 343 |
Hangul
Value | Count | Frequency (%) |
컬 | 1 | |
럼 | 1 |
Common
Value | Count | Frequency (%) |
_ | 221 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1410 | |
Hangul | 2 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 221 | |
A | 155 | |
R | 139 | 9.9% |
T | 95 | 6.7% |
S | 93 | 6.6% |
E | 82 | 5.8% |
C | 70 | 5.0% |
D | 59 | 4.2% |
G | 57 | 4.0% |
N | 56 | 4.0% |
Other values (15) | 383 |
Hangul
Value | Count | Frequency (%) |
컬 | 1 | |
럼 | 1 |
Unnamed: 2
Text
MISSING
 
Distinct | 74 |
---|---|
Distinct (%) | 100.0% |
Missing | 5 |
Missing (%) | 6.3% |
Memory size | 764.0 B |
Value | Count | Frequency (%) |
랭크 | 11 | 5.4% |
및 | 9 | 4.4% |
당 | 9 | 4.4% |
지자체 | 9 | 4.4% |
사망자 | 9 | 4.4% |
도로연장 | 9 | 4.4% |
교통사고 | 6 | 2.9% |
인구 | 6 | 2.9% |
교통안전 | 6 | 2.9% |
자동차 | 6 | 2.9% |
Other values (74) | 124 |
Most occurring characters
Value | Count | Frequency (%) |
130 | 13.1% | |
_ | 76 | 7.7% |
도 | 35 | 3.5% |
자 | 32 | 3.2% |
전 | 31 | 3.1% |
수 | 29 | 2.9% |
사 | 27 | 2.7% |
지 | 23 | 2.3% |
안 | 20 | 2.0% |
보 | 20 | 2.0% |
Other values (106) | 570 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 765 | |
Space Separator | 130 | 13.1% |
Connector Punctuation | 76 | 7.7% |
Uppercase Letter | 22 | 2.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
도 | 35 | 4.6% |
자 | 32 | 4.2% |
전 | 31 | 4.1% |
수 | 29 | 3.8% |
사 | 27 | 3.5% |
지 | 23 | 3.0% |
안 | 20 | 2.6% |
보 | 20 | 2.6% |
교 | 19 | 2.5% |
통 | 19 | 2.5% |
Other values (89) | 510 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 2 | 9.1% |
S | 2 | 9.1% |
D | 2 | 9.1% |
I | 2 | 9.1% |
R | 2 | 9.1% |
T | 2 | 9.1% |
U | 2 | 9.1% |
X | 1 | 4.5% |
P | 1 | 4.5% |
A | 1 | 4.5% |
Other values (5) | 5 |
Space Separator
Value | Count | Frequency (%) |
130 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 76 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 765 | |
Common | 206 | 20.7% |
Latin | 22 | 2.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
도 | 35 | 4.6% |
자 | 32 | 4.2% |
전 | 31 | 4.1% |
수 | 29 | 3.8% |
사 | 27 | 3.5% |
지 | 23 | 3.0% |
안 | 20 | 2.6% |
보 | 20 | 2.6% |
교 | 19 | 2.5% |
통 | 19 | 2.5% |
Other values (89) | 510 |
Latin
Value | Count | Frequency (%) |
E | 2 | 9.1% |
S | 2 | 9.1% |
D | 2 | 9.1% |
I | 2 | 9.1% |
R | 2 | 9.1% |
T | 2 | 9.1% |
U | 2 | 9.1% |
X | 1 | 4.5% |
P | 1 | 4.5% |
A | 1 | 4.5% |
Other values (5) | 5 |
Common
Value | Count | Frequency (%) |
130 | ||
_ | 76 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 765 | |
ASCII | 228 | 23.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
130 | ||
_ | 76 | |
E | 2 | 0.9% |
S | 2 | 0.9% |
D | 2 | 0.9% |
I | 2 | 0.9% |
R | 2 | 0.9% |
T | 2 | 0.9% |
U | 2 | 0.9% |
X | 1 | 0.4% |
Other values (7) | 7 | 3.1% |
Hangul
Value | Count | Frequency (%) |
도 | 35 | 4.6% |
자 | 32 | 4.2% |
전 | 31 | 4.1% |
수 | 29 | 3.8% |
사 | 27 | 3.5% |
지 | 23 | 3.0% |
안 | 20 | 2.6% |
보 | 20 | 2.6% |
교 | 19 | 2.5% |
통 | 19 | 2.5% |
Other values (89) | 510 |
Unnamed: 3
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 7.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 764.0 B |
NUMBER | |
---|---|
VARCHAR | 6 |
<NA> | 5 |
테이블ID | 1 |
타입 | 1 |
Length
Max length | 7 |
---|---|
Median length | 6 |
Mean length | 5.8607595 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 3.8% |
Sample
1st row | <NA> |
---|---|
2nd row | 테이블ID |
3rd row | <NA> |
4th row | 타입 |
5th row | VARCHAR |
Common Values
Value | Count | Frequency (%) |
NUMBER | 65 | |
VARCHAR | 6 | 7.6% |
<NA> | 5 | 6.3% |
테이블ID | 1 | 1.3% |
타입 | 1 | 1.3% |
DATE | 1 | 1.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
number | 65 | |
varchar | 6 | 7.6% |
na | 5 | 6.3% |
테이블id | 1 | 1.3% |
타입 | 1 | 1.3% |
date | 1 | 1.3% |
Unnamed: 4
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 7.6% |
Memory size | 764.0 B |
Unnamed: 5
Boolean
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 50.0% |
Missing | 77 |
Missing (%) | 97.5% |
Memory size | 290.0 B |
False | 2 |
---|---|
(Missing) |
Value | Count | Frequency (%) |
False | 2 | 2.5% |
(Missing) | 77 |
Unnamed: 6
Text
MISSING
 
Distinct | 4 |
---|---|
Distinct (%) | 80.0% |
Missing | 74 |
Missing (%) | 93.7% |
Memory size | 764.0 B |
Value | Count | Frequency (%) |
pk | 2 | |
작성일 | 1 | |
테이블명 | 1 | |
pk/fk | 1 |
Most occurring characters
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
작 | 1 | 6.2% |
성 | 1 | 6.2% |
일 | 1 | 6.2% |
테 | 1 | 6.2% |
이 | 1 | 6.2% |
블 | 1 | 6.2% |
명 | 1 | 6.2% |
/ | 1 | 6.2% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 8 | |
Other Letter | 7 | |
Other Punctuation | 1 | 6.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
F | 1 | 12.5% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 8 | |
Hangul | 7 | |
Common | 1 | 6.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Latin
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
F | 1 | 12.5% |
Common
Value | Count | Frequency (%) |
/ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9 | |
Hangul | 7 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
/ | 1 | 11.1% |
F | 1 | 11.1% |
Hangul
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 76 |
---|---|
Missing (%) | 96.2% |
Memory size | 764.0 B |
Unnamed: 8
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 78 |
Missing (%) | 98.7% |
Memory size | 764.0 B |
Value | Count | Frequency (%) |
참조테이블명/비고 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
/ | 1 | |
비 | 1 | |
고 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8 | |
Other Punctuation | 1 | 11.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8 | |
Common | 1 | 11.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
Common
Value | Count | Frequency (%) |
/ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 8 | |
ASCII | 1 | 11.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
ASCII
Value | Count | Frequency (%) |
/ | 1 |
Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 6 | |
---|---|---|---|---|
Unnamed: 1 | 1.000 | 1.000 | 1.000 | 1.000 |
Unnamed: 2 | 1.000 | 1.000 | 1.000 | 1.000 |
Unnamed: 3 | 1.000 | 1.000 | 1.000 | 1.000 |
Unnamed: 6 | 1.000 | 1.000 | 1.000 | 1.000 |
테이블정의서 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
0 | 작성자 | <NA> | <NA> | <NA> | NaN | <NA> | 작성일 | 2019-09-04 00:00:00 | <NA> |
1 | 주제영역명 | <NA> | <NA> | 테이블ID | Z_TMACS_T_W_BASE_TRF_CULT_IDX | <NA> | 테이블명 | 기초통계 교통문화지수 영역별 | <NA> |
2 | 테이블설명 | <NA> | <NA> | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
3 | No | 컬럼ID | 컬럼명 | 타입 | 길이(Byte) | <NA> | PK/FK | Default | 참조테이블명/비고 |
4 | 1 | YEAR_CD | 년도_코드 | VARCHAR | 4 | N | PK | NaN | <NA> |
5 | 2 | JIJACE_CD | 지자체_코드 | VARCHAR | 5 | N | PK | NaN | <NA> |
6 | 3 | DRV_BHV_GRD | 운전_행태_점수 | NUMBER | 10,2 | <NA> | <NA> | NaN | <NA> |
7 | 4 | TRF_SAF_GRD | 교통_안전_점수 | NUMBER | 10,2 | <NA> | <NA> | NaN | <NA> |
8 | 5 | WK_BHV_GRD | 보행_행태_점수 | NUMBER | 10,2 | <NA> | <NA> | NaN | <NA> |
9 | 6 | TRF_WKP_GRD | 교통_약자_점수 | NUMBER | 10,2 | <NA> | <NA> | NaN | <NA> |
테이블정의서 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
69 | 65 | CRSW_SMART_USERAT_AVG | <NA> | NUMBER | 10,2 | <NA> | <NA> | NaN | <NA> |
70 | 65 | CRSW_SMART_USERAT_AVG_AVG | 횡단중 스마트기기 사용률_평균 | NUMBER | 10,2 | <NA> | <NA> | NaN | <NA> |
71 | 65 | NOT_CRSW_SMART_USERAT_AVG_AVG | 횡단보도가 아닌 도로에서의 무단횡단 빈도_평균 | NUMBER | 10,2 | <NA> | <NA> | NaN | <NA> |
72 | 65 | DRV_BHV_GRD_RK | 운전_행태_순위 | NUMBER | 10 | <NA> | <NA> | NaN | <NA> |
73 | 65 | TRF_SAF_GRD_RK | 교통_안전_순위 | NUMBER | 10 | <NA> | <NA> | NaN | <NA> |
74 | 65 | WK_BHV_GRD_RK | 보행_행태_순위 | NUMBER | 10 | <NA> | <NA> | NaN | <NA> |
75 | 65 | TRF_WKP_GRDRK | 교통_약자_순위 | NUMBER | 10 | <NA> | <NA> | NaN | <NA> |
76 | 인덱스명 | <NA> | 인덱스키 | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
77 | NaN | <NA> | BASE_TRF_CULT_IDX_PK | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
78 | 업무규칙 | <NA> | <NA> | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
Most frequently occurring
Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 5 | Unnamed: 6 | Unnamed: 8 | # duplicates | |
---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |