Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 39972 |
Missing cells (%) | 66.6% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 546.9 KiB |
Average record size in memory | 56.0 B |
Variable types
Categorical | 2 |
---|---|
DateTime | 1 |
Text | 3 |
Dataset
Description | 경기도 의왕시에 신고된 화물자동차 운송 사업자 현황입니다. 업체명, 업종, 도로명주소, 지번주소를 제공하고 있습니다. |
---|---|
Author | 경기도 의왕시 |
URL | https://www.data.go.kr/data/15113394/fileData.do |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
시군명 is highly overall correlated with 사업의종류 | High correlation |
사업의종류 is highly overall correlated with 시군명 | High correlation |
시군명 is highly imbalanced (99.2%) | Imbalance |
사업의종류 is highly imbalanced (99.2%) | Imbalance |
허가연월일 has 9993 (99.9%) missing values | Missing |
상호 has 9993 (99.9%) missing values | Missing |
주사무소도로명주소 has 9993 (99.9%) missing values | Missing |
전화번호 has 9993 (99.9%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 15:15:10.664023 |
---|---|
Analysis finished | 2023-12-12 15:15:11.331507 |
Duration | 0.67 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시군명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
의왕시 | 7 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9993 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9993 | |
의왕시 | 7 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9993 | |
의왕시 | 7 | 0.1% |
허가연월일
Date
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 9993 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Minimum | 2003-01-09 00:00:00 |
---|---|
Maximum | 2010-08-24 00:00:00 |
상호
Text
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 9993 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
주)트랜스틸 | 1 | |
주)세양 | 1 | |
주)씨티엘물류 | 1 | |
주)한원물류 | 1 | |
케이에스씨로지스 | 1 | |
삼화물류(주 | 1 | |
주)장평로지스 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
( | 6 | 12.0% |
) | 6 | 12.0% |
주 | 6 | 12.0% |
스 | 4 | 8.0% |
류 | 3 | 6.0% |
물 | 3 | 6.0% |
씨 | 2 | 4.0% |
지 | 2 | 4.0% |
로 | 2 | 4.0% |
틸 | 1 | 2.0% |
Other values (15) | 15 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 38 | |
Open Punctuation | 6 | 12.0% |
Close Punctuation | 6 | 12.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 6 | |
스 | 4 | 10.5% |
류 | 3 | 7.9% |
물 | 3 | 7.9% |
씨 | 2 | 5.3% |
지 | 2 | 5.3% |
로 | 2 | 5.3% |
틸 | 1 | 2.6% |
케 | 1 | 2.6% |
장 | 1 | 2.6% |
Other values (13) | 13 |
Open Punctuation
Value | Count | Frequency (%) |
( | 6 |
Close Punctuation
Value | Count | Frequency (%) |
) | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 38 | |
Common | 12 | 24.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 6 | |
스 | 4 | 10.5% |
류 | 3 | 7.9% |
물 | 3 | 7.9% |
씨 | 2 | 5.3% |
지 | 2 | 5.3% |
로 | 2 | 5.3% |
틸 | 1 | 2.6% |
케 | 1 | 2.6% |
장 | 1 | 2.6% |
Other values (13) | 13 |
Common
Value | Count | Frequency (%) |
( | 6 | |
) | 6 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 38 | |
ASCII | 12 | 24.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 6 | |
) | 6 |
Hangul
Value | Count | Frequency (%) |
주 | 6 | |
스 | 4 | 10.5% |
류 | 3 | 7.9% |
물 | 3 | 7.9% |
씨 | 2 | 5.3% |
지 | 2 | 5.3% |
로 | 2 | 5.3% |
틸 | 1 | 2.6% |
케 | 1 | 2.6% |
장 | 1 | 2.6% |
Other values (13) | 13 |
사업의종류
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
일반화물 | 7 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9993 | |
일반화물 | 7 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9993 | |
일반화물 | 7 | 0.1% |
주사무소도로명주소
Text
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 9993 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Length
Max length | 44 |
---|---|
Median length | 37 |
Mean length | 36.142857 |
Min length | 28 |
Characters and Unicode
Total characters | 253 |
---|---|
Distinct characters | 64 |
Distinct categories | 7 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 경기도 의왕시 왕송못동로 79, 천일정기화물자동차 (삼동) |
---|---|
2nd row | 경기도 의왕시 오봉산단3로 25, 더리브비즈원 1동 1323호 (삼동) |
3rd row | 경기도 의왕시 창말로 39 (이동, 의왕제1터미널) |
4th row | 경기도 의왕시 이미로 40, 인덕원IT밸리 에이동 1020호 (포일동) |
5th row | 경기도 의왕시 창말로 39, 4군 의왕제1터미널 2층 (이동) |
Value | Count | Frequency (%) |
경기도 | 7 | 13.0% |
의왕시 | 7 | 13.0% |
이동 | 4 | 7.4% |
오봉로 | 2 | 3.7% |
2층 | 2 | 3.7% |
의왕제1터미널 | 2 | 3.7% |
39 | 2 | 3.7% |
창말로 | 2 | 3.7% |
175 | 2 | 3.7% |
삼동 | 2 | 3.7% |
Other values (22) | 22 |
Most occurring characters
Value | Count | Frequency (%) |
47 | 18.6% | |
왕 | 12 | 4.7% |
의 | 11 | 4.3% |
동 | 11 | 4.3% |
1 | 9 | 3.6% |
기 | 8 | 3.2% |
2 | 8 | 3.2% |
, | 7 | 2.8% |
) | 7 | 2.8% |
( | 7 | 2.8% |
Other values (54) | 126 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 143 | |
Space Separator | 47 | 18.6% |
Decimal Number | 40 | 15.8% |
Other Punctuation | 7 | 2.8% |
Close Punctuation | 7 | 2.8% |
Open Punctuation | 7 | 2.8% |
Uppercase Letter | 2 | 0.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
왕 | 12 | 8.4% |
의 | 11 | 7.7% |
동 | 11 | 7.7% |
기 | 8 | 5.6% |
이 | 7 | 4.9% |
경 | 7 | 4.9% |
도 | 7 | 4.9% |
시 | 7 | 4.9% |
로 | 7 | 4.9% |
미 | 4 | 2.8% |
Other values (39) | 62 |
Decimal Number
Value | Count | Frequency (%) |
1 | 9 | |
2 | 8 | |
0 | 5 | |
3 | 5 | |
7 | 4 | |
9 | 3 | 7.5% |
5 | 3 | 7.5% |
4 | 2 | 5.0% |
6 | 1 | 2.5% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 1 | |
I | 1 |
Space Separator
Value | Count | Frequency (%) |
47 |
Other Punctuation
Value | Count | Frequency (%) |
, | 7 |
Close Punctuation
Value | Count | Frequency (%) |
) | 7 |
Open Punctuation
Value | Count | Frequency (%) |
( | 7 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 143 | |
Common | 108 | |
Latin | 2 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
왕 | 12 | 8.4% |
의 | 11 | 7.7% |
동 | 11 | 7.7% |
기 | 8 | 5.6% |
이 | 7 | 4.9% |
경 | 7 | 4.9% |
도 | 7 | 4.9% |
시 | 7 | 4.9% |
로 | 7 | 4.9% |
미 | 4 | 2.8% |
Other values (39) | 62 |
Common
Value | Count | Frequency (%) |
47 | ||
1 | 9 | 8.3% |
2 | 8 | 7.4% |
, | 7 | 6.5% |
) | 7 | 6.5% |
( | 7 | 6.5% |
0 | 5 | 4.6% |
3 | 5 | 4.6% |
7 | 4 | 3.7% |
9 | 3 | 2.8% |
Other values (3) | 6 | 5.6% |
Latin
Value | Count | Frequency (%) |
T | 1 | |
I | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 143 | |
ASCII | 110 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
47 | ||
1 | 9 | 8.2% |
2 | 8 | 7.3% |
, | 7 | 6.4% |
) | 7 | 6.4% |
( | 7 | 6.4% |
0 | 5 | 4.5% |
3 | 5 | 4.5% |
7 | 4 | 3.6% |
9 | 3 | 2.7% |
Other values (5) | 8 | 7.3% |
Hangul
Value | Count | Frequency (%) |
왕 | 12 | 8.4% |
의 | 11 | 7.7% |
동 | 11 | 7.7% |
기 | 8 | 5.6% |
이 | 7 | 4.9% |
경 | 7 | 4.9% |
도 | 7 | 4.9% |
시 | 7 | 4.9% |
로 | 7 | 4.9% |
미 | 4 | 2.8% |
Other values (39) | 62 |
전화번호
Text
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 9993 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 10.428571 |
Min length | 1 |
Characters and Unicode
Total characters | 73 |
---|---|
Distinct characters | 11 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 031-462-2131 |
---|---|
2nd row | 031-462-4247 |
3rd row | 031-461-0280 |
4th row | 031-426-4891 |
5th row |
Value | Count | Frequency (%) |
031-462-2131 | 1 | |
031-462-4247 | 1 | |
031-461-0280 | 1 | |
031-426-4891 | 1 | |
031-461-6691 | 1 | |
031-462-6060 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 12 | |
- | 12 | |
0 | 10 | |
6 | 10 | |
4 | 9 | |
3 | 7 | |
2 | 7 | |
8 | 2 | 2.7% |
9 | 2 | 2.7% |
7 | 1 | 1.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 60 | |
Dash Punctuation | 12 | 16.4% |
Space Separator | 1 | 1.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 12 | |
0 | 10 | |
6 | 10 | |
4 | 9 | |
3 | 7 | |
2 | 7 | |
8 | 2 | 3.3% |
9 | 2 | 3.3% |
7 | 1 | 1.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 12 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 73 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 12 | |
- | 12 | |
0 | 10 | |
6 | 10 | |
4 | 9 | |
3 | 7 | |
2 | 7 | |
8 | 2 | 2.7% |
9 | 2 | 2.7% |
7 | 1 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 73 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 12 | |
- | 12 | |
0 | 10 | |
6 | 10 | |
4 | 9 | |
3 | 7 | |
2 | 7 | |
8 | 2 | 2.7% |
9 | 2 | 2.7% |
7 | 1 | 1.4% |
허가연월일 | 상호 | 주사무소도로명주소 | 전화번호 | |
---|---|---|---|---|
허가연월일 | 1.000 | 1.000 | 1.000 | 1.000 |
상호 | 1.000 | 1.000 | 1.000 | 1.000 |
주사무소도로명주소 | 1.000 | 1.000 | 1.000 | 1.000 |
전화번호 | 1.000 | 1.000 | 1.000 | 1.000 |
시군명 | 사업의종류 | |
---|---|---|
시군명 | 1.000 | 1.000 |
사업의종류 | 1.000 | 1.000 |
시군명 | 사업의종류 | |
---|---|---|
시군명 | 1.000 | 1.000 |
사업의종류 | 1.000 | 1.000 |
시군명 | 허가연월일 | 상호 | 사업의종류 | 주사무소도로명주소 | 전화번호 | |
---|---|---|---|---|---|---|
19952 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
90982 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
63934 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
66329 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
69214 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
83511 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
72150 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
23154 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3054 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
72356 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
시군명 | 허가연월일 | 상호 | 사업의종류 | 주사무소도로명주소 | 전화번호 | |
---|---|---|---|---|---|---|
5629 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7996 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
16807 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
32614 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
55035 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
38425 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
20359 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
37343 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
21732 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
99457 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
시군명 | 허가연월일 | 상호 | 사업의종류 | 주사무소도로명주소 | 전화번호 | # duplicates | |
---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 9993 |