Overview

Dataset statistics

Number of variables2
Number of observations21
Missing cells4
Missing cells (%)9.5%
Duplicate rows1
Duplicate rows (%)4.8%
Total size in memory468.0 B
Average record size in memory22.3 B

Variable types

Text2

Dataset

Description2023년 영월군 일반화물 운송사 현황으로, 일반화물 업체명(혹은 대표자명), 주소의 정보를 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15115482/fileData.do

Alerts

Dataset has 1 (4.8%) duplicate rowsDuplicates
상호 has 2 (9.5%) missing valuesMissing
주사무소주소 has 2 (9.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 07:58:15.572793
Analysis finished2023-12-12 07:58:15.997390
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

MISSING 

Distinct18
Distinct (%)94.7%
Missing2
Missing (%)9.5%
Memory size300.0 B
2023-12-12T16:58:16.150888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length8
Mean length6.0526316
Min length3

Characters and Unicode

Total characters115
Distinct characters54
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)89.5%

Sample

1st row(주)새하마노
2nd row영월레카
3rd row(자)이진상운
4th row영월특수물산
5th row(주)쌍용로지스
ValueCountFrequency (%)
김oo 2
 
10.0%
주)세계물류 1
 
5.0%
명성특운 1
 
5.0%
주)유원물류 1
 
5.0%
영월운수화물 1
 
5.0%
주식회사 1
 
5.0%
쌍용제이씨(주 1
 
5.0%
주)코모로환경 1
 
5.0%
주)에스디케이 1
 
5.0%
일반화물 1
 
5.0%
Other values (9) 9
45.0%
2023-12-12T16:58:16.567322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 10
 
8.7%
10
 
8.7%
) 10
 
8.7%
O 8
 
7.0%
6
 
5.2%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (44) 56
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 86
74.8%
Open Punctuation 10
 
8.7%
Close Punctuation 10
 
8.7%
Uppercase Letter 8
 
7.0%
Space Separator 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
11.6%
6
 
7.0%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
2
 
2.3%
2
 
2.3%
Other values (40) 48
55.8%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Uppercase Letter
ValueCountFrequency (%)
O 8
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 86
74.8%
Common 21
 
18.3%
Latin 8
 
7.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
11.6%
6
 
7.0%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
2
 
2.3%
2
 
2.3%
Other values (40) 48
55.8%
Common
ValueCountFrequency (%)
( 10
47.6%
) 10
47.6%
1
 
4.8%
Latin
ValueCountFrequency (%)
O 8
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 86
74.8%
ASCII 29
 
25.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 10
34.5%
) 10
34.5%
O 8
27.6%
1
 
3.4%
Hangul
ValueCountFrequency (%)
10
 
11.6%
6
 
7.0%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
2
 
2.3%
2
 
2.3%
Other values (40) 48
55.8%

주사무소주소
Text

MISSING 

Distinct14
Distinct (%)73.7%
Missing2
Missing (%)9.5%
Memory size300.0 B
2023-12-12T16:58:16.811767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length28
Mean length25.947368
Min length22

Characters and Unicode

Total characters493
Distinct characters64
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)63.2%

Sample

1st row강원특별자치도 영월군 한반도면 쌍용로 295
2nd row강원특별자치도 영월군 영월읍 느티나무길 27
3rd row강원특별자치도 영월군 한반도면 강원남로 267
4th row강원특별자치도 영월군 영월읍 하송안길 70-4
5th row강원특별자치도 영월군 한반도면 쌍용로 122
ValueCountFrequency (%)
강원특별자치도 19
19.2%
영월군 19
19.2%
한반도면 11
11.1%
강원남로 6
 
6.1%
영월읍 6
 
6.1%
267 5
 
5.1%
쌍용로 5
 
5.1%
122 2
 
2.0%
140 1
 
1.0%
1101호 1
 
1.0%
Other values (24) 24
24.2%
2023-12-12T16:58:17.161175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
80
 
16.2%
30
 
6.1%
25
 
5.1%
25
 
5.1%
25
 
5.1%
25
 
5.1%
20
 
4.1%
19
 
3.9%
19
 
3.9%
19
 
3.9%
Other values (54) 206
41.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 339
68.8%
Space Separator 80
 
16.2%
Decimal Number 66
 
13.4%
Dash Punctuation 4
 
0.8%
Other Punctuation 2
 
0.4%
Close Punctuation 1
 
0.2%
Open Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
8.8%
25
 
7.4%
25
 
7.4%
25
 
7.4%
25
 
7.4%
20
 
5.9%
19
 
5.6%
19
 
5.6%
19
 
5.6%
19
 
5.6%
Other values (39) 113
33.3%
Decimal Number
ValueCountFrequency (%)
2 15
22.7%
1 13
19.7%
6 8
12.1%
7 8
12.1%
4 8
12.1%
0 5
 
7.6%
9 4
 
6.1%
3 2
 
3.0%
8 2
 
3.0%
5 1
 
1.5%
Space Separator
ValueCountFrequency (%)
80
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 339
68.8%
Common 154
31.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
8.8%
25
 
7.4%
25
 
7.4%
25
 
7.4%
25
 
7.4%
20
 
5.9%
19
 
5.6%
19
 
5.6%
19
 
5.6%
19
 
5.6%
Other values (39) 113
33.3%
Common
ValueCountFrequency (%)
80
51.9%
2 15
 
9.7%
1 13
 
8.4%
6 8
 
5.2%
7 8
 
5.2%
4 8
 
5.2%
0 5
 
3.2%
- 4
 
2.6%
9 4
 
2.6%
, 2
 
1.3%
Other values (5) 7
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 339
68.8%
ASCII 154
31.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
80
51.9%
2 15
 
9.7%
1 13
 
8.4%
6 8
 
5.2%
7 8
 
5.2%
4 8
 
5.2%
0 5
 
3.2%
- 4
 
2.6%
9 4
 
2.6%
, 2
 
1.3%
Other values (5) 7
 
4.5%
Hangul
ValueCountFrequency (%)
30
 
8.8%
25
 
7.4%
25
 
7.4%
25
 
7.4%
25
 
7.4%
20
 
5.9%
19
 
5.6%
19
 
5.6%
19
 
5.6%
19
 
5.6%
Other values (39) 113
33.3%

Correlations

2023-12-12T16:58:17.258793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호주사무소주소
상호1.0000.868
주사무소주소0.8681.000

Missing values

2023-12-12T16:58:15.759391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:58:15.843527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T16:58:15.943264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

상호주사무소주소
0(주)새하마노강원특별자치도 영월군 한반도면 쌍용로 295
1영월레카강원특별자치도 영월군 영월읍 느티나무길 27
2(자)이진상운강원특별자치도 영월군 한반도면 강원남로 267
3영월특수물산강원특별자치도 영월군 영월읍 하송안길 70-4
4(주)쌍용로지스강원특별자치도 영월군 한반도면 쌍용로 122
5박OO강원특별자치도 영월군 주천면 자작결운길 244-36
6(주)진우환경강원특별자치도 영월군 한반도면 쌍용로 122
7(주)무진물류강원특별자치도 영월군 한반도면 강원남로 267
8김OO강원특별자치도 영월군 영월읍 중앙로 269
9임OO강원특별자치도 영월군 영월읍 중리1길 144-12
상호주사무소주소
11김OO강원특별자치도 영월군 영월읍 하송로 146-10, 103동 1101호 (드림채아파트)
12일반화물강원특별자치도 영월군 영월읍 오무개길 9
13(주)세계물류강원특별자치도 영월군 한반도면 강원남로 140
14(주)에스디케이강원특별자치도 영월군 한반도면 쌍용로 297
15(주)코모로환경강원특별자치도 영월군 남면 연당로 41, 남면청년회
16쌍용제이씨(주)강원특별자치도 영월군 한반도면 쌍용로 88
17주식회사 영월운수화물강원특별자치도 영월군 한반도면 강원남로 267
18(주)유원물류강원특별자치도 영월군 한반도면 강원남로 267
19<NA><NA>
20<NA><NA>

Duplicate rows

Most frequently occurring

상호주사무소주소# duplicates
0<NA><NA>2