Overview

Dataset statistics

Number of variables10
Number of observations196
Missing cells17
Missing cells (%)0.9%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory15.4 KiB
Average record size in memory80.7 B

Variable types

Unsupported2
Text7
Categorical1

Dataset

Description문화체육관광부가 지정한 중국인 단체관광객 유치 업무를 할 수 있는 전담여행사 목록으로 업체명, 대표자, 연락처, 지정일자 등의 정보를 제공합니다.
Author문화체육관광부
URLhttps://www.data.go.kr/data/15096591/fileData.do

Alerts

Dataset has 1 (0.5%) duplicate rowsDuplicates
Unnamed: 1 has 2 (1.0%) missing valuesMissing
Unnamed: 2 has 2 (1.0%) missing valuesMissing
Unnamed: 3 has 2 (1.0%) missing valuesMissing
Unnamed: 4 has 2 (1.0%) missing valuesMissing
Unnamed: 5 has 2 (1.0%) missing valuesMissing
Unnamed: 6 has 2 (1.0%) missing valuesMissing
Unnamed: 7 has 2 (1.0%) missing valuesMissing
Unnamed: 8 has 2 (1.0%) missing valuesMissing
중국 단체관광객 유치 전담여행사 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-17 16:12:07.278420
Analysis finished2024-04-17 16:12:08.153270
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

중국 단체관광객 유치 전담여행사
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.5%
Memory size1.7 KiB

Unnamed: 1
Text

MISSING 

Distinct194
Distinct (%)100.0%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2024-04-18T01:12:08.281368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length8.6701031
Min length3

Characters and Unicode

Total characters1682
Distinct characters211
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique194 ?
Unique (%)100.0%

Sample

1st row업체명
2nd row(주)금룡여행사
3rd row롯데관광(주)
4th row(주)한진관광
5th row(주)계명세계여행
ValueCountFrequency (%)
주식회사 21
 
9.7%
주)레드캡투어 1
 
0.5%
팀맥스어드벤처 1
 
0.5%
주)코리아외사국제여행사 1
 
0.5%
주)현대투어 1
 
0.5%
대한국제여유(주 1
 
0.5%
주)굿프렌드여행사 1
 
0.5%
롯데제이티비(주 1
 
0.5%
주)마이스월드 1
 
0.5%
선일국제 1
 
0.5%
Other values (187) 187
86.2%
2024-04-18T01:12:08.564222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
173
 
10.3%
( 148
 
8.8%
) 148
 
8.8%
98
 
5.8%
80
 
4.8%
80
 
4.8%
43
 
2.6%
40
 
2.4%
34
 
2.0%
33
 
2.0%
Other values (201) 805
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1355
80.6%
Open Punctuation 148
 
8.8%
Close Punctuation 148
 
8.8%
Space Separator 24
 
1.4%
Other Symbol 7
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
 
12.8%
98
 
7.2%
80
 
5.9%
80
 
5.9%
43
 
3.2%
40
 
3.0%
34
 
2.5%
33
 
2.4%
32
 
2.4%
24
 
1.8%
Other values (197) 718
53.0%
Open Punctuation
ValueCountFrequency (%)
( 148
100.0%
Close Punctuation
ValueCountFrequency (%)
) 148
100.0%
Space Separator
ValueCountFrequency (%)
24
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1362
81.0%
Common 320
 
19.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
 
12.7%
98
 
7.2%
80
 
5.9%
80
 
5.9%
43
 
3.2%
40
 
2.9%
34
 
2.5%
33
 
2.4%
32
 
2.3%
24
 
1.8%
Other values (198) 725
53.2%
Common
ValueCountFrequency (%)
( 148
46.2%
) 148
46.2%
24
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1355
80.6%
ASCII 320
 
19.0%
None 7
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
173
 
12.8%
98
 
7.2%
80
 
5.9%
80
 
5.9%
43
 
3.2%
40
 
3.0%
34
 
2.5%
33
 
2.4%
32
 
2.4%
24
 
1.8%
Other values (197) 718
53.0%
ASCII
ValueCountFrequency (%)
( 148
46.2%
) 148
46.2%
24
 
7.5%
None
ValueCountFrequency (%)
7
100.0%

Unnamed: 2
Text

MISSING 

Distinct194
Distinct (%)100.0%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2024-04-18T01:12:08.791970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length21
Mean length8.9175258
Min length2

Characters and Unicode

Total characters1730
Distinct characters289
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique194 ?
Unique (%)100.0%

Sample

1st row업체명(중문)
2nd row(株)金龙旅行社
3rd row乐天观光(株)
4th row(株)韩进观光
5th row(株)启明世界旅行
ValueCountFrequency (%)
tour 5
 
2.1%
co 4
 
1.7%
ltd 4
 
1.7%
co.,ltd 3
 
1.2%
株式會社 3
 
1.2%
株式会社 2
 
0.8%
korea 2
 
0.8%
tours 2
 
0.8%
大韩国际旅游(株 1
 
0.4%
株)佳友旅行社 1
 
0.4%
Other values (214) 214
88.8%
2024-04-18T01:12:09.134045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
127
 
7.3%
( 115
 
6.6%
) 115
 
6.6%
99
 
5.7%
93
 
5.4%
57
 
3.3%
55
 
3.2%
53
 
3.1%
49
 
2.8%
T 41
 
2.4%
Other values (279) 926
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1074
62.1%
Uppercase Letter 326
 
18.8%
Open Punctuation 117
 
6.8%
Close Punctuation 117
 
6.8%
Space Separator 49
 
2.8%
Lowercase Letter 28
 
1.6%
Other Punctuation 19
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
127
 
11.8%
99
 
9.2%
93
 
8.7%
57
 
5.3%
55
 
5.1%
53
 
4.9%
37
 
3.4%
26
 
2.4%
18
 
1.7%
16
 
1.5%
Other values (233) 493
45.9%
Uppercase Letter
ValueCountFrequency (%)
T 41
12.6%
O 32
 
9.8%
R 25
 
7.7%
E 23
 
7.1%
C 22
 
6.7%
A 21
 
6.4%
L 21
 
6.4%
U 20
 
6.1%
N 19
 
5.8%
D 15
 
4.6%
Other values (14) 87
26.7%
Lowercase Letter
ValueCountFrequency (%)
a 4
14.3%
r 3
10.7%
s 3
10.7%
o 3
10.7%
m 3
10.7%
e 3
10.7%
u 2
7.1%
i 2
7.1%
d 2
7.1%
p 1
 
3.6%
Other values (2) 2
7.1%
Other Punctuation
ValueCountFrequency (%)
. 7
36.8%
, 6
31.6%
& 4
21.1%
' 1
 
5.3%
1
 
5.3%
Open Punctuation
ValueCountFrequency (%)
( 115
98.3%
2
 
1.7%
Close Punctuation
ValueCountFrequency (%)
) 115
98.3%
2
 
1.7%
Space Separator
ValueCountFrequency (%)
49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 1069
61.8%
Latin 354
 
20.5%
Common 302
 
17.5%
Hangul 5
 
0.3%

Most frequent character per script

Han
ValueCountFrequency (%)
127
 
11.9%
99
 
9.3%
93
 
8.7%
57
 
5.3%
55
 
5.1%
53
 
5.0%
37
 
3.5%
26
 
2.4%
18
 
1.7%
16
 
1.5%
Other values (228) 488
45.7%
Latin
ValueCountFrequency (%)
T 41
 
11.6%
O 32
 
9.0%
R 25
 
7.1%
E 23
 
6.5%
C 22
 
6.2%
A 21
 
5.9%
L 21
 
5.9%
U 20
 
5.6%
N 19
 
5.4%
D 15
 
4.2%
Other values (26) 115
32.5%
Common
ValueCountFrequency (%)
( 115
38.1%
) 115
38.1%
49
16.2%
. 7
 
2.3%
, 6
 
2.0%
& 4
 
1.3%
2
 
0.7%
2
 
0.7%
' 1
 
0.3%
1
 
0.3%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
CJK 1008
58.3%
ASCII 651
37.6%
CJK Compat Ideographs 61
 
3.5%
Hangul 5
 
0.3%
None 4
 
0.2%
Punctuation 1
 
0.1%

Most frequent character per block

CJK
ValueCountFrequency (%)
127
 
12.6%
99
 
9.8%
93
 
9.2%
55
 
5.5%
53
 
5.3%
37
 
3.7%
26
 
2.6%
18
 
1.8%
16
 
1.6%
13
 
1.3%
Other values (223) 471
46.7%
ASCII
ValueCountFrequency (%)
( 115
17.7%
) 115
17.7%
49
 
7.5%
T 41
 
6.3%
O 32
 
4.9%
R 25
 
3.8%
E 23
 
3.5%
C 22
 
3.4%
A 21
 
3.2%
L 21
 
3.2%
Other values (33) 187
28.7%
CJK Compat Ideographs
ValueCountFrequency (%)
57
93.4%
1
 
1.6%
1
 
1.6%
1
 
1.6%
1
 
1.6%
None
ValueCountFrequency (%)
2
50.0%
2
50.0%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Unnamed: 3
Text

MISSING 

Distinct193
Distinct (%)99.5%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2024-04-18T01:12:09.413807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.2989691
Min length3

Characters and Unicode

Total characters640
Distinct characters170
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique192 ?
Unique (%)99.0%

Sample

1st row대표자
2nd row유옥붕
3rd row조광희
4th row김정수
5th row김미숙
ValueCountFrequency (%)
최성희 2
 
0.9%
2
 
0.9%
2
 
0.9%
황정희 1
 
0.5%
권보경 1
 
0.5%
이영택 1
 
0.5%
이수빈 1
 
0.5%
김용원 1
 
0.5%
박재영 1
 
0.5%
남영봉 1
 
0.5%
Other values (202) 202
94.0%
2024-04-18T01:12:09.782676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36
 
5.6%
34
 
5.3%
27
 
4.2%
15
 
2.3%
15
 
2.3%
14
 
2.2%
14
 
2.2%
13
 
2.0%
11
 
1.7%
9
 
1.4%
Other values (160) 452
70.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 595
93.0%
Space Separator 36
 
5.6%
Control 8
 
1.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
5.7%
27
 
4.5%
15
 
2.5%
15
 
2.5%
14
 
2.4%
14
 
2.4%
13
 
2.2%
11
 
1.8%
9
 
1.5%
9
 
1.5%
Other values (157) 434
72.9%
Space Separator
ValueCountFrequency (%)
36
100.0%
Control
ValueCountFrequency (%)
8
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 595
93.0%
Common 45
 
7.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
5.7%
27
 
4.5%
15
 
2.5%
15
 
2.5%
14
 
2.4%
14
 
2.4%
13
 
2.2%
11
 
1.8%
9
 
1.5%
9
 
1.5%
Other values (157) 434
72.9%
Common
ValueCountFrequency (%)
36
80.0%
8
 
17.8%
, 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 595
93.0%
ASCII 45
 
7.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
36
80.0%
8
 
17.8%
, 1
 
2.2%
Hangul
ValueCountFrequency (%)
34
 
5.7%
27
 
4.5%
15
 
2.5%
15
 
2.5%
14
 
2.4%
14
 
2.4%
13
 
2.2%
11
 
1.8%
9
 
1.5%
9
 
1.5%
Other values (157) 434
72.9%

Unnamed: 4
Text

MISSING 

Distinct193
Distinct (%)99.5%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2024-04-18T01:12:10.056005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length3
Mean length3.2474227
Min length2

Characters and Unicode

Total characters630
Distinct characters336
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique192 ?
Unique (%)99.0%

Sample

1st row대표자(중문)
2nd row刘玉鹏
3rd row赵光熙
4th row金正洙
5th row金美淑
ValueCountFrequency (%)
崔盛熙 2
 
0.9%
2
 
0.9%
崔珉瑞 1
 
0.5%
權寶慶 1
 
0.5%
李永泽 1
 
0.5%
李守彬 1
 
0.5%
金容元 1
 
0.5%
朴宰永 1
 
0.5%
南永峰 1
 
0.5%
白大雄 1
 
0.5%
Other values (201) 201
94.4%
2024-04-18T01:12:10.428376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
5.6%
22
 
3.5%
17
 
2.7%
14
 
2.2%
10
 
1.6%
8
 
1.3%
8
 
1.3%
8
 
1.3%
7
 
1.1%
6
 
1.0%
Other values (326) 495
78.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 597
94.8%
Space Separator 22
 
3.5%
Control 8
 
1.3%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
5.9%
17
 
2.8%
14
 
2.3%
10
 
1.7%
8
 
1.3%
8
 
1.3%
7
 
1.2%
6
 
1.0%
6
 
1.0%
5
 
0.8%
Other values (321) 481
80.6%
Space Separator
ValueCountFrequency (%)
22
100.0%
Control
ValueCountFrequency (%)
8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 592
94.0%
Common 33
 
5.2%
Hangul 5
 
0.8%

Most frequent character per script

Han
ValueCountFrequency (%)
35
 
5.9%
17
 
2.9%
14
 
2.4%
10
 
1.7%
8
 
1.4%
8
 
1.4%
7
 
1.2%
6
 
1.0%
6
 
1.0%
5
 
0.8%
Other values (316) 476
80.4%
Common
ValueCountFrequency (%)
22
66.7%
8
 
24.2%
( 1
 
3.0%
) 1
 
3.0%
, 1
 
3.0%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
CJK 580
92.1%
ASCII 33
 
5.2%
CJK Compat Ideographs 12
 
1.9%
Hangul 5
 
0.8%

Most frequent character per block

CJK
ValueCountFrequency (%)
35
 
6.0%
17
 
2.9%
14
 
2.4%
8
 
1.4%
8
 
1.4%
7
 
1.2%
6
 
1.0%
6
 
1.0%
5
 
0.9%
5
 
0.9%
Other values (313) 469
80.9%
ASCII
ValueCountFrequency (%)
22
66.7%
8
 
24.2%
( 1
 
3.0%
) 1
 
3.0%
, 1
 
3.0%
CJK Compat Ideographs
ValueCountFrequency (%)
10
83.3%
1
 
8.3%
1
 
8.3%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Unnamed: 5
Text

MISSING 

Distinct193
Distinct (%)99.5%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2024-04-18T01:12:10.639367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12.5
Mean length11.525773
Min length2

Characters and Unicode

Total characters2236
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique192 ?
Unique (%)99.0%

Sample

1st row전화
2nd row02-720-2861
3rd row02-2078-6658
4th row02-726-5546
5th row02-732-8888
ValueCountFrequency (%)
02-775-4884 2
 
1.0%
02-3144-6636 1
 
0.5%
02-720-7506 1
 
0.5%
02-775-3563 1
 
0.5%
02-739-7309 1
 
0.5%
032-752-7703 1
 
0.5%
02-335-1818 1
 
0.5%
02-334-1891 1
 
0.5%
02-6313-8021 1
 
0.5%
02-6123-4886 1
 
0.5%
Other values (183) 183
94.3%
2024-04-18T01:12:10.939346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 386
17.3%
0 343
15.3%
2 309
13.8%
3 203
9.1%
8 195
8.7%
7 167
7.5%
6 159
7.1%
1 141
 
6.3%
5 121
 
5.4%
4 106
 
4.7%
Other values (3) 106
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1848
82.6%
Dash Punctuation 386
 
17.3%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 343
18.6%
2 309
16.7%
3 203
11.0%
8 195
10.6%
7 167
9.0%
6 159
8.6%
1 141
7.6%
5 121
 
6.5%
4 106
 
5.7%
9 104
 
5.6%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 386
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2234
99.9%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 386
17.3%
0 343
15.4%
2 309
13.8%
3 203
9.1%
8 195
8.7%
7 167
7.5%
6 159
7.1%
1 141
 
6.3%
5 121
 
5.4%
4 106
 
4.7%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2234
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 386
17.3%
0 343
15.4%
2 309
13.8%
3 203
9.1%
8 195
8.7%
7 167
7.5%
6 159
7.1%
1 141
 
6.3%
5 121
 
5.4%
4 106
 
4.7%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 6
Text

MISSING 

Distinct189
Distinct (%)97.4%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2024-04-18T01:12:11.144276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.448454
Min length1

Characters and Unicode

Total characters2221
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique185 ?
Unique (%)95.4%

Sample

1st row팩스
2nd row02-720-2865
3rd row02-6442-3920
4th row02-773-1623
5th row02-2630-2563
ValueCountFrequency (%)
3
 
1.5%
02-338-8086 2
 
1.0%
02-718-1689 2
 
1.0%
02-775-4885 2
 
1.0%
02-6123-4887 1
 
0.5%
070-7500-2902 1
 
0.5%
02-335-1883 1
 
0.5%
064-743-0606 1
 
0.5%
055-339-5540 1
 
0.5%
02-984-1088 1
 
0.5%
Other values (179) 179
92.3%
2024-04-18T01:12:11.477924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 383
17.2%
2 308
13.9%
0 307
13.8%
3 196
8.8%
8 184
8.3%
6 162
7.3%
7 154
6.9%
5 142
 
6.4%
1 141
 
6.3%
4 136
 
6.1%
Other values (3) 108
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1836
82.7%
Dash Punctuation 383
 
17.2%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 308
16.8%
0 307
16.7%
3 196
10.7%
8 184
10.0%
6 162
8.8%
7 154
8.4%
5 142
7.7%
1 141
7.7%
4 136
7.4%
9 106
 
5.8%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 383
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2219
99.9%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 383
17.3%
2 308
13.9%
0 307
13.8%
3 196
8.8%
8 184
8.3%
6 162
7.3%
7 154
6.9%
5 142
 
6.4%
1 141
 
6.4%
4 136
 
6.1%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2219
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 383
17.3%
2 308
13.9%
0 307
13.8%
3 196
8.8%
8 184
8.3%
6 162
7.3%
7 154
6.9%
5 142
 
6.4%
1 141
 
6.4%
4 136
 
6.1%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 7
Text

MISSING 

Distinct194
Distinct (%)100.0%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2024-04-18T01:12:11.769549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length82
Median length55.5
Mean length39.505155
Min length2

Characters and Unicode

Total characters7664
Distinct characters317
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique194 ?
Unique (%)100.0%

Sample

1st row주소
2nd row03981 서울특별시 마포구 성미산로 189, 2층
3rd row04543 서울특별시 중구 을지로11길 15 동화빌딩 501호(중국사업부)
4th row04532 서울특별시 중구 소공로 88, 신관 9층 (소공동, 한진빌딩)
5th row07213 서울특별시 영등포구 양평로 67 한강포스빌 420호
ValueCountFrequency (%)
서울특별시 136
 
9.4%
중구 28
 
1.9%
마포구 26
 
1.8%
2층 24
 
1.7%
종로구 20
 
1.4%
영등포구 17
 
1.2%
경기도 15
 
1.0%
서대문구 11
 
0.8%
강서구 11
 
0.8%
3층 10
 
0.7%
Other values (856) 1153
79.5%
2024-04-18T01:12:12.463316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1260
 
16.4%
0 375
 
4.9%
1 356
 
4.6%
2 257
 
3.4%
3 248
 
3.2%
218
 
2.8%
202
 
2.6%
, 198
 
2.6%
193
 
2.5%
186
 
2.4%
Other values (307) 4171
54.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3729
48.7%
Decimal Number 2147
28.0%
Space Separator 1260
 
16.4%
Other Punctuation 199
 
2.6%
Open Punctuation 119
 
1.6%
Close Punctuation 119
 
1.6%
Dash Punctuation 28
 
0.4%
Uppercase Letter 27
 
0.4%
Lowercase Letter 22
 
0.3%
Math Symbol 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
218
 
5.8%
202
 
5.4%
193
 
5.2%
186
 
5.0%
146
 
3.9%
144
 
3.9%
144
 
3.9%
143
 
3.8%
128
 
3.4%
88
 
2.4%
Other values (256) 2137
57.3%
Lowercase Letter
ValueCountFrequency (%)
e 5
22.7%
n 2
 
9.1%
o 2
 
9.1%
d 1
 
4.5%
i 1
 
4.5%
r 1
 
4.5%
t 1
 
4.5%
m 1
 
4.5%
p 1
 
4.5%
l 1
 
4.5%
Other values (6) 6
27.3%
Uppercase Letter
ValueCountFrequency (%)
S 5
18.5%
G 3
11.1%
C 2
 
7.4%
M 2
 
7.4%
A 2
 
7.4%
L 2
 
7.4%
B 2
 
7.4%
T 2
 
7.4%
V 1
 
3.7%
K 1
 
3.7%
Other values (5) 5
18.5%
Decimal Number
ValueCountFrequency (%)
0 375
17.5%
1 356
16.6%
2 257
12.0%
3 248
11.6%
5 183
8.5%
4 183
8.5%
7 145
 
6.8%
8 139
 
6.5%
6 135
 
6.3%
9 126
 
5.9%
Math Symbol
ValueCountFrequency (%)
< 4
44.4%
> 4
44.4%
~ 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 198
99.5%
. 1
 
0.5%
Space Separator
ValueCountFrequency (%)
1260
100.0%
Open Punctuation
ValueCountFrequency (%)
( 119
100.0%
Close Punctuation
ValueCountFrequency (%)
) 119
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Control
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3886
50.7%
Hangul 3729
48.7%
Latin 49
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
218
 
5.8%
202
 
5.4%
193
 
5.2%
186
 
5.0%
146
 
3.9%
144
 
3.9%
144
 
3.9%
143
 
3.8%
128
 
3.4%
88
 
2.4%
Other values (256) 2137
57.3%
Latin
ValueCountFrequency (%)
S 5
 
10.2%
e 5
 
10.2%
G 3
 
6.1%
C 2
 
4.1%
M 2
 
4.1%
A 2
 
4.1%
L 2
 
4.1%
n 2
 
4.1%
B 2
 
4.1%
T 2
 
4.1%
Other values (21) 22
44.9%
Common
ValueCountFrequency (%)
1260
32.4%
0 375
 
9.7%
1 356
 
9.2%
2 257
 
6.6%
3 248
 
6.4%
, 198
 
5.1%
5 183
 
4.7%
4 183
 
4.7%
7 145
 
3.7%
8 139
 
3.6%
Other values (10) 542
13.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3935
51.3%
Hangul 3729
48.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1260
32.0%
0 375
 
9.5%
1 356
 
9.0%
2 257
 
6.5%
3 248
 
6.3%
, 198
 
5.0%
5 183
 
4.7%
4 183
 
4.7%
7 145
 
3.7%
8 139
 
3.5%
Other values (41) 591
15.0%
Hangul
ValueCountFrequency (%)
218
 
5.8%
202
 
5.4%
193
 
5.2%
186
 
5.0%
146
 
3.9%
144
 
3.9%
144
 
3.9%
143
 
3.8%
128
 
3.4%
88
 
2.4%
Other values (256) 2137
57.3%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)1.0%
Memory size1.7 KiB

Unnamed: 9
Categorical

Distinct28
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2019.8.27.
27 
2016.11.4.
26 
2022.7.27.
18 
2015.8.24.
16 
2018.09.27.
13 
Other values (23)
96 

Length

Max length11
Median length10
Mean length10.02551
Min length3

Unique

Unique7 ?
Unique (%)3.6%

Sample

1st row<NA>
2nd row<NA>
3rd row지정일
4th row2000.06.27
5th row2000.06.27

Common Values

ValueCountFrequency (%)
2019.8.27. 27
13.8%
2016.11.4. 26
13.3%
2022.7.27. 18
 
9.2%
2015.8.24. 16
 
8.2%
2018.09.27. 13
 
6.6%
2014.12.12. 11
 
5.6%
2010.08.03 10
 
5.1%
2021.7.19. 9
 
4.6%
2000.06.27 9
 
4.6%
2014.02.26 9
 
4.6%
Other values (18) 48
24.5%

Length

2024-04-18T01:12:12.575557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2019.8.27 27
13.8%
2016.11.4 26
13.3%
2022.7.27 18
 
9.2%
2015.8.24 16
 
8.2%
2018.09.27 13
 
6.6%
2014.12.12 12
 
6.1%
2010.08.03 10
 
5.1%
2021.7.19 9
 
4.6%
2000.06.27 9
 
4.6%
2014.02.26 9
 
4.6%
Other values (17) 47
24.0%

Missing values

2024-04-18T01:12:07.841640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T01:12:07.953033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-18T01:12:08.064689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

중국 단체관광객 유치 전담여행사Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
0총 193개사 (2022. 7월)<NA><NA><NA><NA><NA><NA><NA>NaN<NA>
1NaN<NA><NA><NA><NA><NA><NA><NA>NaN<NA>
2번호업체명업체명(중문)대표자대표자(중문)전화팩스주소등록번호지정일
31(주)금룡여행사(株)金龙旅行社유옥붕刘玉鹏02-720-286102-720-286503981 서울특별시 마포구 성미산로 189, 2층20000627012000.06.27
42롯데관광(주)乐天观光(株)조광희赵光熙02-2078-665802-6442-392004543 서울특별시 중구 을지로11길 15 동화빌딩 501호(중국사업부)20000627022000.06.27
53(주)한진관광(株)韩进观光김정수金正洙02-726-554602-773-162304532 서울특별시 중구 소공로 88, 신관 9층 (소공동, 한진빌딩)20000627042000.06.27
64(주)계명세계여행(株)启明世界旅行김미숙金美淑02-732-888802-2630-256307213 서울특별시 영등포구 양평로 67 한강포스빌 420호20000627052000.06.27
75(주)내일관광여행사(株)来日观光旅行社이문균李文鈞02-773-388802-776-488803459 서울특별시 은평구 응암로 352, 302호(녹번동, 태선라이프빌딩)20000627092000.06.27
86(주)화인관광(株)华人观光손서장孙书壮02-322-919102-322-844804000 서울특별시 마포구 월드컵북로5길 22, 503호(서교동)20000627102000.06.27
97(주)태창여행사(株)泰昌旅行社왕덕안王德安02-323-588802-323-693304053 서울특별시 마포구 어울마당로 130, 기린빌딩 3층 377호 (서교동)20000627152000.06.27
중국 단체관광객 유치 전담여행사Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
186184성위관광(주)城尉观光유기룡刘其龙02-3141-036802-3141-036903938 서울특별시 마포구 월드컵로34길 20, B104호 (성산동, 동우자인채스토리상암)20220727092022.7.27.
187185㈜성진씨앤티(株)成进CNT조영훈曺永勳02-730-290402-730-117503170 서울특별시 종로구 새문안로5가길 28, 1403호 (적선동, 광화문플래티넘)20220727102022.7.27.
188186(주)신성세계여행사新盛世界旅行社유붕후刘鹏厚02-598-608002-522-224908806 서울특별시 관악구 남현1길 51, 601호 (남현동, 범평빌딩)20220727112022.7.27.
189187주식회사 아사달인터내셔날韩国阿斯达国际株式会社서 명徐 明02-6265-598802-6265-598903157 서울특별시 종로구 종로 19, 1217호 (종로1가, 르메이에르)20220727122022.7.27.
190188㈜에이치피티코리아HPTCOREA김응식金應植02-730-553202-730-553103182 서울특별시 종로구 새문안로 91, 713호 (신문로1가, 고려빌딩)20220727132022.7.27.
191189주식회사 유랑지구游览地球박진호朴镇皞02-9265-2910-04045 서울특별시 마포구 양화로 56, 903호 (서교동, 동양한강트레벨)20220727142022.7.27.
192190주식회사 케이씨티트래블韩国文旅장유재张有财02-6379-888802-6020-869807246 서울특별시 영등포구 국회대로50길 20, 103동 906호 (영등포동7가, 포레나영등포센트럴)20220727152022.7.27.
193191코리아 가이드 센터 주식회사韩国导游公司최성희崔盛熙051-715-0727051-715-072849037 부산광역시 영도구 봉래나루로 33, Sea-Side Complex Town 306-27호(대교동1가)20220727162022.7.27.
194192㈜티에이치인터내셔널通韩国际이 성李 成032-299-0999032-299-098921984 인천광역시 연수구 송도과학로 32, M동 902호 (송도테크노파크IT센터)20220727172022.7.27.
195193㈜해피투어여행사欣欣旅游사현숙佘贤淑02-395-808802-374-156603659 서울특별시 서대문구 가좌로 66, 2층 (홍은동, 성원빌딩)20220727182022.7.27.

Duplicate rows

Most frequently occurring

Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 9# duplicates
0<NA><NA><NA><NA><NA><NA><NA><NA>2