Overview

Dataset statistics

Number of variables: 10
Number of observations: 185
Missing cells: 17
Missing cells (%): 0.9%
Duplicate rows: 1
Duplicate rows (%): 0.5%
Total size in memory: 14.6 KiB
Average record size in memory: 80.7 B
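
The two percentages follow directly from the raw counts. As a quick arithmetic check, the sketch below recomputes them; the per-column missing counts are copied from the Alerts and Variables sections of this report, and the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 10
n_observations = 185
missing_cells = 17
duplicate_rows = 1

# Per-column missing counts as listed in the Alerts and Variables sections.
missing_by_column = {"중국 단체관광객 유치 전담여행사": 1, "Unnamed: 9": 0}
missing_by_column.update({f"Unnamed: {i}": 2 for i in range(1, 9)})

total_cells = n_variables * n_observations              # 10 * 185 cells
missing_pct = 100 * missing_cells / total_cells         # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations   # share of duplicate rows

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print("Per-column counts add up:", sum(missing_by_column.values()) == missing_cells)
```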

Variable types

Unsupported: 2
Text: 7
Categorical: 1

Dataset

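Description: A list of the dedicated travel agencies designated by the Ministry of Culture, Sports and Tourism (문화체육관광부) to handle inbound Chinese group tourists. It provides each agency's company name, representative, contact details, designation date, and related information.
Author: Ministry of Culture, Sports and Tourism (문화체육관광부)
URL: https://www.data.go.kr/data/15096067/fileData.do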

Alerts

Dataset has 1 (0.5%) duplicate rows [Duplicates]
Unnamed: 1 has 2 (1.1%) missing values [Missing]
Unnamed: 2 has 2 (1.1%) missing values [Missing]
Unnamed: 3 has 2 (1.1%) missing values [Missing]
Unnamed: 4 has 2 (1.1%) missing values [Missing]
Unnamed: 5 has 2 (1.1%) missing values [Missing]
Unnamed: 6 has 2 (1.1%) missing values [Missing]
Unnamed: 7 has 2 (1.1%) missing values [Missing]
Unnamed: 8 has 2 (1.1%) missing values [Missing]
중국 단체관광객 유치 전담여행사 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 8 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
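
The Unnamed: N column names and the two Unsupported alerts are consistent with the file's title line having been read as the header, while the real header row (번호, 업체명, ..., 지정일; that is, number, company name, ..., designation date) appears as data row 2 in the Sample section. The sketch below shows one way to locate the real header before profiling. It uses only Python's csv module on a made-up four-column excerpt; the function name read_with_real_header and the choice of 번호 as the header marker are assumptions for illustration, not part of the published dataset or of ydata-profiling.

```python
import csv
import io

# Hypothetical excerpt mimicking the layout seen in the Sample section:
# a title line, a summary line, an empty line, then the real header row.
raw = (
    "중국 단체관광객 유치 전담여행사,,,\n"
    "총 182개사 (2021. 7월),,,\n"
    ",,,\n"
    "번호,업체명,전화,지정일\n"
    "1,(주)금룡여행사,02-720-2861,2000.06.27\n"
    "2,롯데관광(주),02-2078-6658,2000.06.27\n"
)


def read_with_real_header(text, header_marker="번호"):
    """Return data rows as dicts keyed by the first row whose first cell is header_marker."""
    rows = list(csv.reader(io.StringIO(text)))
    # Raises StopIteration if no row starts with the marker.
    header_idx = next(
        i for i, row in enumerate(rows) if row and row[0] == header_marker
    )
    header = rows[header_idx]
    # Keep only non-empty rows that follow the header.
    return [dict(zip(header, row)) for row in rows[header_idx + 1:] if any(row)]


records = read_with_real_header(raw)
print(len(records), "records;", "columns:", list(records[0]))
```

Profiling the table after this step would give the columns their real names and keep the preamble rows out of the statistics.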

Reproduction

Analysis started: 2023-12-12 01:10:20.583421
Analysis finished: 2023-12-12 01:10:21.869357
Duration: 1.29 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

중국 단체관광객 유치 전담여행사
Unsupported

REJECTED  UNSUPPORTED 

Missing: 1
Missing (%): 0.5%
Memory size: 1.6 KiB

Unnamed: 1
Text

MISSING 

Distinct: 183
Distinct (%): 100.0%
Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Length

Max length: 14
Median length: 11
Mean length: 7.7103825
Min length: 3

Characters and Unicode

Total characters: 1411
Distinct characters: 208
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
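
As an illustration of those character properties, the sketch below uses Python's standard unicodedata module to count general categories in one company name from this column. It is not how ydata-profiling computes its figures internally; the helper name category_counts is an assumption.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of text by Unicode general category code (e.g. 'Lo', 'Ps')."""
    return Counter(unicodedata.category(ch) for ch in text)


# Hangul syllables are 'Lo' (Other Letter); ASCII parentheses are 'Ps' and 'Pe'.
print(category_counts("(주)금룡여행사"))

# The single-character abbreviation ㈜ is a symbol rather than a letter,
# which is why an "Other Symbol" category appears for this column.
print(unicodedata.category("㈜"))
```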

Unique

Unique: 183
Unique (%): 100.0%

Sample

1st row: 업체명
2nd row: (주)금룡여행사
3rd row: 롯데관광(주)
4th row: (주)한진관광
5th row: (주)계명세계여행

Value | Count | Frequency (%)
주식회사 | 14 | 7.1%
주)한국청년여행사 | 1 | 0.5%
㈜태화관광 | 1 | 0.5%
㈜신텐디여행사 | 1 | 0.5%
㈜아리바바 | 1 | 0.5%
여행스케치㈜ | 1 | 0.5%
유니언투어 | 1 | 0.5%
㈜일동월드와이드 | 1 | 0.5%
㈜잇츠코리아 | 1 | 0.5%
㈜제이트립 | 1 | 0.5%
Other values (174) | 174 | 88.3%

Most occurring characters

ValueCountFrequency (%)
87
 
6.2%
86
 
6.1%
84
 
6.0%
80
 
5.7%
80
 
5.7%
) 67
 
4.7%
43
 
3.0%
38
 
2.7%
( 37
 
2.6%
31
 
2.2%
Other values (198) 778
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1178
83.5%
Other Symbol 84
 
6.0%
Close Punctuation 67
 
4.7%
Open Punctuation 67
 
4.7%
Space Separator 15
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
7.4%
86
 
7.3%
80
 
6.8%
80
 
6.8%
43
 
3.7%
38
 
3.2%
31
 
2.6%
30
 
2.5%
26
 
2.2%
24
 
2.0%
Other values (193) 653
55.4%
Open Punctuation
ValueCountFrequency (%)
( 37
55.2%
30
44.8%
Other Symbol
ValueCountFrequency (%)
84
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1262
89.4%
Common 149
 
10.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
87
 
6.9%
86
 
6.8%
84
 
6.7%
80
 
6.3%
80
 
6.3%
43
 
3.4%
38
 
3.0%
31
 
2.5%
30
 
2.4%
26
 
2.1%
Other values (194) 677
53.6%
Common
ValueCountFrequency (%)
) 67
45.0%
( 37
24.8%
30
20.1%
15
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1178
83.5%
ASCII 119
 
8.4%
None 114
 
8.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
87
 
7.4%
86
 
7.3%
80
 
6.8%
80
 
6.8%
43
 
3.7%
38
 
3.2%
31
 
2.6%
30
 
2.5%
26
 
2.2%
24
 
2.0%
Other values (193) 653
55.4%
None
ValueCountFrequency (%)
84
73.7%
30
 
26.3%
ASCII
ValueCountFrequency (%)
) 67
56.3%
( 37
31.1%
15
 
12.6%

Unnamed: 2
Text

MISSING 

Distinct: 183
Distinct (%): 100.0%
Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Length

Max length: 39
Median length: 21
Mean length: 9.1748634
Min length: 2

Characters and Unicode

Total characters: 1679
Distinct characters: 280
Distinct categories: 7
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 183
Unique (%): 100.0%

Sample

1st row: 업체명(중문)
2nd row: (株)金龙旅行社
3rd row: 乐天观光(株)
4th row: (株)韩进观光
5th row: (株)启明世界旅行

Value | Count | Frequency (%)
tour | 5 | 2.1%
co | 4 | 1.7%
ltd | 4 | 1.7%
株式會社 | 3 | 1.3%
co.,ltd | 3 | 1.3%
korea | 2 | 0.9%
株式会社 | 2 | 0.9%
tours | 2 | 0.9%
株)親友旅行社 | 1 | 0.4%
株)眞星觀光旅行社 | 1 | 0.4%
Other values (206) | 206 | 88.4%

Most occurring characters

ValueCountFrequency (%)
128
 
7.6%
94
 
5.6%
( 92
 
5.5%
92
 
5.5%
74
 
4.4%
61
 
3.6%
52
 
3.1%
50
 
3.0%
48
 
2.9%
) 48
 
2.9%
Other values (270) 940
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1036
61.7%
Uppercase Letter 292
 
17.4%
Open Punctuation 122
 
7.3%
Close Punctuation 122
 
7.3%
Space Separator 52
 
3.1%
Lowercase Letter 36
 
2.1%
Other Punctuation 19
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
128
 
12.4%
94
 
9.1%
92
 
8.9%
61
 
5.9%
50
 
4.8%
48
 
4.6%
33
 
3.2%
24
 
2.3%
17
 
1.6%
16
 
1.5%
Other values (222) 473
45.7%
Uppercase Letter
ValueCountFrequency (%)
T 37
12.7%
O 30
 
10.3%
R 21
 
7.2%
L 20
 
6.8%
E 19
 
6.5%
C 19
 
6.5%
U 18
 
6.2%
N 17
 
5.8%
A 16
 
5.5%
I 15
 
5.1%
Other values (15) 80
27.4%
Lowercase Letter
ValueCountFrequency (%)
e 5
13.9%
r 5
13.9%
a 4
11.1%
s 4
11.1%
m 3
8.3%
o 3
8.3%
i 3
8.3%
d 2
 
5.6%
u 2
 
5.6%
t 2
 
5.6%
Other values (3) 3
8.3%
Other Punctuation
ValueCountFrequency (%)
. 7
36.8%
, 6
31.6%
& 4
21.1%
' 1
 
5.3%
1
 
5.3%
Open Punctuation
ValueCountFrequency (%)
( 92
75.4%
30
 
24.6%
Close Punctuation
ValueCountFrequency (%)
74
60.7%
) 48
39.3%
Space Separator
ValueCountFrequency (%)
52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 1031
61.4%
Latin 328
 
19.5%
Common 315
 
18.8%
Hangul 5
 
0.3%

Most frequent character per script

Han
ValueCountFrequency (%)
128
 
12.4%
94
 
9.1%
92
 
8.9%
61
 
5.9%
50
 
4.8%
48
 
4.7%
33
 
3.2%
24
 
2.3%
17
 
1.6%
16
 
1.6%
Other values (217) 468
45.4%
Latin
ValueCountFrequency (%)
T 37
 
11.3%
O 30
 
9.1%
R 21
 
6.4%
L 20
 
6.1%
E 19
 
5.8%
C 19
 
5.8%
U 18
 
5.5%
N 17
 
5.2%
A 16
 
4.9%
I 15
 
4.6%
Other values (28) 116
35.4%
Common
ValueCountFrequency (%)
( 92
29.2%
74
23.5%
52
16.5%
) 48
15.2%
30
 
9.5%
. 7
 
2.2%
, 6
 
1.9%
& 4
 
1.3%
' 1
 
0.3%
1
 
0.3%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
CJK 966
57.5%
ASCII 538
32.0%
None 104
 
6.2%
CJK Compat Ideographs 65
 
3.9%
Hangul 5
 
0.3%
Punctuation 1
 
0.1%

Most frequent character per block

CJK
ValueCountFrequency (%)
128
 
13.3%
94
 
9.7%
92
 
9.5%
50
 
5.2%
48
 
5.0%
33
 
3.4%
24
 
2.5%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (212) 449
46.5%
ASCII
ValueCountFrequency (%)
( 92
17.1%
52
 
9.7%
) 48
 
8.9%
T 37
 
6.9%
O 30
 
5.6%
R 21
 
3.9%
L 20
 
3.7%
E 19
 
3.5%
C 19
 
3.5%
U 18
 
3.3%
Other values (35) 182
33.8%
None
ValueCountFrequency (%)
74
71.2%
30
28.8%
CJK Compat Ideographs
ValueCountFrequency (%)
61
93.8%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Unnamed: 3
Text

MISSING 

Distinct: 183
Distinct (%): 100.0%
Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Length

Max length: 9
Median length: 3
Mean length: 3.3278689
Min length: 3

Characters and Unicode

Total characters: 609
Distinct characters: 168
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 183
Unique (%): 100.0%

Sample

1st row: 대표자
2nd row: 유옥붕
3rd row: 조광희 유동수
4th row: 김정수
5th row: 김미숙

Value | Count | Frequency (%)
(value not visible in source) | 2 | 1.0%
임문수 | 1 | 0.5%
김동영 | 1 | 0.5%
유수옥 | 1 | 0.5%
강완구 | 1 | 0.5%
최성희 | 1 | 0.5%
부동석 | 1 | 0.5%
강준구 | 1 | 0.5%
황남해 | 1 | 0.5%
이진용 | 1 | 0.5%
Other values (193) | 193 | 94.6%

Most occurring characters

ValueCountFrequency (%)
38
 
6.2%
32
 
5.3%
24
 
3.9%
16
 
2.6%
13
 
2.1%
12
 
2.0%
12
 
2.0%
11
 
1.8%
10
 
1.6%
10
 
1.6%
Other values (158) 431
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 562
92.3%
Space Separator 38
 
6.2%
Control 7
 
1.1%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
5.7%
24
 
4.3%
16
 
2.8%
13
 
2.3%
12
 
2.1%
12
 
2.1%
11
 
2.0%
10
 
1.8%
10
 
1.8%
9
 
1.6%
Other values (155) 413
73.5%
Space Separator
ValueCountFrequency (%)
38
100.0%
Control
ValueCountFrequency (%)
7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 562
92.3%
Common 47
 
7.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
5.7%
24
 
4.3%
16
 
2.8%
13
 
2.3%
12
 
2.1%
12
 
2.1%
11
 
2.0%
10
 
1.8%
10
 
1.8%
9
 
1.6%
Other values (155) 413
73.5%
Common
ValueCountFrequency (%)
38
80.9%
7
 
14.9%
, 2
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 562
92.3%
ASCII 47
 
7.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
80.9%
7
 
14.9%
, 2
 
4.3%
Hangul
ValueCountFrequency (%)
32
 
5.7%
24
 
4.3%
16
 
2.8%
13
 
2.3%
12
 
2.1%
12
 
2.1%
11
 
2.0%
10
 
1.8%
10
 
1.8%
9
 
1.6%
Other values (155) 413
73.5%

Unnamed: 4
Text

MISSING 

Distinct: 183
Distinct (%): 100.0%
Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Length

Max length: 15
Median length: 3
Mean length: 3.3224044
Min length: 2

Characters and Unicode

Total characters: 608
Distinct characters: 321
Distinct categories: 7
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 183
Unique (%): 100.0%

Sample

1st row: 대표자(중문)
2nd row: 刘玉鹏
3rd row: 赵光熙 刘东秀
4th row: 金正洙
5th row: 金美淑

Value | Count | Frequency (%)
(value not visible in source) | 2 | 1.0%
李素賢 | 1 | 0.5%
柳秀玉 | 1 | 0.5%
姜完求 | 1 | 0.5%
崔盛熙 | 1 | 0.5%
夫東錫 | 1 | 0.5%
姜俊求 | 1 | 0.5%
黃南海 | 1 | 0.5%
李進龍 | 1 | 0.5%
李振守 | 1 | 0.5%
Other values (193) | 193 | 94.6%

Most occurring characters

ValueCountFrequency (%)
33
 
5.4%
26
 
4.3%
13
 
2.1%
12
 
2.0%
11
 
1.8%
10
 
1.6%
8
 
1.3%
7
 
1.2%
6
 
1.0%
6
 
1.0%
Other values (311) 476
78.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 563
92.6%
Space Separator 26
 
4.3%
Control 7
 
1.2%
Uppercase Letter 6
 
1.0%
Other Punctuation 2
 
0.3%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
5.9%
13
 
2.3%
12
 
2.1%
11
 
2.0%
10
 
1.8%
8
 
1.4%
6
 
1.1%
6
 
1.1%
5
 
0.9%
5
 
0.9%
Other values (300) 454
80.6%
Uppercase Letter
ValueCountFrequency (%)
Z 1
16.7%
H 1
16.7%
A 1
16.7%
O 1
16.7%
X 1
16.7%
I 1
16.7%
Space Separator
ValueCountFrequency (%)
26
100.0%
Control
ValueCountFrequency (%)
7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 558
91.8%
Common 39
 
6.4%
Latin 6
 
1.0%
Hangul 5
 
0.8%

Most frequent character per script

Han
ValueCountFrequency (%)
33
 
5.9%
13
 
2.3%
12
 
2.2%
11
 
2.0%
10
 
1.8%
8
 
1.4%
6
 
1.1%
6
 
1.1%
5
 
0.9%
5
 
0.9%
Other values (295) 449
80.5%
Latin
ValueCountFrequency (%)
Z 1
16.7%
H 1
16.7%
A 1
16.7%
O 1
16.7%
X 1
16.7%
I 1
16.7%
Common
ValueCountFrequency (%)
26
66.7%
7
 
17.9%
, 2
 
5.1%
) 2
 
5.1%
( 2
 
5.1%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
CJK 545
89.6%
ASCII 45
 
7.4%
CJK Compat Ideographs 13
 
2.1%
Hangul 5
 
0.8%

Most frequent character per block

CJK
ValueCountFrequency (%)
33
 
6.1%
13
 
2.4%
12
 
2.2%
10
 
1.8%
8
 
1.5%
6
 
1.1%
6
 
1.1%
5
 
0.9%
5
 
0.9%
5
 
0.9%
Other values (292) 442
81.1%
ASCII
ValueCountFrequency (%)
26
57.8%
7
 
15.6%
, 2
 
4.4%
) 2
 
4.4%
( 2
 
4.4%
Z 1
 
2.2%
H 1
 
2.2%
A 1
 
2.2%
O 1
 
2.2%
X 1
 
2.2%
CJK Compat Ideographs
ValueCountFrequency (%)
11
84.6%
1
 
7.7%
1
 
7.7%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Unnamed: 5
Text

MISSING 

Distinct: 183
Distinct (%): 100.0%
Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.508197
Min length: 2

Characters and Unicode

Total characters: 2106
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 183
Unique (%): 100.0%

Sample

1st row: 전화
2nd row: 02-720-2861
3rd row: 02-2078-6658
4th row: 02-726-5546
5th row: 02-732-8888

Value | Count | Frequency (%)
02-701-8858 | 1 | 0.5%
02-313-8842 | 1 | 0.5%
033-262-0686 | 1 | 0.5%
02-332-5959 | 1 | 0.5%
053-424-1400 | 1 | 0.5%
02-3144-4689 | 1 | 0.5%
02-725-3400 | 1 | 0.5%
070-7709-6124 | 1 | 0.5%
064-702-8801 | 1 | 0.5%
051-465-3333 | 1 | 0.5%
Other values (173) | 173 | 94.5%

Most occurring characters

ValueCountFrequency (%)
- 364
17.3%
0 318
15.1%
2 293
13.9%
3 196
9.3%
8 185
8.8%
7 166
7.9%
6 146
6.9%
1 133
 
6.3%
5 113
 
5.4%
4 101
 
4.8%
Other values (3) 91
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1740
82.6%
Dash Punctuation 364
 
17.3%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 318
18.3%
2 293
16.8%
3 196
11.3%
8 185
10.6%
7 166
9.5%
6 146
8.4%
1 133
7.6%
5 113
 
6.5%
4 101
 
5.8%
9 89
 
5.1%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 364
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2104
99.9%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 364
17.3%
0 318
15.1%
2 293
13.9%
3 196
9.3%
8 185
8.8%
7 166
7.9%
6 146
6.9%
1 133
 
6.3%
5 113
 
5.4%
4 101
 
4.8%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2104
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 364
17.3%
0 318
15.1%
2 293
13.9%
3 196
9.3%
8 185
8.8%
7 166
7.9%
6 146
6.9%
1 133
 
6.3%
5 113
 
5.4%
4 101
 
4.8%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 6
Text

MISSING 

Distinct: 180
Distinct (%): 98.4%
Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.52459
Min length: 1

Characters and Unicode

Total characters: 2109
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 177
Unique (%): 96.7%
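
Distinct counts the different values in the column, while Unique counts the values that occur exactly once. For this column the reported figures are consistent with three fax numbers appearing twice each: 177 + 3 = 180 distinct values, and 177 + 2 * 3 = 183 non-missing rows (185 observations minus 2 missing). A minimal sketch of the two measures, using a hypothetical helper name and a short made-up list that reuses fax numbers from this report:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique), ignoring missing values represented as None.

    distinct: number of different values
    unique:   number of values that occur exactly once
    """
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


faxes = ["02-338-8086", "02-338-8086", "02-718-1689", "031-245-6350", None]
print(distinct_and_unique(faxes))
```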

Sample

1st row: 팩스
2nd row: 02-720-2865
3rd row: 02-6442-3920
4th row: 02-773-1623
5th row: 02-2630-2563

Value | Count | Frequency (%)
02-338-8086 | 2 | 1.1%
02-718-1689 | 2 | 1.1%
02-6323-8866 | 2 | 1.1%
031-245-6350 | 1 | 0.5%
02-3143-6688 | 1 | 0.5%
053-427-3934 | 1 | 0.5%
032-724-9019 | 1 | 0.5%
055-232-7688 | 1 | 0.5%
02-322-5893 | 1 | 0.5%
02-326-1818 | 1 | 0.5%
Other values (170) | 170 | 92.9%

Most occurring characters

ValueCountFrequency (%)
- 363
17.2%
2 296
14.0%
0 289
13.7%
3 187
8.9%
8 180
8.5%
7 159
7.5%
6 149
7.1%
5 134
 
6.4%
1 132
 
6.3%
4 126
 
6.0%
Other values (3) 94
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1744
82.7%
Dash Punctuation 363
 
17.2%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 296
17.0%
0 289
16.6%
3 187
10.7%
8 180
10.3%
7 159
9.1%
6 149
8.5%
5 134
7.7%
1 132
7.6%
4 126
7.2%
9 92
 
5.3%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 363
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2107
99.9%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 363
17.2%
2 296
14.0%
0 289
13.7%
3 187
8.9%
8 180
8.5%
7 159
7.5%
6 149
7.1%
5 134
 
6.4%
1 132
 
6.3%
4 126
 
6.0%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2107
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 363
17.2%
2 296
14.0%
0 289
13.7%
3 187
8.9%
8 180
8.5%
7 159
7.5%
6 149
7.1%
5 134
 
6.4%
1 132
 
6.3%
4 126
 
6.0%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 7
Text

MISSING 

Distinct: 183
Distinct (%): 100.0%
Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Length

Max length: 82
Median length: 52
Mean length: 36.874317
Min length: 2

Characters and Unicode

Total characters: 6748
Distinct characters: 280
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 183
Unique (%): 100.0%

Sample

1st row: 주소
2nd row: 03981 서울특별시 마포구 성미산로 189, 2층
3rd row: 04543 서울특별시 중구 을지로11길 15 동화빌딩 501호(중국사업부)
4th row: 04532 서울특별시 중구 소공로 88, 2,5층 (소공동, 한진빌딩)
5th row: 07213 서울특별시 영등포구 양평로 67 한강포스빌 420호

Value | Count | Frequency (%)
서울특별시 | 127 | 9.9%
마포구 | 34 | 2.6%
중구 | 28 | 2.2%
영등포구 | 22 | 1.7%
2층 | 19 | 1.5%
종로구 | 16 | 1.2%
경기도 | 14 | 1.1%
서대문구 | 10 | 0.8%
제주시 | 7 | 0.5%
3층 | 7 | 0.5%
Other values (761) | 1004 | 78.0%

Most occurring characters

ValueCountFrequency (%)
1114
 
16.5%
0 347
 
5.1%
1 333
 
4.9%
2 250
 
3.7%
3 229
 
3.4%
206
 
3.1%
4 196
 
2.9%
196
 
2.9%
178
 
2.6%
176
 
2.6%
Other values (270) 3523
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3270
48.5%
Decimal Number 2031
30.1%
Space Separator 1114
 
16.5%
Other Punctuation 166
 
2.5%
Close Punctuation 57
 
0.8%
Open Punctuation 57
 
0.8%
Dash Punctuation 22
 
0.3%
Uppercase Letter 14
 
0.2%
Math Symbol 10
 
0.1%
Control 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
206
 
6.3%
196
 
6.0%
178
 
5.4%
176
 
5.4%
139
 
4.3%
134
 
4.1%
134
 
4.1%
123
 
3.8%
88
 
2.7%
75
 
2.3%
Other values (240) 1821
55.7%
Decimal Number
ValueCountFrequency (%)
0 347
17.1%
1 333
16.4%
2 250
12.3%
3 229
11.3%
4 196
9.7%
5 163
8.0%
6 134
 
6.6%
8 132
 
6.5%
7 131
 
6.5%
9 116
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
A 3
21.4%
S 2
14.3%
B 2
14.3%
I 2
14.3%
D 1
 
7.1%
L 1
 
7.1%
T 1
 
7.1%
G 1
 
7.1%
R 1
 
7.1%
Math Symbol
ValueCountFrequency (%)
> 4
40.0%
< 4
40.0%
~ 2
20.0%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
s 1
50.0%
Space Separator
ValueCountFrequency (%)
1114
100.0%
Other Punctuation
ValueCountFrequency (%)
, 166
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Control
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3462
51.3%
Hangul 3270
48.5%
Latin 16
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
206
 
6.3%
196
 
6.0%
178
 
5.4%
176
 
5.4%
139
 
4.3%
134
 
4.1%
134
 
4.1%
123
 
3.8%
88
 
2.7%
75
 
2.3%
Other values (240) 1821
55.7%
Common
ValueCountFrequency (%)
1114
32.2%
0 347
 
10.0%
1 333
 
9.6%
2 250
 
7.2%
3 229
 
6.6%
4 196
 
5.7%
, 166
 
4.8%
5 163
 
4.7%
6 134
 
3.9%
8 132
 
3.8%
Other values (9) 398
 
11.5%
Latin
ValueCountFrequency (%)
A 3
18.8%
S 2
12.5%
B 2
12.5%
I 2
12.5%
D 1
 
6.2%
L 1
 
6.2%
T 1
 
6.2%
G 1
 
6.2%
R 1
 
6.2%
k 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3478
51.5%
Hangul 3270
48.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1114
32.0%
0 347
 
10.0%
1 333
 
9.6%
2 250
 
7.2%
3 229
 
6.6%
4 196
 
5.6%
, 166
 
4.8%
5 163
 
4.7%
6 134
 
3.9%
8 132
 
3.8%
Other values (20) 414
 
11.9%
Hangul
ValueCountFrequency (%)
206
 
6.3%
196
 
6.0%
178
 
5.4%
176
 
5.4%
139
 
4.3%
134
 
4.1%
134
 
4.1%
123
 
3.8%
88
 
2.7%
75
 
2.3%
Other values (240) 1821
55.7%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 1.1%
Memory size: 1.6 KiB

Unnamed: 9
Categorical

Distinct: 27
Distinct (%): 14.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
2019.8.27. | 27
2016.11.4. | 27
2015.8.24. | 16
2018.09.27. | 14
2014.12.12. | 11
Other values (22) | 90

Length

Max length: 11
Median length: 10
Mean length: 10.032432
Min length: 3

Unique

Unique: 7
Unique (%): 3.8%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 지정일
4th row: 2000.06.27
5th row: 2000.06.27

Common Values

Value | Count | Frequency (%)
2019.8.27. | 27 | 14.6%
2016.11.4. | 27 | 14.6%
2015.8.24. | 16 | 8.6%
2018.09.27. | 14 | 7.6%
2014.12.12. | 11 | 5.9%
2010.08.03 | 10 | 5.4%
2000.06.27 | 10 | 5.4%
2021.7.19. | 9 | 4.9%
2014.02.26 | 9 | 4.9%
2012.03.09 | 8 | 4.3%
Other values (17) | 44 | 23.8%
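
The values above mix at least two notations for the same kind of date: zero-padded without a trailing dot (2000.06.27) and unpadded with a trailing dot (2019.8.27.). That inconsistency is one likely reason the column is profiled as Categorical rather than as a date. The sketch below normalizes both notations with Python's standard library; the function name parse_designation_date and the accepted pattern are assumptions based only on the values visible in this report, so other notations in the full file may need extra handling.

```python
import re
from datetime import date

# Year.month.day with optional zero-padding and an optional trailing dot.
_DATE_RE = re.compile(r"^\s*(\d{4})\.(\d{1,2})\.(\d{1,2})\.?\s*$")


def parse_designation_date(raw):
    """Return a datetime.date for 'YYYY.M.D' or 'YYYY.MM.DD.' strings, else None."""
    if raw is None:
        return None
    match = _DATE_RE.match(raw)
    if match is None:
        return None  # e.g. the header label 지정일 that was read as data
    year, month, day = (int(part) for part in match.groups())
    try:
        return date(year, month, day)
    except ValueError:
        return None  # syntactically valid but not a real calendar date


for raw in ["2019.8.27.", "2000.06.27", "지정일", None]:
    print(repr(raw), "->", parse_designation_date(raw))
```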

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
2019.8.27 | 27 | 14.6%
2016.11.4 | 27 | 14.6%
2015.8.24 | 16 | 8.6%
2018.09.27 | 14 | 7.6%
2014.12.12 | 12 | 6.5%
2010.08.03 | 10 | 5.4%
2000.06.27 | 10 | 5.4%
2021.7.19 | 9 | 4.9%
2014.02.26 | 9 | 4.9%
2012.03.09 | 8 | 4.3%
Other values (16) | 43 | 23.2%

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
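
To make the idea of nullity correlation concrete, the sketch below computes a Pearson correlation between the presence indicators (1 = present, 0 = missing) of two columns. This assumes the heatmap is based on that kind of correlation; the function name and the tiny example columns are illustrative and are not taken from the profiling library.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the 'is present' indicators of two columns.

    Missing values are represented as None. Returns None when either column
    is entirely present or entirely missing, since the correlation is then undefined.
    """
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Two columns that are missing in exactly the same rows, as in the first two
# rows of this dataset's sample.
names = [None, None, "업체명", "(주)금룡여행사"]
phones = [None, None, "전화", "02-720-2861"]
print(nullity_correlation(names, phones))
```

A value near 1 means the two columns tend to be missing in the same rows, which matches the pattern here: Unnamed: 1 through Unnamed: 8 each report the same 2 missing values.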

Sample

First rows

(index) | 중국 단체관광객 유치 전담여행사 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9
0 | 총 182개사 (2021. 7월) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | NaN | <NA>
1 | NaN | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | NaN | <NA>
2 | 번호 | 업체명 | 업체명(중문) | 대표자 | 대표자(중문) | 전화 | 팩스 | 주소 | 등록번호 | 지정일
3 | 1 | (주)금룡여행사 | (株)金龙旅行社 | 유옥붕 | 刘玉鹏 | 02-720-2861 | 02-720-2865 | 03981 서울특별시 마포구 성미산로 189, 2층 | 2000062701 | 2000.06.27
4 | 2 | 롯데관광(주) | 乐天观光(株) | 조광희 유동수 | 赵光熙 刘东秀 | 02-2078-6658 | 02-6442-3920 | 04543 서울특별시 중구 을지로11길 15 동화빌딩 501호(중국사업부) | 2000062702 | 2000.06.27
5 | 3 | (주)한진관광 | (株)韩进观光 | 김정수 | 金正洙 | 02-726-5546 | 02-773-1623 | 04532 서울특별시 중구 소공로 88, 2,5층 (소공동, 한진빌딩) | 2000062704 | 2000.06.27
6 | 4 | (주)계명세계여행 | (株)启明世界旅行 | 김미숙 | 金美淑 | 02-732-8888 | 02-2630-2563 | 07213 서울특별시 영등포구 양평로 67 한강포스빌 420호 | 2000062705 | 2000.06.27
7 | 5 | (주)내일관광여행사 | (株)来日观光旅行社 | 이문균 | 李文鈞 | 02-773-3888 | 02-776-4888 | 03710 서울특별시 서대문구 모래내로 207, 1동 401호 | 2000062709 | 2000.06.27
8 | 6 | (주)화인관광 | (株)华人观光 | 손서장 | 孙书壮 | 02-322-9191 | 02-322-8448 | 04051 서울특별시 마포구 홍익로 6길 67, 연희빌딩 401호 | 2000062710 | 2000.06.27
9 | 7 | (주)한국중국여행사 | (株)韩国中国旅行社 | 조학령 조 반 | 曹学玲 曹 班 | 02-752-3399 | 02-757-3737 | 04522 서울특별시 중구 다동길 46, 다동빌딩 402호 | 2000062713 | 2000.06.27

Last rows

(index) | 중국 단체관광객 유치 전담여행사 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9
175 | 173 | ㈜화동여행사 | (株)華東旅行社 | 김영백 | 金永栢 | 02-2269-9998 | 02-2273-1688 | 03017 서울특별시 종로구 자하문로 280, 청하빌딩 511호 (부암동) | 2019082728 | 2019.8.27.
176 | 174 | ㈜무브 | 牟福 | 최민석 | 崔珉碩 | 02-1877-2025 | 0504-342-0871 | 04147 서울특별시 마포구 백범로31길 21 서울창업허브 517호 | 2021071901 | 2021.7.19.
177 | 175 | 썬투어즈㈜ | 焯辰旅遊 | 로유룬 | 羅宇麟 | 02-6356-8880 | 02-6356-8881 | 10842 경기도 파주시 문산읍 휴암로 538, 714호 | 2021071902 | 2021.7.19.
178 | 176 | 주식회사 엔에이디 | 鄭氾中韓國濟旅行社 | 정 범 | 鄭 氾 | 032-764-7700 | 032-765-9810 | 22303 인천광역시 중구 월미로 266, 2층 | 2021071903 | 2021.7.19.
179 | 177 | ㈜재미난투어 | (株)有趣旅行社 | 이 훈 | 李 勋 | 070-8835-1835 | 051-466-1835 | 48821 부산광역시 동구 중앙대로180번길 13, 901호 | 2021071904 | 2021.7.19.
180 | 178 | 주식회사 지제이투어 | GJTOUR | 강대위 | 姜大威 | 02-332-1685 | 02-3143-6688 | 03716 서울특별시 서대문구 연희로5길 54-3, 501호 | 2021071905 | 2021.7.19.
181 | 179 | ㈜케이앤디알 | (株)KNDR | 유새하 | 柳思荷 | 031-233-6350 | 031-245-6350 | 16252 경기도 수원시 팔달구 화서문로 64, 금강빌딩 203호 | 2021071906 | 2021.7.19.
182 | 180 | 주식회사 킴스엠앤티 | Kim's M&T | 김춘추 | 金春秋 | 02-570-3590 | 02-575-9828 | 06296 서울특별시 강남구 남부순환로 2728, 유일빌딩 5층 | 2021071907 | 2021.7.19.
183 | 181 | ㈜포시즌여행사 | (株)四季旅行社 | 한만형 | 韓萬亨 | 02-2038-3369 | - | 10016 경기도 김포시 통진읍 담터로55번길 32, 2층 | 2021071908 | 2021.7.19.
184 | 182 | ㈜한국교육여행사 | 韩国教育旅行社 | 구자만 | 具滋蔓 | 02-1644-8230 | 02-6442-1745 | 03992 서울특별시 마포구 동교로25길 11, 2층 | 2021071909 | 2021.7.19.

Duplicate rows

Most frequently occurring

(index) | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 9 | # duplicates
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2
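
The table above lists only Unnamed: 1 to Unnamed: 7 and Unnamed: 9, leaving out the two Unsupported columns. One plausible reading, offered here as an inference rather than something the report states, is that the two leading rows of the file (the summary row and the empty row) become identical once those columns are dropped. The sketch below illustrates duplicate counting with ignored columns on a made-up three-column excerpt; the function name duplicate_row_counts is hypothetical.

```python
from collections import Counter


def duplicate_row_counts(rows, ignore=()):
    """Map each repeated row to its number of occurrences.

    Rows are compared only on the column positions not listed in `ignore`.
    """
    keyed = [
        tuple(value for i, value in enumerate(row) if i not in ignore)
        for row in rows
    ]
    return {key: n for key, n in Counter(keyed).items() if n > 1}


rows = [
    ("총 182개사 (2021. 7월)", None, None),  # summary row
    (None, None, None),                      # empty row
    ("번호", "업체명", "지정일"),             # real header row, read as data
]

print(duplicate_row_counts(rows))              # all columns compared
print(duplicate_row_counts(rows, ignore={0}))  # first column left out
```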