Overview

Dataset statistics

Number of variables10
Number of observations185
Missing cells17
Missing cells (%)0.9%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory14.6 KiB
Average record size in memory80.7 B

Variable types

Unsupported2
Text7
Categorical1

Dataset

Description문화체육관광부가 지정한 중국인 단체관광객 유치 업무를 할 수 있는 전담여행사 목록으로 업체명, 대표자, 연락처, 지정일자 등의 정보를 제공합니다.
Author문화체육관광부
URLhttps://www.data.go.kr/data/15096591/fileData.do

Alerts

Dataset has 1 (0.5%) duplicate rowsDuplicates
Unnamed: 1 has 2 (1.1%) missing valuesMissing
Unnamed: 2 has 2 (1.1%) missing valuesMissing
Unnamed: 3 has 2 (1.1%) missing valuesMissing
Unnamed: 4 has 2 (1.1%) missing valuesMissing
Unnamed: 5 has 2 (1.1%) missing valuesMissing
Unnamed: 6 has 2 (1.1%) missing valuesMissing
Unnamed: 7 has 2 (1.1%) missing valuesMissing
Unnamed: 8 has 2 (1.1%) missing valuesMissing
중국 단체관광객 유치 전담여행사 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 23:09:07.924428
Analysis finished2023-12-11 23:09:08.961699
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

중국 단체관광객 유치 전담여행사
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.5%
Memory size1.6 KiB

Unnamed: 1
Text

MISSING 

Distinct183
Distinct (%)100.0%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T08:09:09.101790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.6557377
Min length3

Characters and Unicode

Total characters1584
Distinct characters204
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st row업체명
2nd row(주)금룡여행사
3rd row롯데관광(주)
4th row(주)한진관광
5th row(주)계명세계여행
ValueCountFrequency (%)
주식회사 16
 
8.0%
주)태화관광 1
 
0.5%
위앤라이 1
 
0.5%
주)아리바바 1
 
0.5%
여행스케치(주 1
 
0.5%
유니언투어 1
 
0.5%
주)일동월드와이드 1
 
0.5%
주)잇츠코리아 1
 
0.5%
주)제이트립 1
 
0.5%
주)진성관광여행사 1
 
0.5%
Other values (174) 174
87.4%
2023-12-12T08:09:09.408574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
171
 
10.8%
( 149
 
9.4%
) 149
 
9.4%
89
 
5.6%
79
 
5.0%
79
 
5.0%
43
 
2.7%
38
 
2.4%
31
 
2.0%
30
 
1.9%
Other values (194) 726
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1269
80.1%
Open Punctuation 149
 
9.4%
Close Punctuation 149
 
9.4%
Space Separator 17
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
 
13.5%
89
 
7.0%
79
 
6.2%
79
 
6.2%
43
 
3.4%
38
 
3.0%
31
 
2.4%
30
 
2.4%
27
 
2.1%
23
 
1.8%
Other values (191) 659
51.9%
Open Punctuation
ValueCountFrequency (%)
( 149
100.0%
Close Punctuation
ValueCountFrequency (%)
) 149
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1269
80.1%
Common 315
 
19.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
 
13.5%
89
 
7.0%
79
 
6.2%
79
 
6.2%
43
 
3.4%
38
 
3.0%
31
 
2.4%
30
 
2.4%
27
 
2.1%
23
 
1.8%
Other values (191) 659
51.9%
Common
ValueCountFrequency (%)
( 149
47.3%
) 149
47.3%
17
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1269
80.1%
ASCII 315
 
19.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
171
 
13.5%
89
 
7.0%
79
 
6.2%
79
 
6.2%
43
 
3.4%
38
 
3.0%
31
 
2.4%
30
 
2.4%
27
 
2.1%
23
 
1.8%
Other values (191) 659
51.9%
ASCII
ValueCountFrequency (%)
( 149
47.3%
) 149
47.3%
17
 
5.4%

Unnamed: 2
Text

MISSING 

Distinct183
Distinct (%)100.0%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T08:09:09.694530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length21
Mean length9.147541
Min length2

Characters and Unicode

Total characters1674
Distinct characters277
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st row업체명(중문)
2nd row(株)金龙旅行社
3rd row乐天观光(株)
4th row(株)韩进观光
5th row(株)启明世界旅行
ValueCountFrequency (%)
tour 5
 
2.2%
ltd 4
 
1.7%
co 4
 
1.7%
co.,ltd 3
 
1.3%
株式會社 3
 
1.3%
tours 2
 
0.9%
株式会社 2
 
0.9%
korea 2
 
0.9%
株)眞星觀光旅行社 1
 
0.4%
濟州趣遊 1
 
0.4%
Other values (204) 204
88.3%
2023-12-12T08:09:10.112282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
130
 
7.8%
( 121
 
7.2%
) 121
 
7.2%
96
 
5.7%
91
 
5.4%
61
 
3.6%
50
 
3.0%
49
 
2.9%
48
 
2.9%
T 37
 
2.2%
Other values (267) 870
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1035
61.8%
Uppercase Letter 300
 
17.9%
Open Punctuation 121
 
7.2%
Close Punctuation 121
 
7.2%
Space Separator 50
 
3.0%
Lowercase Letter 28
 
1.7%
Other Punctuation 19
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
130
 
12.6%
96
 
9.3%
91
 
8.8%
61
 
5.9%
49
 
4.7%
48
 
4.6%
33
 
3.2%
24
 
2.3%
17
 
1.6%
16
 
1.5%
Other values (222) 470
45.4%
Uppercase Letter
ValueCountFrequency (%)
T 37
12.3%
O 30
 
10.0%
R 22
 
7.3%
L 20
 
6.7%
E 20
 
6.7%
C 19
 
6.3%
U 19
 
6.3%
N 18
 
6.0%
A 18
 
6.0%
I 15
 
5.0%
Other values (15) 82
27.3%
Lowercase Letter
ValueCountFrequency (%)
a 4
14.3%
r 3
10.7%
e 3
10.7%
s 3
10.7%
o 3
10.7%
m 3
10.7%
u 2
7.1%
i 2
7.1%
d 2
7.1%
t 1
 
3.6%
Other values (2) 2
7.1%
Other Punctuation
ValueCountFrequency (%)
. 7
36.8%
, 6
31.6%
& 4
21.1%
' 1
 
5.3%
1
 
5.3%
Open Punctuation
ValueCountFrequency (%)
( 121
100.0%
Close Punctuation
ValueCountFrequency (%)
) 121
100.0%
Space Separator
ValueCountFrequency (%)
50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 1030
61.5%
Latin 328
 
19.6%
Common 311
 
18.6%
Hangul 5
 
0.3%

Most frequent character per script

Han
ValueCountFrequency (%)
130
 
12.6%
96
 
9.3%
91
 
8.8%
61
 
5.9%
49
 
4.8%
48
 
4.7%
33
 
3.2%
24
 
2.3%
17
 
1.7%
16
 
1.6%
Other values (217) 465
45.1%
Latin
ValueCountFrequency (%)
T 37
 
11.3%
O 30
 
9.1%
R 22
 
6.7%
L 20
 
6.1%
E 20
 
6.1%
C 19
 
5.8%
U 19
 
5.8%
N 18
 
5.5%
A 18
 
5.5%
I 15
 
4.6%
Other values (27) 110
33.5%
Common
ValueCountFrequency (%)
( 121
38.9%
) 121
38.9%
50
16.1%
. 7
 
2.3%
, 6
 
1.9%
& 4
 
1.3%
' 1
 
0.3%
1
 
0.3%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
CJK 965
57.6%
ASCII 638
38.1%
CJK Compat Ideographs 65
 
3.9%
Hangul 5
 
0.3%
Punctuation 1
 
0.1%

Most frequent character per block

CJK
ValueCountFrequency (%)
130
 
13.5%
96
 
9.9%
91
 
9.4%
49
 
5.1%
48
 
5.0%
33
 
3.4%
24
 
2.5%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (212) 446
46.2%
ASCII
ValueCountFrequency (%)
( 121
19.0%
) 121
19.0%
50
 
7.8%
T 37
 
5.8%
O 30
 
4.7%
R 22
 
3.4%
L 20
 
3.1%
E 20
 
3.1%
C 19
 
3.0%
U 19
 
3.0%
Other values (34) 179
28.1%
CJK Compat Ideographs
ValueCountFrequency (%)
61
93.8%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Unnamed: 3
Text

MISSING 

Distinct183
Distinct (%)100.0%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T08:09:10.505691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.3497268
Min length3

Characters and Unicode

Total characters613
Distinct characters170
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st row대표자
2nd row유옥붕
3rd row조광희
4th row김정수
5th row김미숙
ValueCountFrequency (%)
2
 
1.0%
임문수 1
 
0.5%
이진용 1
 
0.5%
조창희 1
 
0.5%
민서진 1
 
0.5%
김동영 1
 
0.5%
유수옥 1
 
0.5%
강완구 1
 
0.5%
최성희 1
 
0.5%
부동석 1
 
0.5%
Other values (194) 194
94.6%
2023-12-12T08:09:10.998459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
 
5.4%
32
 
5.2%
26
 
4.2%
14
 
2.3%
14
 
2.3%
13
 
2.1%
12
 
2.0%
10
 
1.6%
10
 
1.6%
10
 
1.6%
Other values (160) 439
71.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 569
92.8%
Space Separator 33
 
5.4%
Control 10
 
1.6%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
5.6%
26
 
4.6%
14
 
2.5%
14
 
2.5%
13
 
2.3%
12
 
2.1%
10
 
1.8%
10
 
1.8%
9
 
1.6%
8
 
1.4%
Other values (157) 421
74.0%
Space Separator
ValueCountFrequency (%)
33
100.0%
Control
ValueCountFrequency (%)
10
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 569
92.8%
Common 44
 
7.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
5.6%
26
 
4.6%
14
 
2.5%
14
 
2.5%
13
 
2.3%
12
 
2.1%
10
 
1.8%
10
 
1.8%
9
 
1.6%
8
 
1.4%
Other values (157) 421
74.0%
Common
ValueCountFrequency (%)
33
75.0%
10
 
22.7%
, 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 569
92.8%
ASCII 44
 
7.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33
75.0%
10
 
22.7%
, 1
 
2.3%
Hangul
ValueCountFrequency (%)
32
 
5.6%
26
 
4.6%
14
 
2.5%
14
 
2.5%
13
 
2.3%
12
 
2.1%
10
 
1.8%
10
 
1.8%
9
 
1.6%
8
 
1.4%
Other values (157) 421
74.0%

Unnamed: 4
Text

MISSING 

Distinct183
Distinct (%)100.0%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T08:09:11.338540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length3
Mean length3.3442623
Min length2

Characters and Unicode

Total characters612
Distinct characters330
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st row대표자(중문)
2nd row刘玉鹏
3rd row赵光熙
4th row金正洙
5th row金美淑
ValueCountFrequency (%)
2
 
1.0%
林文秀 1
 
0.5%
李振守 1
 
0.5%
闵叙振 1
 
0.5%
金东永 1
 
0.5%
柳秀玉 1
 
0.5%
姜完求 1
 
0.5%
崔盛熙 1
 
0.5%
夫東錫 1
 
0.5%
姜俊求 1
 
0.5%
Other values (193) 193
94.6%
2023-12-12T08:09:11.829435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
 
5.4%
21
 
3.4%
16
 
2.6%
13
 
2.1%
10
 
1.6%
10
 
1.6%
8
 
1.3%
7
 
1.1%
6
 
1.0%
6
 
1.0%
Other values (320) 482
78.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 570
93.1%
Space Separator 21
 
3.4%
Control 10
 
1.6%
Uppercase Letter 6
 
1.0%
Open Punctuation 2
 
0.3%
Close Punctuation 2
 
0.3%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
5.8%
16
 
2.8%
13
 
2.3%
10
 
1.8%
8
 
1.4%
7
 
1.2%
6
 
1.1%
6
 
1.1%
6
 
1.1%
5
 
0.9%
Other values (309) 460
80.7%
Uppercase Letter
ValueCountFrequency (%)
O 1
16.7%
I 1
16.7%
A 1
16.7%
H 1
16.7%
Z 1
16.7%
X 1
16.7%
Space Separator
ValueCountFrequency (%)
21
100.0%
Control
ValueCountFrequency (%)
10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 565
92.3%
Common 36
 
5.9%
Latin 6
 
1.0%
Hangul 5
 
0.8%

Most frequent character per script

Han
ValueCountFrequency (%)
33
 
5.8%
16
 
2.8%
13
 
2.3%
10
 
1.8%
8
 
1.4%
7
 
1.2%
6
 
1.1%
6
 
1.1%
6
 
1.1%
5
 
0.9%
Other values (304) 455
80.5%
Latin
ValueCountFrequency (%)
O 1
16.7%
I 1
16.7%
A 1
16.7%
H 1
16.7%
Z 1
16.7%
X 1
16.7%
Common
ValueCountFrequency (%)
21
58.3%
10
27.8%
( 2
 
5.6%
) 2
 
5.6%
, 1
 
2.8%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
CJK 553
90.4%
ASCII 42
 
6.9%
CJK Compat Ideographs 12
 
2.0%
Hangul 5
 
0.8%

Most frequent character per block

CJK
ValueCountFrequency (%)
33
 
6.0%
16
 
2.9%
13
 
2.4%
8
 
1.4%
7
 
1.3%
6
 
1.1%
6
 
1.1%
6
 
1.1%
5
 
0.9%
5
 
0.9%
Other values (301) 448
81.0%
ASCII
ValueCountFrequency (%)
21
50.0%
10
23.8%
( 2
 
4.8%
) 2
 
4.8%
O 1
 
2.4%
, 1
 
2.4%
I 1
 
2.4%
A 1
 
2.4%
H 1
 
2.4%
Z 1
 
2.4%
CJK Compat Ideographs
ValueCountFrequency (%)
10
83.3%
1
 
8.3%
1
 
8.3%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Unnamed: 5
Text

MISSING 

Distinct183
Distinct (%)100.0%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T08:09:12.123652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.508197
Min length2

Characters and Unicode

Total characters2106
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st row전화
2nd row02-720-2861
3rd row02-2078-6658
4th row02-726-5546
5th row02-732-8888
ValueCountFrequency (%)
02-701-8858 1
 
0.5%
02-313-8842 1
 
0.5%
033-262-0686 1
 
0.5%
02-332-5959 1
 
0.5%
053-424-1400 1
 
0.5%
02-3144-4689 1
 
0.5%
02-725-3400 1
 
0.5%
070-7709-6124 1
 
0.5%
064-702-8801 1
 
0.5%
051-465-3333 1
 
0.5%
Other values (173) 173
94.5%
2023-12-12T08:09:12.565765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 364
17.3%
0 320
15.2%
2 293
13.9%
3 198
9.4%
8 185
8.8%
7 162
7.7%
6 146
6.9%
1 133
 
6.3%
5 111
 
5.3%
4 100
 
4.7%
Other values (3) 94
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1740
82.6%
Dash Punctuation 364
 
17.3%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 320
18.4%
2 293
16.8%
3 198
11.4%
8 185
10.6%
7 162
9.3%
6 146
8.4%
1 133
7.6%
5 111
 
6.4%
4 100
 
5.7%
9 92
 
5.3%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 364
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2104
99.9%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 364
17.3%
0 320
15.2%
2 293
13.9%
3 198
9.4%
8 185
8.8%
7 162
7.7%
6 146
6.9%
1 133
 
6.3%
5 111
 
5.3%
4 100
 
4.8%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2104
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 364
17.3%
0 320
15.2%
2 293
13.9%
3 198
9.4%
8 185
8.8%
7 162
7.7%
6 146
6.9%
1 133
 
6.3%
5 111
 
5.3%
4 100
 
4.8%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 6
Text

MISSING 

Distinct180
Distinct (%)98.4%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T08:09:12.816931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.595628
Min length2

Characters and Unicode

Total characters2122
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique177 ?
Unique (%)96.7%

Sample

1st row팩스
2nd row02-720-2865
3rd row02-6442-3920
4th row02-773-1623
5th row02-2630-2563
ValueCountFrequency (%)
02-338-8086 2
 
1.1%
02-718-1689 2
 
1.1%
02-6323-8866 2
 
1.1%
031-245-6350 1
 
0.5%
02-3143-6688 1
 
0.5%
053-427-3934 1
 
0.5%
032-724-9019 1
 
0.5%
055-232-7688 1
 
0.5%
02-322-5893 1
 
0.5%
02-326-1818 1
 
0.5%
Other values (170) 170
92.9%
2023-12-12T08:09:13.174447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 364
17.2%
2 295
13.9%
0 291
13.7%
3 192
9.0%
8 181
8.5%
7 157
7.4%
6 156
7.4%
1 132
 
6.2%
5 132
 
6.2%
4 125
 
5.9%
Other values (3) 97
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1756
82.8%
Dash Punctuation 364
 
17.2%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 295
16.8%
0 291
16.6%
3 192
10.9%
8 181
10.3%
7 157
8.9%
6 156
8.9%
1 132
7.5%
5 132
7.5%
4 125
7.1%
9 95
 
5.4%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 364
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2120
99.9%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 364
17.2%
2 295
13.9%
0 291
13.7%
3 192
9.1%
8 181
8.5%
7 157
7.4%
6 156
7.4%
1 132
 
6.2%
5 132
 
6.2%
4 125
 
5.9%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2120
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 364
17.2%
2 295
13.9%
0 291
13.7%
3 192
9.1%
8 181
8.5%
7 157
7.4%
6 156
7.4%
1 132
 
6.2%
5 132
 
6.2%
4 125
 
5.9%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 7
Text

MISSING 

Distinct183
Distinct (%)100.0%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T08:09:13.477592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length82
Median length52
Mean length37.10929
Min length2

Characters and Unicode

Total characters6791
Distinct characters298
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st row주소
2nd row03981 서울특별시 마포구 성미산로 189, 2층
3rd row04543 서울특별시 중구 을지로11길 15 동화빌딩 501호(중국사업부)
4th row04532 서울특별시 중구 소공로 88, 2,5층 (소공동, 한진빌딩)
5th row07213 서울특별시 영등포구 양평로 67 한강포스빌 420호
ValueCountFrequency (%)
서울특별시 131
 
10.2%
마포구 32
 
2.5%
중구 30
 
2.3%
2층 22
 
1.7%
영등포구 21
 
1.6%
종로구 15
 
1.2%
경기도 14
 
1.1%
서대문구 9
 
0.7%
제주특별자치도 7
 
0.5%
제주시 7
 
0.5%
Other values (763) 994
77.5%
2023-12-12T08:09:13.924003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1104
 
16.3%
0 357
 
5.3%
1 338
 
5.0%
2 255
 
3.8%
3 228
 
3.4%
202
 
3.0%
193
 
2.8%
4 179
 
2.6%
176
 
2.6%
174
 
2.6%
Other values (288) 3585
52.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3286
48.4%
Decimal Number 2038
30.0%
Space Separator 1104
 
16.3%
Other Punctuation 171
 
2.5%
Close Punctuation 63
 
0.9%
Open Punctuation 63
 
0.9%
Dash Punctuation 24
 
0.4%
Uppercase Letter 19
 
0.3%
Math Symbol 10
 
0.1%
Lowercase Letter 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
202
 
6.1%
193
 
5.9%
176
 
5.4%
174
 
5.3%
138
 
4.2%
138
 
4.2%
138
 
4.2%
124
 
3.8%
90
 
2.7%
82
 
2.5%
Other values (249) 1831
55.7%
Uppercase Letter
ValueCountFrequency (%)
S 4
21.1%
A 2
10.5%
G 2
10.5%
B 2
10.5%
D 1
 
5.3%
V 1
 
5.3%
K 1
 
5.3%
N 1
 
5.3%
T 1
 
5.3%
I 1
 
5.3%
Other values (3) 3
15.8%
Decimal Number
ValueCountFrequency (%)
0 357
17.5%
1 338
16.6%
2 255
12.5%
3 228
11.2%
4 179
8.8%
5 167
8.2%
6 133
 
6.5%
7 132
 
6.5%
8 129
 
6.3%
9 120
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
e 2
25.0%
r 1
12.5%
t 1
12.5%
n 1
12.5%
c 1
12.5%
k 1
12.5%
s 1
12.5%
Math Symbol
ValueCountFrequency (%)
> 4
40.0%
< 4
40.0%
~ 2
20.0%
Space Separator
ValueCountFrequency (%)
1104
100.0%
Other Punctuation
ValueCountFrequency (%)
, 171
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Control
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3478
51.2%
Hangul 3286
48.4%
Latin 27
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
202
 
6.1%
193
 
5.9%
176
 
5.4%
174
 
5.3%
138
 
4.2%
138
 
4.2%
138
 
4.2%
124
 
3.8%
90
 
2.7%
82
 
2.5%
Other values (249) 1831
55.7%
Latin
ValueCountFrequency (%)
S 4
 
14.8%
A 2
 
7.4%
G 2
 
7.4%
e 2
 
7.4%
B 2
 
7.4%
D 1
 
3.7%
r 1
 
3.7%
t 1
 
3.7%
n 1
 
3.7%
c 1
 
3.7%
Other values (10) 10
37.0%
Common
ValueCountFrequency (%)
1104
31.7%
0 357
 
10.3%
1 338
 
9.7%
2 255
 
7.3%
3 228
 
6.6%
4 179
 
5.1%
, 171
 
4.9%
5 167
 
4.8%
6 133
 
3.8%
7 132
 
3.8%
Other values (9) 414
 
11.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3505
51.6%
Hangul 3286
48.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1104
31.5%
0 357
 
10.2%
1 338
 
9.6%
2 255
 
7.3%
3 228
 
6.5%
4 179
 
5.1%
, 171
 
4.9%
5 167
 
4.8%
6 133
 
3.8%
7 132
 
3.8%
Other values (29) 441
 
12.6%
Hangul
ValueCountFrequency (%)
202
 
6.1%
193
 
5.9%
176
 
5.4%
174
 
5.3%
138
 
4.2%
138
 
4.2%
138
 
4.2%
124
 
3.8%
90
 
2.7%
82
 
2.5%
Other values (249) 1831
55.7%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)1.1%
Memory size1.6 KiB

Unnamed: 9
Categorical

Distinct27
Distinct (%)14.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2019.8.27.
27 
2016.11.4.
27 
2015.8.24.
16 
2018.09.27.
14 
2014.12.12.
11 
Other values (22)
90 

Length

Max length11
Median length10
Mean length10.032432
Min length3

Unique

Unique7 ?
Unique (%)3.8%

Sample

1st row<NA>
2nd row<NA>
3rd row지정일
4th row2000.06.27
5th row2000.06.27

Common Values

ValueCountFrequency (%)
2019.8.27. 27
14.6%
2016.11.4. 27
14.6%
2015.8.24. 16
 
8.6%
2018.09.27. 14
 
7.6%
2014.12.12. 11
 
5.9%
2010.08.03 10
 
5.4%
2000.06.27 10
 
5.4%
2021.7.19. 9
 
4.9%
2014.02.26 9
 
4.9%
2012.03.09 8
 
4.3%
Other values (17) 44
23.8%

Length

2023-12-12T08:09:14.056186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2019.8.27 27
14.6%
2016.11.4 27
14.6%
2015.8.24 16
 
8.6%
2018.09.27 14
 
7.6%
2014.12.12 12
 
6.5%
2010.08.03 10
 
5.4%
2000.06.27 10
 
5.4%
2021.7.19 9
 
4.9%
2014.02.26 9
 
4.9%
2012.03.09 8
 
4.3%
Other values (16) 43
23.2%

Missing values

2023-12-12T08:09:08.565841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:09:08.700776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T08:09:08.826223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

중국 단체관광객 유치 전담여행사Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
0총 182개사 (2021. 12월)<NA><NA><NA><NA><NA><NA><NA>NaN<NA>
1NaN<NA><NA><NA><NA><NA><NA><NA>NaN<NA>
2번호업체명업체명(중문)대표자대표자(중문)전화팩스주소등록번호지정일
31(주)금룡여행사(株)金龙旅行社유옥붕刘玉鹏02-720-286102-720-286503981 서울특별시 마포구 성미산로 189, 2층20000627012000.06.27
42롯데관광(주)乐天观光(株)조광희赵光熙02-2078-665802-6442-392004543 서울특별시 중구 을지로11길 15 동화빌딩 501호(중국사업부)20000627022000.06.27
53(주)한진관광(株)韩进观光김정수金正洙02-726-554602-773-162304532 서울특별시 중구 소공로 88, 2,5층 (소공동, 한진빌딩)20000627042000.06.27
64(주)계명세계여행(株)启明世界旅行김미숙金美淑02-732-888802-2630-256307213 서울특별시 영등포구 양평로 67 한강포스빌 420호20000627052000.06.27
75(주)내일관광여행사(株)来日观光旅行社이문균李文鈞02-773-388802-776-488803459 서울특별시 은평구 응암로 352, 302호(녹번동, 태선라이프빌딩)20000627092000.06.27
86(주)화인관광(株)华人观光손서장孙书壮02-322-919102-322-844804051 서울특별시 마포구 홍익로 6길 67, 연희빌딩 401호20000627102000.06.27
97(주)한국중국여행사(株)韩国中国旅行社조학령 조 반曹学玲 曹 班02-752-339902-757-373704522 서울특별시 중구 다동길 46, 다동빌딩 402호20000627132000.06.27
중국 단체관광객 유치 전담여행사Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
175173(주)화동여행사(株)華東旅行社김영백金永栢02-2269-999802-2273-168803017 서울특별시 종로구 자하문로 280, 청하빌딩 511호 (부암동)20190827282019.8.27.
176174(주)무브牟福최민석崔珉碩02-1877-20250504-342-087104147 서울특별시 마포구 백범로31길 21 서울창업허브 517호20210719012021.7.19.
177175썬투어즈(주)焯辰旅遊로유룬羅宇麟02-6356-888002-6356-888110842 경기도 파주시 문산읍 휴암로 538, 714호20210719022021.7.19.
178176주식회사 엔에이디鄭氾中韓國濟旅行社정 범鄭 氾032-764-9002032-765-981022303 인천광역시 중구 월미로 266, 2층20210719032021.7.19.
179177(주)재미난투어(株)有趣旅行社최부림崔溥林070-8835-1835051-466-183548821 부산광역시 동구 중앙대로180번길 13, 901호20210719042021.7.19.
180178주식회사 지제이투어GJTOUR강대위姜大威02-332-168502-3143-668803716 서울특별시 서대문구 연희로5길 54-3, 501호20210719052021.7.19.
181179(주)케이앤디알(株)KNDR유새하柳思荷031-233-6350031-245-635016252 경기도 수원시 팔달구 화서문로 64, 금강빌딩 203호20210719062021.7.19.
182180주식회사 킴스엠앤티Kim's M&T김춘추金春秋02-570-359002-575-982806296 서울특별시 강남구 남부순환로 2728, 유일빌딩 5층20210719072021.7.19.
183181(주)포시즌여행사(株)四季旅行社한만형韓萬亨02-2038-336902-6969-536910016 경기도 김포시 통진읍 담터로55번길 32, 2층20210719082021.7.19.
184182(주)한국교육여행사韩国教育旅行社구자만具滋蔓02-1644-823002-6442-174503992 서울특별시 마포구 동교로25길 11, 2층20210719092021.7.19.

Duplicate rows

Most frequently occurring

Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 9# duplicates
0<NA><NA><NA><NA><NA><NA><NA><NA>2