Overview

Dataset statistics

Number of variables6
Number of observations42
Missing cells7
Missing cells (%)2.8%
Duplicate rows1
Duplicate rows (%)2.4%
Total size in memory2.1 KiB
Average record size in memory51.1 B

Variable types

Categorical2
Text4

Dataset

Description여행업체정보 : 충청북도 충주시 여행업체정보 현황에 대한 데이터를 제공합니다(등록업종, 업체명, 대표자, 소재지, 전화번호, 데이터 기준일자)
URLhttps://www.data.go.kr/data/3037841/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (2.4%) duplicate rowsDuplicates
전화번호 has 7 (16.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 05:09:40.471683
Analysis finished2023-12-12 05:09:41.108569
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록업종
Categorical

Distinct3
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size468.0 B
국내외여행업
23 
종합여행업
14 
국내여행업

Length

Max length6
Median length6
Mean length5.547619
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 23
54.8%
종합여행업 14
33.3%
국내여행업 5
 
11.9%

Length

2023-12-12T14:09:41.199673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:09:41.329005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 23
54.8%
종합여행업 14
33.3%
국내여행업 5
 
11.9%
Distinct41
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-12T14:09:41.564710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length11
Mean length8.9285714
Min length3

Characters and Unicode

Total characters375
Distinct characters120
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)95.2%

Sample

1st row㈜지인
2nd row(주)충주호크루즈
3rd row충주여행가게
4th row(주)충주호크루즈
5th row자작자작협동조합
ValueCountFrequency (%)
주식회사 8
 
14.5%
주)충주호크루즈 2
 
3.6%
㈜지인 1
 
1.8%
알타반 1
 
1.8%
트래블(artaban 1
 
1.8%
travel 1
 
1.8%
스타여행사 1
 
1.8%
미라클여행사 1
 
1.8%
지인 1
 
1.8%
선진항공여행사 1
 
1.8%
Other values (37) 37
67.3%
2023-12-12T14:09:42.081528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
 
6.9%
24
 
6.4%
19
 
5.1%
( 18
 
4.8%
18
 
4.8%
) 18
 
4.8%
13
 
3.5%
10
 
2.7%
9
 
2.4%
9
 
2.4%
Other values (110) 211
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 295
78.7%
Uppercase Letter 19
 
5.1%
Open Punctuation 18
 
4.8%
Close Punctuation 18
 
4.8%
Space Separator 13
 
3.5%
Lowercase Letter 7
 
1.9%
Other Symbol 4
 
1.1%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
8.8%
24
 
8.1%
19
 
6.4%
18
 
6.1%
10
 
3.4%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
8
 
2.7%
Other values (88) 156
52.9%
Uppercase Letter
ValueCountFrequency (%)
A 4
21.1%
T 3
15.8%
R 3
15.8%
L 1
 
5.3%
V 1
 
5.3%
N 1
 
5.3%
E 1
 
5.3%
B 1
 
5.3%
K 1
 
5.3%
O 1
 
5.3%
Other values (2) 2
10.5%
Lowercase Letter
ValueCountFrequency (%)
l 2
28.6%
o 2
28.6%
r 1
14.3%
a 1
14.3%
c 1
14.3%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 299
79.7%
Common 50
 
13.3%
Latin 26
 
6.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
8.7%
24
 
8.0%
19
 
6.4%
18
 
6.0%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.7%
8
 
2.7%
8
 
2.7%
Other values (89) 160
53.5%
Latin
ValueCountFrequency (%)
A 4
15.4%
T 3
11.5%
R 3
11.5%
l 2
 
7.7%
o 2
 
7.7%
L 1
 
3.8%
V 1
 
3.8%
N 1
 
3.8%
E 1
 
3.8%
r 1
 
3.8%
Other values (7) 7
26.9%
Common
ValueCountFrequency (%)
( 18
36.0%
) 18
36.0%
13
26.0%
- 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 295
78.7%
ASCII 76
 
20.3%
None 4
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
 
8.8%
24
 
8.1%
19
 
6.4%
18
 
6.1%
10
 
3.4%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
8
 
2.7%
Other values (88) 156
52.9%
ASCII
ValueCountFrequency (%)
( 18
23.7%
) 18
23.7%
13
17.1%
A 4
 
5.3%
T 3
 
3.9%
R 3
 
3.9%
l 2
 
2.6%
o 2
 
2.6%
L 1
 
1.3%
V 1
 
1.3%
Other values (11) 11
14.5%
None
ValueCountFrequency (%)
4
100.0%
Distinct39
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-12T14:09:42.346203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters126
Distinct characters62
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)85.7%

Sample

1st row이승진
2nd row김철석
3rd row홍근표
4th row김철석
5th row공영환
ValueCountFrequency (%)
이승진 2
 
4.8%
김명훈 2
 
4.8%
김철석 2
 
4.8%
장예진 1
 
2.4%
박선오 1
 
2.4%
박진영 1
 
2.4%
한기현 1
 
2.4%
백기환 1
 
2.4%
정연주 1
 
2.4%
오성권 1
 
2.4%
Other values (29) 29
69.0%
2023-12-12T14:09:42.750950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
9.5%
6
 
4.8%
5
 
4.0%
4
 
3.2%
4
 
3.2%
4
 
3.2%
4
 
3.2%
3
 
2.4%
3
 
2.4%
3
 
2.4%
Other values (52) 78
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 126
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
9.5%
6
 
4.8%
5
 
4.0%
4
 
3.2%
4
 
3.2%
4
 
3.2%
4
 
3.2%
3
 
2.4%
3
 
2.4%
3
 
2.4%
Other values (52) 78
61.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 126
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
9.5%
6
 
4.8%
5
 
4.0%
4
 
3.2%
4
 
3.2%
4
 
3.2%
4
 
3.2%
3
 
2.4%
3
 
2.4%
3
 
2.4%
Other values (52) 78
61.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 126
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
9.5%
6
 
4.8%
5
 
4.0%
4
 
3.2%
4
 
3.2%
4
 
3.2%
4
 
3.2%
3
 
2.4%
3
 
2.4%
3
 
2.4%
Other values (52) 78
61.9%
Distinct40
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-12T14:09:43.134073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length35
Mean length27.333333
Min length20

Characters and Unicode

Total characters1148
Distinct characters124
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)90.5%

Sample

1st row충청북도 충주시 충원대로 268 창업보육센터 301호(단월동, 건국대학교글로컬캠퍼스)
2nd row충청북도 충주시 동량면 지등로 882
3rd row충청북도 충주시 팽고리산길 40, 102호(스마일모터스, 금릉동)
4th row충청북도 충주시 동량면 지등로 882
5th row충청북도 충주시 성서1길 12-1, 2층 201호(성남동)
ValueCountFrequency (%)
충청북도 42
 
18.1%
충주시 42
 
18.1%
문화동 5
 
2.2%
중원대로 5
 
2.2%
금릉동 4
 
1.7%
1층 4
 
1.7%
건국대학교글로컬캠퍼스 3
 
1.3%
번영대로 3
 
1.3%
봉현로 3
 
1.3%
268 3
 
1.3%
Other values (98) 118
50.9%
2023-12-12T14:09:43.796148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
192
 
16.7%
89
 
7.8%
1 48
 
4.2%
43
 
3.7%
43
 
3.7%
43
 
3.7%
43
 
3.7%
42
 
3.7%
38
 
3.3%
) 35
 
3.0%
Other values (114) 532
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 683
59.5%
Space Separator 192
 
16.7%
Decimal Number 167
 
14.5%
Close Punctuation 35
 
3.0%
Open Punctuation 35
 
3.0%
Other Punctuation 27
 
2.4%
Dash Punctuation 5
 
0.4%
Uppercase Letter 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
13.0%
43
 
6.3%
43
 
6.3%
43
 
6.3%
43
 
6.3%
42
 
6.1%
38
 
5.6%
32
 
4.7%
23
 
3.4%
15
 
2.2%
Other values (95) 272
39.8%
Decimal Number
ValueCountFrequency (%)
1 48
28.7%
2 30
18.0%
0 23
13.8%
3 16
 
9.6%
4 13
 
7.8%
8 11
 
6.6%
5 10
 
6.0%
6 10
 
6.0%
9 5
 
3.0%
7 1
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
B 1
25.0%
C 1
25.0%
M 1
25.0%
A 1
25.0%
Space Separator
ValueCountFrequency (%)
192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Other Punctuation
ValueCountFrequency (%)
, 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 683
59.5%
Common 461
40.2%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
13.0%
43
 
6.3%
43
 
6.3%
43
 
6.3%
43
 
6.3%
42
 
6.1%
38
 
5.6%
32
 
4.7%
23
 
3.4%
15
 
2.2%
Other values (95) 272
39.8%
Common
ValueCountFrequency (%)
192
41.6%
1 48
 
10.4%
) 35
 
7.6%
( 35
 
7.6%
2 30
 
6.5%
, 27
 
5.9%
0 23
 
5.0%
3 16
 
3.5%
4 13
 
2.8%
8 11
 
2.4%
Other values (5) 31
 
6.7%
Latin
ValueCountFrequency (%)
B 1
25.0%
C 1
25.0%
M 1
25.0%
A 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 683
59.5%
ASCII 465
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
192
41.3%
1 48
 
10.3%
) 35
 
7.5%
( 35
 
7.5%
2 30
 
6.5%
, 27
 
5.8%
0 23
 
4.9%
3 16
 
3.4%
4 13
 
2.8%
8 11
 
2.4%
Other values (9) 35
 
7.5%
Hangul
ValueCountFrequency (%)
89
 
13.0%
43
 
6.3%
43
 
6.3%
43
 
6.3%
43
 
6.3%
42
 
6.1%
38
 
5.6%
32
 
4.7%
23
 
3.4%
15
 
2.2%
Other values (95) 272
39.8%

전화번호
Text

MISSING 

Distinct32
Distinct (%)91.4%
Missing7
Missing (%)16.7%
Memory size468.0 B
2023-12-12T14:09:44.119380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.857143
Min length9

Characters and Unicode

Total characters415
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)82.9%

Sample

1st row043-856-2000
2nd row043-851-7400
3rd row043-855-2008
4th row043-851-7400
5th row043-855-1254
ValueCountFrequency (%)
043-846-1311 2
 
5.7%
043-856-2000 2
 
5.7%
043-851-7400 2
 
5.7%
043-843-7766 1
 
2.9%
043-851-7981 1
 
2.9%
043-850-7161 1
 
2.9%
1661-5795 1
 
2.9%
043-845-8588 1
 
2.9%
1644-8423 1
 
2.9%
043-857-7419 1
 
2.9%
Other values (22) 22
62.9%
2023-12-12T14:09:44.631465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 68
16.4%
0 65
15.7%
4 63
15.2%
8 45
10.8%
3 44
10.6%
5 33
8.0%
1 28
6.7%
7 24
 
5.8%
6 22
 
5.3%
2 16
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 347
83.6%
Dash Punctuation 68
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 65
18.7%
4 63
18.2%
8 45
13.0%
3 44
12.7%
5 33
9.5%
1 28
8.1%
7 24
 
6.9%
6 22
 
6.3%
2 16
 
4.6%
9 7
 
2.0%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 415
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 68
16.4%
0 65
15.7%
4 63
15.2%
8 45
10.8%
3 44
10.6%
5 33
8.0%
1 28
6.7%
7 24
 
5.8%
6 22
 
5.3%
2 16
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 415
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 68
16.4%
0 65
15.7%
4 63
15.2%
8 45
10.8%
3 44
10.6%
5 33
8.0%
1 28
6.7%
7 24
 
5.8%
6 22
 
5.3%
2 16
 
3.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-06-30
42 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-30
2nd row2023-06-30
3rd row2023-06-30
4th row2023-06-30
5th row2023-06-30

Common Values

ValueCountFrequency (%)
2023-06-30 42
100.0%

Length

2023-12-12T14:09:44.848524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:09:45.019823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-30 42
100.0%

Correlations

2023-12-12T14:09:45.128867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록업종업체명대표자소재지전화번호
등록업종1.0001.0000.8420.6160.208
업체명1.0001.0001.0001.0001.000
대표자0.8421.0001.0001.0000.998
소재지0.6161.0001.0001.0001.000
전화번호0.2081.0000.9981.0001.000

Missing values

2023-12-12T14:09:40.895113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:09:41.045181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록업종업체명대표자소재지전화번호데이터기준일자
0국내여행업㈜지인이승진충청북도 충주시 충원대로 268 창업보육센터 301호(단월동, 건국대학교글로컬캠퍼스)043-856-20002023-06-30
1국내여행업(주)충주호크루즈김철석충청북도 충주시 동량면 지등로 882043-851-74002023-06-30
2국내여행업충주여행가게홍근표충청북도 충주시 팽고리산길 40, 102호(스마일모터스, 금릉동)043-855-20082023-06-30
3국내여행업(주)충주호크루즈김철석충청북도 충주시 동량면 지등로 882043-851-74002023-06-30
4국내여행업자작자작협동조합공영환충청북도 충주시 성서1길 12-1, 2층 201호(성남동)<NA>2023-06-30
5국내외여행업국원전세버스협동조합이선희충청북도 충주시 봉현로 12(봉방동)043-855-12542023-06-30
6국내외여행업(합)충주관광여행사정운한충청북도 충주시 탄금대로 23 (문화동)043-844-55652023-06-30
7국내외여행업대일관광주식회사김복용충청북도 충주시 국원대로 5 (문화동)043-842-77102023-06-30
8국내외여행업(합)태화관광이범영충청북도 충주시 연수서1길 20, 206호 (연수동)043-843-78362023-06-30
9국내외여행업통일고속관광 주식회사조철행충청북도 충주시 안림로 6 (안림동)043-851-79002023-06-30
등록업종업체명대표자소재지전화번호데이터기준일자
32종합여행업㈜씨씨에스충북방송김형준충청북도 충주시 예성로 114(용산동)043-850-71612023-06-30
33종합여행업주식회사 제이알프로젝트신동진충청북도 충주시 대가미15길 1, 1층(교현동)<NA>2023-06-30
34종합여행업(재)충주중원문화재단백인욱충청북도 충주시 중앙탑면 중앙탑길 150(마리나센터, 1층)043-851-79812023-06-30
35종합여행업대일운수 주식회사김명훈충청북도 충주시 번영대로 69, 2호 (금릉동)043-843-77662023-06-30
36종합여행업로컬로(localro)박진영충청북도 충주시 관아6길 5, 다온빌딩 1층 103호 (성내동)<NA>2023-06-30
37종합여행업제일투어김성남충청북도 충주시 앙성면 가곡로 1033-1, 1층<NA>2023-06-30
38종합여행업합자회사 탄금대관광김명훈충청북도 충주시 국원대로 220, 4층(금름동)043-844-79002023-06-30
39종합여행업㈜엠비씨충북한기현충청북도 충주시 중원대로 3250, MBC충북(호암동)<NA>2023-06-30
40종합여행업두드림 월드투어장예진충청북도 충주시 칠금3길 4, 1층 1호(칠금동)043-847-76772023-06-30
41종합여행업㈜망고투어전은정충청북도 충주시 주덕읍 시장길 21-1, 2층043-846-13112023-06-30

Duplicate rows

Most frequently occurring

등록업종업체명대표자소재지전화번호데이터기준일자# duplicates
0국내여행업(주)충주호크루즈김철석충청북도 충주시 동량면 지등로 882043-851-74002023-06-302