Overview

Dataset statistics

Number of variables6
Number of observations21
Missing cells27
Missing cells (%)21.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory56.3 B

Variable types

Numeric1
Text3
Categorical1
Unsupported1

Dataset

Description전라남도 고흥군 국내여행업 현황에 대한 데이터로 사업장 명칭, 사업장 소재지 전체 주소, 사업장 전화번호, 문화체육업종명 등에 대한 정보를 제공합니다.
Author전라남도 고흥군
URLhttps://www.data.go.kr/data/15090053/fileData.do

Alerts

문화체육업종명 has constant value ""Constant
소재지전화 has 6 (28.6%) missing valuesMissing
비고 has 21 (100.0%) missing valuesMissing
번호 has unique valuesUnique
사업장명 has unique valuesUnique
소재지전체주소 has unique valuesUnique
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 15:55:55.566519
Analysis finished2023-12-12 15:55:56.101365
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11
Minimum1
Maximum21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2023-12-13T00:55:56.160376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16
median11
Q316
95-th percentile20
Maximum21
Range20
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.2048368
Coefficient of variation (CV)0.56407607
Kurtosis-1.2
Mean11
Median Absolute Deviation (MAD)5
Skewness0
Sum231
Variance38.5
MonotonicityStrictly increasing
2023-12-13T00:55:56.272827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
1 1
 
4.8%
2 1
 
4.8%
21 1
 
4.8%
20 1
 
4.8%
19 1
 
4.8%
18 1
 
4.8%
17 1
 
4.8%
16 1
 
4.8%
15 1
 
4.8%
14 1
 
4.8%
Other values (11) 11
52.4%
ValueCountFrequency (%)
1 1
4.8%
2 1
4.8%
3 1
4.8%
4 1
4.8%
5 1
4.8%
6 1
4.8%
7 1
4.8%
8 1
4.8%
9 1
4.8%
10 1
4.8%
ValueCountFrequency (%)
21 1
4.8%
20 1
4.8%
19 1
4.8%
18 1
4.8%
17 1
4.8%
16 1
4.8%
15 1
4.8%
14 1
4.8%
13 1
4.8%
12 1
4.8%

사업장명
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-13T00:55:56.458423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length7.7142857
Min length4

Characters and Unicode

Total characters162
Distinct characters56
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row(유)남도관광 고흥영업소
2nd row고흥관광
3rd row고흥21세기 관광여행사
4th row유한회사 하나여행사
5th row우주관광(주)
ValueCountFrequency (%)
유한회사 3
 
11.1%
유)남도관광 1
 
3.7%
고흥남도관광여행사 1
 
3.7%
유)한마음투어 1
 
3.7%
유)도수항공여행사 1
 
3.7%
투어 1
 
3.7%
플래티늄 1
 
3.7%
하이투어 1
 
3.7%
주)동방관광여행사 1
 
3.7%
유)우주항공여행사 1
 
3.7%
Other values (15) 15
55.6%
2023-12-13T00:55:56.828964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
6.8%
11
 
6.8%
11
 
6.8%
8
 
4.9%
8
 
4.9%
8
 
4.9%
) 7
 
4.3%
( 6
 
3.7%
6
 
3.7%
5
 
3.1%
Other values (46) 81
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 141
87.0%
Close Punctuation 7
 
4.3%
Open Punctuation 6
 
3.7%
Space Separator 6
 
3.7%
Decimal Number 2
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
7.8%
11
 
7.8%
11
 
7.8%
8
 
5.7%
8
 
5.7%
8
 
5.7%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
Other values (41) 66
46.8%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 141
87.0%
Common 21
 
13.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
7.8%
11
 
7.8%
11
 
7.8%
8
 
5.7%
8
 
5.7%
8
 
5.7%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
Other values (41) 66
46.8%
Common
ValueCountFrequency (%)
) 7
33.3%
( 6
28.6%
6
28.6%
1 1
 
4.8%
2 1
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 141
87.0%
ASCII 21
 
13.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
7.8%
11
 
7.8%
11
 
7.8%
8
 
5.7%
8
 
5.7%
8
 
5.7%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
Other values (41) 66
46.8%
ASCII
ValueCountFrequency (%)
) 7
33.3%
( 6
28.6%
6
28.6%
1 1
 
4.8%
2 1
 
4.8%
Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-13T00:55:57.472342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length28
Mean length24.904762
Min length22

Characters and Unicode

Total characters523
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row전라남도 고흥군 고흥읍 남계리 716-2
2nd row전라남도 고흥군 고흥읍 남계리 663-10번지
3rd row전라남도 고흥군 고흥읍 남계리 633-15번지 2층
4th row전라남도 고흥군 고흥읍 서문리 213-12번지
5th row전라남도 고흥군 고흥읍 남계리 924-1번지
ValueCountFrequency (%)
전라남도 21
19.1%
고흥군 21
19.1%
고흥읍 12
 
10.9%
남계리 8
 
7.3%
서문리 3
 
2.7%
도양읍 3
 
2.7%
과역리 2
 
1.8%
봉암리 2
 
1.8%
과역면 2
 
1.8%
710-3번지 1
 
0.9%
Other values (35) 35
31.8%
2023-12-13T00:55:57.955700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
89
17.0%
33
 
6.3%
33
 
6.3%
30
 
5.7%
25
 
4.8%
1 25
 
4.8%
21
 
4.0%
21
 
4.0%
21
 
4.0%
21
 
4.0%
Other values (41) 204
39.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 320
61.2%
Decimal Number 96
 
18.4%
Space Separator 89
 
17.0%
Dash Punctuation 18
 
3.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
10.3%
33
10.3%
30
 
9.4%
25
 
7.8%
21
 
6.6%
21
 
6.6%
21
 
6.6%
21
 
6.6%
20
 
6.2%
20
 
6.2%
Other values (29) 75
23.4%
Decimal Number
ValueCountFrequency (%)
1 25
26.0%
2 15
15.6%
7 9
 
9.4%
6 8
 
8.3%
3 8
 
8.3%
8 7
 
7.3%
4 7
 
7.3%
5 7
 
7.3%
0 7
 
7.3%
9 3
 
3.1%
Space Separator
ValueCountFrequency (%)
89
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 320
61.2%
Common 203
38.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
10.3%
33
10.3%
30
 
9.4%
25
 
7.8%
21
 
6.6%
21
 
6.6%
21
 
6.6%
21
 
6.6%
20
 
6.2%
20
 
6.2%
Other values (29) 75
23.4%
Common
ValueCountFrequency (%)
89
43.8%
1 25
 
12.3%
- 18
 
8.9%
2 15
 
7.4%
7 9
 
4.4%
6 8
 
3.9%
3 8
 
3.9%
8 7
 
3.4%
4 7
 
3.4%
5 7
 
3.4%
Other values (2) 10
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 320
61.2%
ASCII 203
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
89
43.8%
1 25
 
12.3%
- 18
 
8.9%
2 15
 
7.4%
7 9
 
4.4%
6 8
 
3.9%
3 8
 
3.9%
8 7
 
3.4%
4 7
 
3.4%
5 7
 
3.4%
Other values (2) 10
 
4.9%
Hangul
ValueCountFrequency (%)
33
10.3%
33
10.3%
30
 
9.4%
25
 
7.8%
21
 
6.6%
21
 
6.6%
21
 
6.6%
21
 
6.6%
20
 
6.2%
20
 
6.2%
Other values (29) 75
23.4%

소재지전화
Text

MISSING 

Distinct15
Distinct (%)100.0%
Missing6
Missing (%)28.6%
Memory size300.0 B
2023-12-13T00:55:58.217708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)100.0%

Sample

1st row061-835-3848
2nd row061-832-9654
3rd row061-834-0238
4th row061-835-8700
5th row061-842-2002
ValueCountFrequency (%)
061-835-3848 1
 
6.7%
061-832-9654 1
 
6.7%
061-834-0238 1
 
6.7%
061-835-8700 1
 
6.7%
061-842-2002 1
 
6.7%
061-832-0577 1
 
6.7%
061-833-5424 1
 
6.7%
061-834-3999 1
 
6.7%
061-834-3057 1
 
6.7%
061-834-1500 1
 
6.7%
Other values (5) 5
33.3%
2023-12-13T00:55:58.691089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 30
16.7%
0 27
15.0%
8 20
11.1%
3 19
10.6%
1 18
10.0%
6 17
9.4%
4 16
8.9%
2 12
 
6.7%
5 8
 
4.4%
7 8
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 150
83.3%
Dash Punctuation 30
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 27
18.0%
8 20
13.3%
3 19
12.7%
1 18
12.0%
6 17
11.3%
4 16
10.7%
2 12
8.0%
5 8
 
5.3%
7 8
 
5.3%
9 5
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 30
16.7%
0 27
15.0%
8 20
11.1%
3 19
10.6%
1 18
10.0%
6 17
9.4%
4 16
8.9%
2 12
 
6.7%
5 8
 
4.4%
7 8
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 30
16.7%
0 27
15.0%
8 20
11.1%
3 19
10.6%
1 18
10.0%
6 17
9.4%
4 16
8.9%
2 12
 
6.7%
5 8
 
4.4%
7 8
 
4.4%

문화체육업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size300.0 B
국내여행업
21 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내여행업 21
100.0%

Length

2023-12-13T00:55:58.847084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:55:58.979671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내여행업 21
100.0%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)100.0%
Memory size321.0 B

Interactions

2023-12-13T00:55:55.766840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:55:59.083828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호사업장명소재지전체주소소재지전화
번호1.0001.0001.0001.000
사업장명1.0001.0001.0001.000
소재지전체주소1.0001.0001.0001.000
소재지전화1.0001.0001.0001.000

Missing values

2023-12-13T00:55:55.939949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:55:56.061707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호사업장명소재지전체주소소재지전화문화체육업종명비고
01(유)남도관광 고흥영업소전라남도 고흥군 고흥읍 남계리 716-2<NA>국내여행업<NA>
12고흥관광전라남도 고흥군 고흥읍 남계리 663-10번지061-835-3848국내여행업<NA>
23고흥21세기 관광여행사전라남도 고흥군 고흥읍 남계리 633-15번지 2층061-832-9654국내여행업<NA>
34유한회사 하나여행사전라남도 고흥군 고흥읍 서문리 213-12번지061-834-0238국내여행업<NA>
45우주관광(주)전라남도 고흥군 고흥읍 남계리 924-1번지061-835-8700국내여행업<NA>
56(유)푸른바다전라남도 고흥군 도양읍 봉암리 3907번지061-842-2002국내여행업<NA>
67팔영산관광전라남도 고흥군 고흥읍 서문리 218-15번지061-832-0577국내여행업<NA>
78뉴수정관광전라남도 고흥군 과역면 과역리 408-8번지061-833-5424국내여행업<NA>
89뉴동방고속관광전라남도 고흥군 고흥읍 남계리 837-18번지061-834-3999국내여행업<NA>
910소나무여행사전라남도 고흥군 고흥읍 남계리 647-10번지 학원 및 사무실061-834-3057국내여행업<NA>
번호사업장명소재지전체주소소재지전화문화체육업종명비고
1112임해관광전라남도 고흥군 대서면 안남리 625번지061-834-1500국내여행업<NA>
1213남도레저전라남도 고흥군 도양읍 관리 710-3번지061-843-2900국내여행업<NA>
1314나로우주여행사전라남도 고흥군 봉래면 신금리 1256-1번지<NA>국내여행업<NA>
1415유)우주항공여행사전라남도 고흥군 도양읍 봉암리 2225-8번지061-844-6244국내여행업<NA>
1516(주)동방관광여행사전라남도 고흥군 고흥읍 서문리 218-11번지<NA>국내여행업<NA>
1617유한회사 하이투어전라남도 고흥군 고흥읍 남계리 537-10번지 1층061-835-7171국내여행업<NA>
1718플래티늄 투어전라남도 고흥군 과역면 과역리 164-12번지<NA>국내여행업<NA>
1819(유)도수항공여행사전라남도 고흥군 도덕면 신양리 947-1번지061-834-0328국내여행업<NA>
1920(유)한마음투어전라남도 고흥군 고흥읍 남계리 241-7번지061-834-7722국내여행업<NA>
2021대동관광전라남도 고흥군 동강면 유둔리 201-4번지<NA>국내여행업<NA>