Overview

Dataset statistics

Number of variables5
Number of observations23
Missing cells15
Missing cells (%)13.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 KiB
Average record size in memory45.7 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description경상남도 거제시 유원시설업현황(영업종류, 상호명, 소재지, 위도, 경도, 전화번호, 영업상태, 신고일자, 기준일자)등에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3079557

Alerts

업종 is highly imbalanced (61.7%)Imbalance
상호 has 1 (4.3%) missing valuesMissing
소재지(도로명) has 1 (4.3%) missing valuesMissing
연락처 has 12 (52.2%) missing valuesMissing
등록일 has 1 (4.3%) missing valuesMissing

Reproduction

Analysis started2023-12-10 23:19:58.861953
Analysis finished2023-12-10 23:19:59.523228
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

IMBALANCE 

Distinct4
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size316.0 B
기타유원시설업
20 
종합유원시설업
 
1
일반유원시설업
 
1
<NA>
 
1

Length

Max length7
Median length7
Mean length6.8695652
Min length4

Unique

Unique3 ?
Unique (%)13.0%

Sample

1st row종합유원시설업
2nd row일반유원시설업
3rd row기타유원시설업
4th row기타유원시설업
5th row기타유원시설업

Common Values

ValueCountFrequency (%)
기타유원시설업 20
87.0%
종합유원시설업 1
 
4.3%
일반유원시설업 1
 
4.3%
<NA> 1
 
4.3%

Length

2023-12-11T08:19:59.589626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:19:59.685992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타유원시설업 20
87.0%
종합유원시설업 1
 
4.3%
일반유원시설업 1
 
4.3%
na 1
 
4.3%

상호
Text

MISSING 

Distinct22
Distinct (%)100.0%
Missing1
Missing (%)4.3%
Memory size316.0 B
2023-12-11T08:19:59.875205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length8.4545455
Min length3

Characters and Unicode

Total characters186
Distinct characters90
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row오션어드벤처
2nd row거제랜드
3rd row노리파크 거제상동점
4th row롤리폴리(옥포점)
5th row아이점핑
ValueCountFrequency (%)
라크씨엘 2
 
5.7%
오션어드벤처 1
 
2.9%
행복누림문화센터 1
 
2.9%
슈슈봉봉 1
 
2.9%
거제점 1
 
2.9%
바운스 1
 
2.9%
상동키즈랜드 1
 
2.9%
상문동 1
 
2.9%
놀이방 1
 
2.9%
거제랜드 1
 
2.9%
Other values (24) 24
68.6%
2023-12-11T08:20:00.292024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
7.0%
9
 
4.8%
8
 
4.3%
8
 
4.3%
6
 
3.2%
5
 
2.7%
4
 
2.2%
4
 
2.2%
4
 
2.2%
4
 
2.2%
Other values (80) 121
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 159
85.5%
Space Separator 13
 
7.0%
Lowercase Letter 7
 
3.8%
Close Punctuation 3
 
1.6%
Open Punctuation 3
 
1.6%
Uppercase Letter 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
5.7%
8
 
5.0%
8
 
5.0%
6
 
3.8%
5
 
3.1%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (69) 103
64.8%
Lowercase Letter
ValueCountFrequency (%)
c 1
14.3%
l 1
14.3%
a 1
14.3%
n 1
14.3%
e 1
14.3%
y 1
14.3%
o 1
14.3%
Space Separator
ValueCountFrequency (%)
13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
T 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 159
85.5%
Common 19
 
10.2%
Latin 8
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
5.7%
8
 
5.0%
8
 
5.0%
6
 
3.8%
5
 
3.1%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (69) 103
64.8%
Latin
ValueCountFrequency (%)
c 1
12.5%
l 1
12.5%
a 1
12.5%
n 1
12.5%
e 1
12.5%
y 1
12.5%
o 1
12.5%
T 1
12.5%
Common
ValueCountFrequency (%)
13
68.4%
) 3
 
15.8%
( 3
 
15.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 159
85.5%
ASCII 27
 
14.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13
48.1%
) 3
 
11.1%
( 3
 
11.1%
c 1
 
3.7%
l 1
 
3.7%
a 1
 
3.7%
n 1
 
3.7%
e 1
 
3.7%
y 1
 
3.7%
o 1
 
3.7%
Hangul
ValueCountFrequency (%)
9
 
5.7%
8
 
5.0%
8
 
5.0%
6
 
3.8%
5
 
3.1%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (69) 103
64.8%

소재지(도로명)
Text

MISSING 

Distinct22
Distinct (%)100.0%
Missing1
Missing (%)4.3%
Memory size316.0 B
2023-12-11T08:20:00.571193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length37
Mean length30.5
Min length21

Characters and Unicode

Total characters671
Distinct characters102
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row경상남도 거제시 일운면 거제대로 2660
2nd row경상남도 거제시 계룡로 61, 거제포로수용소유적공원 (고현동)
3rd row경상남도 거제시 상동7길 30-1, 상가동 지하1층 101호 (상동동)
4th row경상남도 거제시 옥포로 250 (옥포동)
5th row경상남도 거제시 사등면 두동로 54-8 (나라빌딩 203호(현장사용 302호))
ValueCountFrequency (%)
경상남도 22
 
16.2%
거제시 22
 
16.2%
상동동 5
 
3.7%
고현동 3
 
2.2%
장평동 3
 
2.2%
2층 3
 
2.2%
아주1로 2
 
1.5%
3층 2
 
1.5%
상동5길 2
 
1.5%
수양로 2
 
1.5%
Other values (63) 70
51.5%
2023-12-11T08:20:01.104492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
114
 
17.0%
31
 
4.6%
1 30
 
4.5%
30
 
4.5%
30
 
4.5%
29
 
4.3%
22
 
3.3%
22
 
3.3%
22
 
3.3%
22
 
3.3%
Other values (92) 319
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 377
56.2%
Space Separator 114
 
17.0%
Decimal Number 111
 
16.5%
Open Punctuation 20
 
3.0%
Close Punctuation 20
 
3.0%
Other Punctuation 15
 
2.2%
Dash Punctuation 7
 
1.0%
Uppercase Letter 6
 
0.9%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
8.2%
30
 
8.0%
30
 
8.0%
29
 
7.7%
22
 
5.8%
22
 
5.8%
22
 
5.8%
22
 
5.8%
19
 
5.0%
8
 
2.1%
Other values (71) 142
37.7%
Decimal Number
ValueCountFrequency (%)
1 30
27.0%
2 21
18.9%
0 16
14.4%
4 11
 
9.9%
6 8
 
7.2%
5 7
 
6.3%
3 7
 
6.3%
8 7
 
6.3%
7 4
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
B 1
16.7%
I 1
16.7%
P 1
16.7%
A 1
16.7%
R 1
16.7%
K 1
16.7%
Space Separator
ValueCountFrequency (%)
114
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Other Punctuation
ValueCountFrequency (%)
, 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 377
56.2%
Common 288
42.9%
Latin 6
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
8.2%
30
 
8.0%
30
 
8.0%
29
 
7.7%
22
 
5.8%
22
 
5.8%
22
 
5.8%
22
 
5.8%
19
 
5.0%
8
 
2.1%
Other values (71) 142
37.7%
Common
ValueCountFrequency (%)
114
39.6%
1 30
 
10.4%
2 21
 
7.3%
( 20
 
6.9%
) 20
 
6.9%
0 16
 
5.6%
, 15
 
5.2%
4 11
 
3.8%
6 8
 
2.8%
5 7
 
2.4%
Other values (5) 26
 
9.0%
Latin
ValueCountFrequency (%)
B 1
16.7%
I 1
16.7%
P 1
16.7%
A 1
16.7%
R 1
16.7%
K 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 377
56.2%
ASCII 294
43.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114
38.8%
1 30
 
10.2%
2 21
 
7.1%
( 20
 
6.8%
) 20
 
6.8%
0 16
 
5.4%
, 15
 
5.1%
4 11
 
3.7%
6 8
 
2.7%
5 7
 
2.4%
Other values (11) 32
 
10.9%
Hangul
ValueCountFrequency (%)
31
 
8.2%
30
 
8.0%
30
 
8.0%
29
 
7.7%
22
 
5.8%
22
 
5.8%
22
 
5.8%
22
 
5.8%
19
 
5.0%
8
 
2.1%
Other values (71) 142
37.7%

연락처
Text

MISSING 

Distinct11
Distinct (%)100.0%
Missing12
Missing (%)52.2%
Memory size316.0 B
2023-12-11T08:20:01.323647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters132
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)100.0%

Sample

1st row055-733-7201
2nd row055-950-7885
3rd row055-632-7959
4th row055-688-0234
5th row055-636-1734
ValueCountFrequency (%)
055-733-7201 1
9.1%
055-950-7885 1
9.1%
055-632-7959 1
9.1%
055-688-0234 1
9.1%
055-636-1734 1
9.1%
055-682-5565 1
9.1%
055-636-8229 1
9.1%
055-687-1030 1
9.1%
055-641-9900 1
9.1%
055-635-3995 1
9.1%
2023-12-11T08:20:01.720888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 30
22.7%
- 22
16.7%
0 19
14.4%
6 13
9.8%
3 12
 
9.1%
9 8
 
6.1%
7 7
 
5.3%
8 7
 
5.3%
2 6
 
4.5%
1 5
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 110
83.3%
Dash Punctuation 22
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 30
27.3%
0 19
17.3%
6 13
11.8%
3 12
 
10.9%
9 8
 
7.3%
7 7
 
6.4%
8 7
 
6.4%
2 6
 
5.5%
1 5
 
4.5%
4 3
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 132
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 30
22.7%
- 22
16.7%
0 19
14.4%
6 13
9.8%
3 12
 
9.1%
9 8
 
6.1%
7 7
 
5.3%
8 7
 
5.3%
2 6
 
4.5%
1 5
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 132
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 30
22.7%
- 22
16.7%
0 19
14.4%
6 13
9.8%
3 12
 
9.1%
9 8
 
6.1%
7 7
 
5.3%
8 7
 
5.3%
2 6
 
4.5%
1 5
 
3.8%

등록일
Date

MISSING 

Distinct21
Distinct (%)95.5%
Missing1
Missing (%)4.3%
Memory size316.0 B
Minimum2013-06-12 00:00:00
Maximum2019-10-01 00:00:00
2023-12-11T08:20:01.868375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:20:02.000795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)

Correlations

2023-12-11T08:20:02.127847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호소재지(도로명)연락처등록일
업종1.0001.0001.0001.0001.000
상호1.0001.0001.0001.0001.000
소재지(도로명)1.0001.0001.0001.0001.000
연락처1.0001.0001.0001.0001.000
등록일1.0001.0001.0001.0001.000

Missing values

2023-12-11T08:19:59.187261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:19:59.308877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T08:19:59.447287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종상호소재지(도로명)연락처등록일
0종합유원시설업오션어드벤처경상남도 거제시 일운면 거제대로 2660055-733-72012013-06-12
1일반유원시설업거제랜드경상남도 거제시 계룡로 61, 거제포로수용소유적공원 (고현동)055-950-78852019-05-03
2기타유원시설업노리파크 거제상동점경상남도 거제시 상동7길 30-1, 상가동 지하1층 101호 (상동동)055-632-79592014-08-07
3기타유원시설업롤리폴리(옥포점)경상남도 거제시 옥포로 250 (옥포동)055-688-02342014-09-11
4기타유원시설업아이점핑경상남도 거제시 사등면 두동로 54-8 (나라빌딩 203호(현장사용 302호))<NA>2016-01-05
5기타유원시설업인형뽑기(Toy clane)경상남도 거제시 아주1로 27 (아주동)<NA>2016-10-10
6기타유원시설업스타스페이스경상남도 거제시 고현로11길 20 (고현동, 엠파크)<NA>2016-12-16
7기타유원시설업차타타 거제장평점경상남도 거제시 장평4로 63 (장평동)055-636-17342017-01-19
8기타유원시설업빠방 키즈레이싱 파크경상남도 거제시 아주1로 8, 3층 (아주동, 스타타워2차)055-682-55652017-04-04
9기타유원시설업아지트경상남도 거제시 덕포5길 26 (덕포동)<NA>2017-07-12
업종상호소재지(도로명)연락처등록일
13기타유원시설업슈슈봉봉 거제점경상남도 거제시 중곡1로 48, 해원빌딩 2층 (고현동)055-641-99002018-09-19
14기타유원시설업바운스경상남도 거제시 장목면 거제북로 2501-40, 한화리조트거제벨버디어<NA>2018-09-21
15기타유원시설업상동키즈랜드경상남도 거제시 상동5길 117-40, 210호 (상동동)055-635-39952018-10-18
16기타유원시설업라크씨엘경상남도 거제시 수양로 448-21, 세정빌딩 201호 (수월동)<NA>2018-11-21
17기타유원시설업상문동 행복누림문화센터 놀이방경상남도 거제시 거제중앙로 1641, 행복누림문화센터 2층 (상동동)055-636-71302019-03-27
18기타유원시설업거제오션뷰카트장경상남도 거제시 일운면 거제대로 1514-18<NA>2019-03-27
19기타유원시설업라크씨엘 상동점경상남도 거제시 상동5길 84, 3층 (상동동)<NA>2019-04-11
20기타유원시설업뽀로로 맘앤키즈카페 거제디큐브백화점경상남도 거제시 장평로 12, 디큐브백화점 8층 (장평동)<NA>2019-04-26
21기타유원시설업수영하는 아기경상남도 거제시 수양로 110, 601동 B201~3호 (양정동, 거제2차 IPARK)<NA>2019-10-01
22<NA><NA><NA><NA><NA>