Overview

Dataset statistics

Number of variables: 5
Number of observations: 68
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 41.9 B

Variable types

Text: 2
Categorical: 2
DateTime: 1

Dataset

Description: Overview of public sports facilities (golf courses, ski resorts, etc.) in Gangwon-do
Author: Gangwon-do (강원도)
URL: https://www.data.go.kr/data/3069957/fileData.do

Alerts

영업구분 has constant value "영업/정상" [Constant]
분류 is highly imbalanced (60.2%) [Imbalance]
시설명 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 13:59:27.589862
Analysis finished: 2023-12-12 13:59:28.219659
Duration: 0.63 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시설명 (facility name)
Text

UNIQUE 

Distinct: 68
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 676.0 B

Length

Max length: 16
Median length: 13
Mean length: 9.7352941
Min length: 5

Characters and Unicode

Total characters: 662
Distinct characters: 134
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
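As a rough illustration of the character properties mentioned above, the following sketch tallies Unicode general categories with Python's standard `unicodedata` module for two of the sample values in this report. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiling tool derives its figures.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (e.g. 'Lo', 'Zs', 'Lu') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# Two sample values from the 시설명 column of this report.
print(category_counts("강촌 컨트리클럽"))
print(category_counts("남춘천CC"))
```

Here `Lo` corresponds to "Other Letter" (Hangul syllables fall in this category), `Zs` to "Space Separator", and `Lu` to "Uppercase Letter".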

Unique

Unique: 68
Unique (%): 100.0%

Sample

1st row: 강촌 컨트리클럽
2nd row: 강촌리조트 대중골프장
3rd row: 골든비치 컨트리클럽
4th row: 남춘천CC
5th row: 남춘천컨트리클럽
Value | Count | Frequency (%)
컨트리클럽 | 21 | 15.9%
골프클럽 | 8 | 6.1%
대중골프장 | 5 | 3.8%
휘닉스 | 3 | 2.3%
용평리조트 | 3 | 2.3%
스키장 | 3 | 2.3%
알펜시아 | 3 | 2.3%
강촌 | 2 | 1.5%
횡성 | 2 | 1.5%
하이원 | 2 | 1.5%
Other values (74) | 80 | 60.6%
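The word table above looks like a count of whitespace-separated tokens; that reading is an assumption based on the values listed, not something the report states. A minimal sketch of such a count, using the five sample rows shown for 시설명 and a hypothetical helper `word_counts`:

```python
from collections import Counter


def word_counts(values):
    """Split each value on whitespace and count the resulting tokens."""
    return Counter(token for value in values for token in value.split())


# The five sample rows listed for 시설명 (not the full 68-row column).
sample = [
    "강촌 컨트리클럽",
    "강촌리조트 대중골프장",
    "골든비치 컨트리클럽",
    "남춘천CC",
    "남춘천컨트리클럽",
]
counts = word_counts(sample)
print(counts.most_common(3))
```

Note that "남춘천컨트리클럽" contains no space, so it counts as a single token and does not add to the tally for "컨트리클럽".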

Most occurring characters

Value | Count | Frequency (%)
(space) | 64 | 9.7%
[…] | 54 | 8.2%
[…] | 38 | 5.7%
[…] | 38 | 5.7%
[…] | 36 | 5.4%
[…] | 27 | 4.1%
[…] | 27 | 4.1%
[…] | 27 | 4.1%
[…] | 24 | 3.6%
[…] | 12 | 1.8%
Other values (124) | 315 | 47.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 574 | 86.7%
Space Separator | 64 | 9.7%
Uppercase Letter | 10 | 1.5%
Decimal Number | 8 | 1.2%
Other Punctuation | 6 | 0.9%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[…] | 54 | 9.4%
[…] | 38 | 6.6%
[…] | 38 | 6.6%
[…] | 36 | 6.3%
[…] | 27 | 4.7%
[…] | 27 | 4.7%
[…] | 27 | 4.7%
[…] | 24 | 4.2%
[…] | 12 | 2.1%
[…] | 10 | 1.7%
Other values (114) | 281 | 49.0%

Decimal Number
Value | Count | Frequency (%)
2 | 3 | 37.5%
0 | 2 | 25.0%
1 | 2 | 25.0%
7 | 1 | 12.5%

Other Punctuation
Value | Count | Frequency (%)
& | 3 | 50.0%
. | 2 | 33.3%
[…] | 1 | 16.7%

Uppercase Letter
Value | Count | Frequency (%)
C | 9 | 90.0%
G | 1 | 10.0%

Space Separator
Value | Count | Frequency (%)
(space) | 64 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 574 | 86.7%
Common | 78 | 11.8%
Latin | 10 | 1.5%

Most frequent character per script

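Hangul
Value | Count | Frequency (%)
[…] | 54 | 9.4%
[…] | 38 | 6.6%
[…] | 38 | 6.6%
[…] | 36 | 6.3%
[…] | 27 | 4.7%
[…] | 27 | 4.7%
[…] | 27 | 4.7%
[…] | 24 | 4.2%
[…] | 12 | 2.1%
[…] | 10 | 1.7%
Other values (114) | 281 | 49.0%

Common
Value | Count | Frequency (%)
(space) | 64 | 82.1%
& | 3 | 3.8%
2 | 3 | 3.8%
0 | 2 | 2.6%
1 | 2 | 2.6%
. | 2 | 2.6%
[…] | 1 | 1.3%
7 | 1 | 1.3%

Latin
Value | Count | Frequency (%)
C | 9 | 90.0%
G | 1 | 10.0%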

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 574 | 86.7%
ASCII | 87 | 13.1%
None | 1 | 0.2%

Most frequent character per block

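ASCII
Value | Count | Frequency (%)
(space) | 64 | 73.6%
C | 9 | 10.3%
& | 3 | 3.4%
2 | 3 | 3.4%
0 | 2 | 2.3%
1 | 2 | 2.3%
. | 2 | 2.3%
G | 1 | 1.1%
7 | 1 | 1.1%

Hangul
Value | Count | Frequency (%)
[…] | 54 | 9.4%
[…] | 38 | 6.6%
[…] | 38 | 6.6%
[…] | 36 | 6.3%
[…] | 27 | 4.7%
[…] | 27 | 4.7%
[…] | 27 | 4.7%
[…] | 24 | 4.2%
[…] | 12 | 2.1%
[…] | 10 | 1.7%
Other values (114) | 281 | 49.0%

None
Value | Count | Frequency (%)
[…] | 1 | 100.0%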

분류 (category)
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Memory size: 676.0 B

Length

Max length: 7
Median length: 3
Mean length: 3.0588235
Min length: 3

Unique

Unique: 1
Unique (%): 1.5%
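The "Distinct" and "Unique" figures measure different things. Judging from the numbers in this report, "Distinct" counts the different values present, while "Unique" counts the values that occur exactly once; the second reading is inferred from the figures rather than stated here. A small sketch using the counts from the Common Values table for 분류:

```python
from collections import Counter

# Value counts as listed in the Common Values table for 분류.
value_counts = Counter({"골프장": 59, "스키장": 8, "자동차 경주장": 1})
n_rows = sum(value_counts.values())

# Distinct: how many different values appear at all.
distinct = len(value_counts)
# Unique (assumed meaning): how many values appear exactly once.
unique = sum(1 for count in value_counts.values() if count == 1)

print(distinct, f"{100 * distinct / n_rows:.1f}%")
print(unique, f"{100 * unique / n_rows:.1f}%")
```

Under that reading the sketch reproduces the 3 (4.4%) and 1 (1.5%) reported for this variable.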

Sample

1st row: 골프장
2nd row: 골프장
3rd row: 골프장
4th row: 골프장
5th row: 골프장

Common Values

Value | Count | Frequency (%)
골프장 (golf course) | 59 | 86.8%
스키장 (ski resort) | 8 | 11.8%
자동차 경주장 (motor racing circuit) | 1 | 1.5%
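The 60.2% figure in the imbalance alert for 분류 is consistent with one minus the normalized Shannon entropy of these counts. That formula is an assumption checked against the numbers here, not something the report states, and `imbalance_score` is a hypothetical helper name.

```python
from math import log


def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is scored as 0 in this sketch
    entropy = -sum((c / total) * log(c / total) for c in counts if c > 0)
    return 1 - entropy / log(k)


# Counts from the Common Values table: 골프장, 스키장, 자동차 경주장.
score = imbalance_score([59, 8, 1])
print(f"{score:.1%}")
```

Three equally sized classes would score 0, so the score rises as observations concentrate in a single class.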

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
골프장 | 59 | 85.5%
스키장 | 8 | 11.6%
자동차 | 1 | 1.4%
경주장 | 1 | 1.4%

도로명주소 (road-name address)
Text

Distinct: 56
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Memory size: 676.0 B

Length

Max length: 44
Median length: 30
Mean length: 21.294118
Min length: 18

Characters and Unicode

Total characters: 1448
Distinct characters: 153
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 46
Unique (%): 67.6%

Sample

1st row: 강원도 춘천시 남산면 북한강변길 688
2nd row: 강원도 춘천시 남산면 북한강변길 688
3rd row: 강원도 양양군 손양면 공항로 230
4th row: 강원도 춘천시 신동면 오봉길 156
5th row: 강원도 춘천시 신동면 오봉길 156
Value | Count | Frequency (%)
강원도 | 68 | 19.5%
춘천시 | 14 | 4.0%
평창군 | 10 | 2.9%
원주시 | 9 | 2.6%
홍천군 | 9 | 2.6%
대관령면 | 7 | 2.0%
남산면 | 6 | 1.7%
횡성군 | 6 | 1.7%
서면 | 6 | 1.7%
지정면 | 5 | 1.4%
Other values (150) | 208 | 59.8%

Most occurring characters

Value | Count | Frequency (%)
(space) | 280 | 19.3%
[…] | 85 | 5.9%
[…] | 75 | 5.2%
[…] | 70 | 4.8%
[…] | 53 | 3.7%
1 | 45 | 3.1%
[…] | 38 | 2.6%
[…] | 36 | 2.5%
2 | 34 | 2.3%
[…] | 33 | 2.3%
Other values (143) | 699 | 48.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 899 | 62.1%
Space Separator | 280 | 19.3%
Decimal Number | 240 | 16.6%
Dash Punctuation | 14 | 1.0%
Close Punctuation | 7 | 0.5%
Open Punctuation | 7 | 0.5%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[…] | 85 | 9.5%
[…] | 75 | 8.3%
[…] | 70 | 7.8%
[…] | 53 | 5.9%
[…] | 38 | 4.2%
[…] | 36 | 4.0%
[…] | 33 | 3.7%
[…] | 33 | 3.7%
[…] | 25 | 2.8%
[…] | 23 | 2.6%
Other values (128) | 428 | 47.6%

Decimal Number
Value | Count | Frequency (%)
1 | 45 | 18.8%
2 | 34 | 14.2%
6 | 26 | 10.8%
3 | 24 | 10.0%
5 | 24 | 10.0%
0 | 21 | 8.8%
8 | 21 | 8.8%
4 | 16 | 6.7%
7 | 15 | 6.2%
9 | 14 | 5.8%

Space Separator
Value | Count | Frequency (%)
(space) | 280 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 14 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 7 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 7 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 899 | 62.1%
Common | 549 | 37.9%

Most frequent character per script

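Hangul
Value | Count | Frequency (%)
[…] | 85 | 9.5%
[…] | 75 | 8.3%
[…] | 70 | 7.8%
[…] | 53 | 5.9%
[…] | 38 | 4.2%
[…] | 36 | 4.0%
[…] | 33 | 3.7%
[…] | 33 | 3.7%
[…] | 25 | 2.8%
[…] | 23 | 2.6%
Other values (128) | 428 | 47.6%

Common
Value | Count | Frequency (%)
(space) | 280 | 51.0%
1 | 45 | 8.2%
2 | 34 | 6.2%
6 | 26 | 4.7%
3 | 24 | 4.4%
5 | 24 | 4.4%
0 | 21 | 3.8%
8 | 21 | 3.8%
4 | 16 | 2.9%
7 | 15 | 2.7%
Other values (5) | 43 | 7.8%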

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 899 | 62.1%
ASCII | 549 | 37.9%

Most frequent character per block

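ASCII
Value | Count | Frequency (%)
(space) | 280 | 51.0%
1 | 45 | 8.2%
2 | 34 | 6.2%
6 | 26 | 4.7%
3 | 24 | 4.4%
5 | 24 | 4.4%
0 | 21 | 3.8%
8 | 21 | 3.8%
4 | 16 | 2.9%
7 | 15 | 2.7%
Other values (5) | 43 | 7.8%

Hangul
Value | Count | Frequency (%)
[…] | 85 | 9.5%
[…] | 75 | 8.3%
[…] | 70 | 7.8%
[…] | 53 | 5.9%
[…] | 38 | 4.2%
[…] | 36 | 4.0%
[…] | 33 | 3.7%
[…] | 33 | 3.7%
[…] | 25 | 2.8%
[…] | 23 | 2.6%
Other values (128) | 428 | 47.6%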

허가일자 (permit date)
DateTime

Distinct: 58
Distinct (%): 85.3%
Missing: 0
Missing (%): 0.0%
Memory size: 676.0 B
Minimum: 2006-12-11 00:00:00
Maximum: 2019-07-19 00:00:00
Histogram with fixed size bins (bins=50)

영업구분 (business status)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 676.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업/정상
2nd row: 영업/정상
3rd row: 영업/정상
4th row: 영업/정상
5th row: 영업/정상

Common Values

Value | Count | Frequency (%)
영업/정상 (operating/normal) | 68 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
영업/정상 | 68 | 100.0%

Correlations

Variable | 시설명 | 분류 | 도로명주소 | 허가일자
시설명 | 1.000 | 1.000 | 1.000 | 1.000
분류 | 1.000 | 1.000 | 0.000 | 0.755
도로명주소 | 1.000 | 0.000 | 1.000 | 0.929
허가일자 | 1.000 | 0.755 | 0.929 | 1.000
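The report does not say which association measure fills this matrix. One common choice for categorical pairs is Cramér's V, so the sketch below is only an illustration under that assumption. It implements the plain statistic without bias correction, on hypothetical toy data, to show why a column whose values are all different (as 시설명 is) would score 1.000 against any non-constant column under that measure. The function name `cramers_v` is also hypothetical.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain Cramér's V (no bias correction) for two equal-length sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # assumption: undefined for a constant column, reported as 0
    return sqrt(chi2 / (n * k))


# Toy data (hypothetical): every id is different, so it determines the label.
ids = ["a", "b", "c", "d"]
labels = ["golf", "golf", "ski", "ski"]
print(cramers_v(ids, labels))
```

A pair of independent columns gives 0 under the same function, for example `cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"])`.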

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 시설명 | 분류 | 도로명주소 | 허가일자 | 영업구분
0 | 강촌 컨트리클럽 | 골프장 | 강원도 춘천시 남산면 북한강변길 688 | 2008-04-22 | 영업/정상
1 | 강촌리조트 대중골프장 | 골프장 | 강원도 춘천시 남산면 북한강변길 688 | 2009-04-21 | 영업/정상
2 | 골든비치 컨트리클럽 | 골프장 | 강원도 양양군 손양면 공항로 230 | 2007-08-20 | 영업/정상
3 | 남춘천CC | 골프장 | 강원도 춘천시 신동면 오봉길 156 | 2011-06-20 | 영업/정상
4 | 남춘천컨트리클럽 | 골프장 | 강원도 춘천시 신동면 오봉길 156 | 2011-06-08 | 영업/정상
5 | 대명비발디파크 대중골프장 | 골프장 | 강원도 홍천군 서면 한치골길 262 | 2008-02-25 | 영업/정상
6 | 대명설악컨트리클럽 | 골프장 | 강원도 고성군 토성면 미시령옛길 1153 | 2008-02-25 | 영업/정상
7 | 더플레이어스 골프클럽 | 골프장 | 강원도 춘천시 동산면 새술막길 438 | 2012-11-07 | 영업/정상
8 | 델피노 컨트리클럽 | 골프장 | 강원도 고성군 토성면 미시령옛길 1153 | 2012-06-26 | 영업/정상
9 | 동강시스타 | 골프장 | 강원도 영월군 영월읍 사지막길 160 | 2010-11-11 | 영업/정상

Last rows

Index | 시설명 | 분류 | 도로명주소 | 허가일자 | 영업구분
58 | 하이원 컨트리클럽 | 골프장 | 강원도 정선군 고한읍 하이원길 265 | 2009-04-21 | 영업/정상
59 | 한탄강 컨트리클럽 | 골프장 | 강원도 철원군 갈말읍 순담길 59 | 2013-07-08 | 영업/정상
60 | 홍천골프장 | 골프장 | 강원도 홍천군 홍천읍 높은터로 533 | 2010-04-16 | 영업/정상
61 | 횡성 섬강벨라스톤 컨트리클럽 | 골프장 | 강원도 횡성군 서원면 옥계9길 124 | 2011-04-20 | 영업/정상
62 | 횡성 옥스필드 컨트리클럽 | 골프장 | 강원도 횡성군 서원면 경강로유현6길 28 | 2011-10-05 | 영업/정상
63 | 휘닉스 대중골프장 | 골프장 | 강원도 평창군 봉평면 태기로 174 | 2009-03-11 | 영업/정상
64 | 휘닉스 스노우파크 | 스키장 | 강원도 평창군 봉평면 태기로 174 | 2007-01-04 | 영업/정상
65 | 휘닉스 컨트리클럽 | 골프장 | 강원도 평창군 봉평면 태기로 227-84 | 2009-03-11 | 영업/정상
66 | 휘슬링 락 컨트리클럽 | 골프장 | 강원도 춘천시 남산면 동촌로 501 | 2011-09-01 | 영업/정상
67 | 힐드로사이 컨트리클럽 | 골프장 | 강원도 홍천군 남면 한서로 2840 | 2011-08-08 | 영업/정상