Overview

Dataset statistics

Number of variables7
Number of observations41
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory61.1 B

Variable types

Categorical3
Text3
Numeric1

Dataset

Description충청북도 내에 소재하고 있는 골프장에 대한 정보를 제공합니다. (종류(회원제, 대중제, 비회원제), 골프장명, 도로명주소, 규모, 홀 수, 사업자 법적위치, 사업자명)
Author충청북도
URLhttps://www.data.go.kr/data/15071058/fileData.do

Alerts

규모(제곱미터) is highly overall correlated with 홀 수High correlation
홀 수 is highly overall correlated with 규모(제곱미터)High correlation
사업자 법적위치 is highly imbalanced (79.2%)Imbalance
규모(제곱미터) has unique valuesUnique

Reproduction

Analysis started2024-03-14 19:09:19.747911
Analysis finished2024-03-14 19:09:20.930119
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

종류
Categorical

Distinct3
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size456.0 B
대중제
33 
회원제
비회원제(대중형X)
 
3

Length

Max length10
Median length3
Mean length3.5121951
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row회원제
2nd row대중제
3rd row회원제
4th row회원제
5th row대중제

Common Values

ValueCountFrequency (%)
대중제 33
80.5%
회원제 5
 
12.2%
비회원제(대중형X) 3
 
7.3%

Length

2024-03-15T04:09:21.061191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T04:09:21.273505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대중제 33
80.5%
회원제 5
 
12.2%
비회원제(대중형x 3
 
7.3%
Distinct39
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size456.0 B
2024-03-15T04:09:22.229811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length6.6341463
Min length2

Characters and Unicode

Total characters272
Distinct characters111
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)90.2%

Sample

1st row그랜드
2nd row에스엘세레스 임페리얼레이크
3rd row디 에머슨(구중앙)
4th row천 룡
5th row천 룡
ValueCountFrequency (%)
2
 
3.9%
시그너스 2
 
3.9%
2
 
3.9%
힐데스하임 1
 
2.0%
음성 1
 
2.0%
스타cc(구-샹떼힐 1
 
2.0%
올데이골프앤리조트(구-로얄포레 1
 
2.0%
모나크cc 1
 
2.0%
세레니티cc 1
 
2.0%
그랜드 1
 
2.0%
Other values (38) 38
74.5%
2024-03-15T04:09:23.409029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
5.9%
12
 
4.4%
( 11
 
4.0%
10
 
3.7%
) 10
 
3.7%
C 8
 
2.9%
8
 
2.9%
- 7
 
2.6%
6
 
2.2%
6
 
2.2%
Other values (101) 178
65.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 224
82.4%
Open Punctuation 11
 
4.0%
Space Separator 10
 
3.7%
Close Punctuation 10
 
3.7%
Uppercase Letter 8
 
2.9%
Dash Punctuation 7
 
2.6%
Lowercase Letter 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
7.1%
12
 
5.4%
8
 
3.6%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
4
 
1.8%
Other values (95) 153
68.3%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 224
82.4%
Common 38
 
14.0%
Latin 10
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
7.1%
12
 
5.4%
8
 
3.6%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
4
 
1.8%
Other values (95) 153
68.3%
Common
ValueCountFrequency (%)
( 11
28.9%
10
26.3%
) 10
26.3%
- 7
18.4%
Latin
ValueCountFrequency (%)
C 8
80.0%
c 2
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 224
82.4%
ASCII 48
 
17.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
7.1%
12
 
5.4%
8
 
3.6%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
4
 
1.8%
Other values (95) 153
68.3%
ASCII
ValueCountFrequency (%)
( 11
22.9%
10
20.8%
) 10
20.8%
C 8
16.7%
- 7
14.6%
c 2
 
4.2%
Distinct40
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size456.0 B
2024-03-15T04:09:24.586682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length17.439024
Min length12

Characters and Unicode

Total characters715
Distinct characters125
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)95.1%

Sample

1st row청주시 청원구 오창읍 꽃화산길 51-20
2nd row충주시 금가면 다래울길 52
3rd row진천군 백곡면 배티로 818-105
4th row진천군 이월면 진안로 347-123
5th row진천군 이월면 진안로 425
ValueCountFrequency (%)
충주시 14
 
7.9%
음성군 8
 
4.5%
청주시 7
 
4.0%
진천군 6
 
3.4%
앙성면 5
 
2.8%
4
 
2.3%
삼성면 3
 
1.7%
백곡면 2
 
1.1%
서원구 2
 
1.1%
대소원면 2
 
1.1%
Other values (111) 124
70.1%
2024-03-15T04:09:26.207018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
136
 
19.0%
31
 
4.3%
1 30
 
4.2%
24
 
3.4%
23
 
3.2%
21
 
2.9%
3 21
 
2.9%
19
 
2.7%
18
 
2.5%
2 16
 
2.2%
Other values (115) 376
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 426
59.6%
Decimal Number 141
 
19.7%
Space Separator 136
 
19.0%
Dash Punctuation 12
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
7.3%
24
 
5.6%
23
 
5.4%
21
 
4.9%
19
 
4.5%
18
 
4.2%
16
 
3.8%
14
 
3.3%
12
 
2.8%
10
 
2.3%
Other values (103) 238
55.9%
Decimal Number
ValueCountFrequency (%)
1 30
21.3%
3 21
14.9%
2 16
11.3%
4 15
10.6%
5 15
10.6%
8 13
9.2%
7 9
 
6.4%
0 9
 
6.4%
9 7
 
5.0%
6 6
 
4.3%
Space Separator
ValueCountFrequency (%)
136
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 426
59.6%
Common 289
40.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
7.3%
24
 
5.6%
23
 
5.4%
21
 
4.9%
19
 
4.5%
18
 
4.2%
16
 
3.8%
14
 
3.3%
12
 
2.8%
10
 
2.3%
Other values (103) 238
55.9%
Common
ValueCountFrequency (%)
136
47.1%
1 30
 
10.4%
3 21
 
7.3%
2 16
 
5.5%
4 15
 
5.2%
5 15
 
5.2%
8 13
 
4.5%
- 12
 
4.2%
7 9
 
3.1%
0 9
 
3.1%
Other values (2) 13
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 426
59.6%
ASCII 289
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
136
47.1%
1 30
 
10.4%
3 21
 
7.3%
2 16
 
5.5%
4 15
 
5.2%
5 15
 
5.2%
8 13
 
4.5%
- 12
 
4.2%
7 9
 
3.1%
0 9
 
3.1%
Other values (2) 13
 
4.5%
Hangul
ValueCountFrequency (%)
31
 
7.3%
24
 
5.6%
23
 
5.4%
21
 
4.9%
19
 
4.5%
18
 
4.2%
16
 
3.8%
14
 
3.3%
12
 
2.8%
10
 
2.3%
Other values (103) 238
55.9%

규모(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1019991.8
Minimum263900
Maximum2180205
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size497.0 B
2024-03-15T04:09:26.615783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum263900
5-th percentile371280
Q1880653
median999268
Q31200500
95-th percentile1567859
Maximum2180205
Range1916305
Interquartile range (IQR)319847

Descriptive statistics

Standard deviation382640.96
Coefficient of variation (CV)0.37514122
Kurtosis1.2806555
Mean1019991.8
Median Absolute Deviation (MAD)168936
Skewness0.30826114
Sum41819663
Variance1.4641411 × 1011
MonotonicityNot monotonic
2024-03-15T04:09:27.068757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
1696299 1
 
2.4%
880653 1
 
2.4%
896185 1
 
2.4%
926019 1
 
2.4%
1167916 1
 
2.4%
1372015 1
 
2.4%
371280 1
 
2.4%
1421784 1
 
2.4%
828506 1
 
2.4%
1047645 1
 
2.4%
Other values (31) 31
75.6%
ValueCountFrequency (%)
263900 1
2.4%
345018 1
2.4%
371280 1
2.4%
391643 1
2.4%
421101 1
2.4%
465281 1
2.4%
702082 1
2.4%
828506 1
2.4%
830332 1
2.4%
870253 1
2.4%
ValueCountFrequency (%)
2180205 1
2.4%
1696299 1
2.4%
1567859 1
2.4%
1434573 1
2.4%
1421784 1
2.4%
1415008 1
2.4%
1372015 1
2.4%
1371052 1
2.4%
1253344 1
2.4%
1245242 1
2.4%

홀 수
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Memory size456.0 B
18
20 
27
13 
9
28
 
1
36
 
1

Length

Max length2
Median length2
Mean length1.8536585
Min length1

Unique

Unique2 ?
Unique (%)4.9%

Sample

1st row27
2nd row18
3rd row28
4th row27
5th row9

Common Values

ValueCountFrequency (%)
18 20
48.8%
27 13
31.7%
9 6
 
14.6%
28 1
 
2.4%
36 1
 
2.4%

Length

2024-03-15T04:09:27.433087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T04:09:27.636409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
18 20
48.8%
27 13
31.7%
9 6
 
14.6%
28 1
 
2.4%
36 1
 
2.4%

사업자 법적위치
Categorical

IMBALANCE 

Distinct3
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size456.0 B
주식회사
39 
유한회사
 
1
사단법인(비영리)
 
1

Length

Max length9
Median length4
Mean length4.1219512
Min length4

Unique

Unique2 ?
Unique (%)4.9%

Sample

1st row주식회사
2nd row주식회사
3rd row주식회사
4th row주식회사
5th row주식회사

Common Values

ValueCountFrequency (%)
주식회사 39
95.1%
유한회사 1
 
2.4%
사단법인(비영리) 1
 
2.4%

Length

2024-03-15T04:09:28.036773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T04:09:28.374203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주식회사 39
95.1%
유한회사 1
 
2.4%
사단법인(비영리 1
 
2.4%
Distinct36
Distinct (%)87.8%
Missing0
Missing (%)0.0%
Memory size456.0 B
2024-03-15T04:09:29.262898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length7.8780488
Min length5

Characters and Unicode

Total characters323
Distinct characters108
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)75.6%

Sample

1st row청주개발(주)
2nd row(주)에스엘세레스
3rd row중앙관광개발(주)
4th row천룡종합개발(주)
5th row천룡종합개발(주)
ValueCountFrequency (%)
주)에스엘세레스 2
 
4.9%
주)대영베이스 2
 
4.9%
주)이도 2
 
4.9%
천룡종합개발(주 2
 
4.9%
주)시그너스cc 2
 
4.9%
골프존카운티(주 1
 
2.4%
주)다음홀딩스 1
 
2.4%
주)블랙스톤에듀팜리조트 1
 
2.4%
충주기업도시(주 1
 
2.4%
캄코(주 1
 
2.4%
Other values (26) 26
63.4%
2024-03-15T04:09:30.353289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41
 
12.7%
( 40
 
12.4%
) 40
 
12.4%
13
 
4.0%
10
 
3.1%
10
 
3.1%
8
 
2.5%
6
 
1.9%
5
 
1.5%
4
 
1.2%
Other values (98) 146
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 239
74.0%
Open Punctuation 40
 
12.4%
Close Punctuation 40
 
12.4%
Lowercase Letter 4
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
17.2%
13
 
5.4%
10
 
4.2%
10
 
4.2%
8
 
3.3%
6
 
2.5%
5
 
2.1%
4
 
1.7%
3
 
1.3%
3
 
1.3%
Other values (95) 136
56.9%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 239
74.0%
Common 80
 
24.8%
Latin 4
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
17.2%
13
 
5.4%
10
 
4.2%
10
 
4.2%
8
 
3.3%
6
 
2.5%
5
 
2.1%
4
 
1.7%
3
 
1.3%
3
 
1.3%
Other values (95) 136
56.9%
Common
ValueCountFrequency (%)
( 40
50.0%
) 40
50.0%
Latin
ValueCountFrequency (%)
c 4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 239
74.0%
ASCII 84
 
26.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
41
 
17.2%
13
 
5.4%
10
 
4.2%
10
 
4.2%
8
 
3.3%
6
 
2.5%
5
 
2.1%
4
 
1.7%
3
 
1.3%
3
 
1.3%
Other values (95) 136
56.9%
ASCII
ValueCountFrequency (%)
( 40
47.6%
) 40
47.6%
c 4
 
4.8%

Interactions

2024-03-15T04:09:20.308632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T04:09:30.514162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종류골프장명소재지도로명주소규모(제곱미터)홀 수사업자 법적위치사업자명
종류1.0000.7241.0000.7720.1750.0000.949
골프장명0.7241.0001.0000.9610.7491.0001.000
소재지도로명주소1.0001.0001.0000.9610.9171.0001.000
규모(제곱미터)0.7720.9610.9611.0000.7750.0000.909
홀 수0.1750.7490.9170.7751.0000.0580.912
사업자 법적위치0.0001.0001.0000.0000.0581.0001.000
사업자명0.9491.0001.0000.9090.9121.0001.000
2024-03-15T04:09:30.743323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
홀 수종류사업자 법적위치
홀 수1.0000.1190.000
종류0.1191.0000.000
사업자 법적위치0.0000.0001.000
2024-03-15T04:09:30.947618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
규모(제곱미터)종류홀 수사업자 법적위치
규모(제곱미터)1.0000.4400.5570.000
종류0.4401.0000.1190.000
홀 수0.5570.1191.0000.000
사업자 법적위치0.0000.0000.0001.000

Missing values

2024-03-15T04:09:20.638583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T04:09:20.847555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

종류골프장명소재지도로명주소규모(제곱미터)홀 수사업자 법적위치사업자명
0회원제그랜드청주시 청원구 오창읍 꽃화산길 51-20169629927주식회사청주개발(주)
1대중제에스엘세레스 임페리얼레이크충주시 금가면 다래울길 5283033218주식회사(주)에스엘세레스
2회원제디 에머슨(구중앙)진천군 백곡면 배티로 818-105115192428주식회사중앙관광개발(주)
3회원제천 룡진천군 이월면 진안로 347-123115310727주식회사천룡종합개발(주)
4대중제천 룡진천군 이월면 진안로 4252639009주식회사천룡종합개발(주)
5대중제시그너스충주시 앙성면 중방곡길 57-443450189주식회사(주)시그너스cc
6대중제시그너스충주시 앙성면 중방곡길 57-4489338318주식회사(주)시그너스cc
7대중제떼제베청주시 흥덕구 옥산면 동림2딜 149156785936주식회사옥산레제(주)
8대중제썬밸리음성군 삼성면 범말길 49번지94497618주식회사(주)연흥개발
9회원제실크리버청주시 서원구 남이면 문곡구절골길 235107515118주식회사다옴홀딩스(주)
종류골프장명소재지도로명주소규모(제곱미터)홀 수사업자 법적위치사업자명
31대중제세일충주시 신니면 동락길 20788065318주식회사세일개발(주)
32대중제클럽디보은보은군 보은읍 장속중초로 38694412818주식회사(주)이도
33대중제올데이(구-제피로스)충주시 앙성면 조천리 산9-1137105227주식회사(주)엘스엘세레스
34대중제일레븐충주시 앙성면 본평리 산 43-1124524218주식회사(주)일레븐건설
35대중제감곡cc음성군 감고면 문촌리 산 81-6102311418주식회사캄코(주)
36대중제킹스데일충주시 주덕읍 기업도시 3로 287025318주식회사충주기업도시(주)
37대중제블랙스톤((에듀팜)증평군 도안면 벼루재길 33570208218주식회사(주)블랙스톤에듀팜리조트
38비회원제(대중형X)세레니티CC청주시 서원구 남이면 산막리 산 1284652819주식회사(주)다음홀딩스
39비회원제(대중형X)모나크CC음성군 금왕읍 대금로 1851번길 3-1496046218주식회사(주)남경레저
40비회원제(대중형X)음성 힐데스하임 CC음성군 소이면 후미리 산 40-4101587027주식회사(주)한국토지신탁