Overview

Dataset statistics

Number of variables6
Number of observations533
Missing cells263
Missing cells (%)8.2%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory25.1 KiB
Average record size in memory48.2 B

Variable types

Categorical5
Text1

Dataset

Description제주특별자치도 내에 소재하고 있는 문화 예술 단체에 대한 데이터로 구분, 상세, 단체명, 행정시, 읍면동 정보를 제공합니다.
Author제주특별자치도
URLhttps://www.data.go.kr/data/3083524/fileData.do

Alerts

Dataset has 1 (0.2%) duplicate rowsDuplicates
읍면동 has a high cardinality: 51 distinct valuesHigh cardinality
데이터기준일자 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
상세 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
구분 is highly overall correlated with 상세 and 3 other fieldsHigh correlation
읍면동 is highly overall correlated with 구분 and 2 other fieldsHigh correlation
행정시 is highly overall correlated with 구분 and 2 other fieldsHigh correlation
단체명 has 263 (49.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 05:54:47.555586
Analysis finished2023-12-12 05:54:48.178081
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
<NA>
319 
사단·재단법인총괄
214 

Length

Max length9
Median length4
Mean length6.0075047
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사단·재단법인총괄
2nd row사단·재단법인총괄
3rd row사단·재단법인총괄
4th row사단·재단법인총괄
5th row사단·재단법인총괄

Common Values

ValueCountFrequency (%)
<NA> 319
59.8%
사단·재단법인총괄 214
40.2%

Length

2023-12-12T14:54:48.251845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:54:48.342494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 319
59.8%
사단·재단법인총괄 214
40.2%

상세
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
<NA>
263 
사단법인
219 
전문예술법인
 
25
한국예총제주도연합회 및 회원단체
 
12
재단법인
 
8

Length

Max length17
Median length4
Mean length4.4765478
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국예총제주도연합회 및 회원단체
2nd row한국예총제주도연합회 및 회원단체
3rd row한국예총제주도연합회 및 회원단체
4th row한국예총제주도연합회 및 회원단체
5th row한국예총제주도연합회 및 회원단체

Common Values

ValueCountFrequency (%)
<NA> 263
49.3%
사단법인 219
41.1%
전문예술법인 25
 
4.7%
한국예총제주도연합회 및 회원단체 12
 
2.3%
재단법인 8
 
1.5%
제주민예총 및 회원단체 6
 
1.1%

Length

2023-12-12T14:54:48.450906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:54:48.599188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 263
46.2%
사단법인 219
38.5%
전문예술법인 25
 
4.4%
18
 
3.2%
회원단체 18
 
3.2%
한국예총제주도연합회 12
 
2.1%
재단법인 8
 
1.4%
제주민예총 6
 
1.1%

단체명
Text

MISSING 

Distinct268
Distinct (%)99.3%
Missing263
Missing (%)49.3%
Memory size4.3 KiB
2023-12-12T14:54:48.850233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length17
Mean length9.0148148
Min length2

Characters and Unicode

Total characters2434
Distinct characters318
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique266 ?
Unique (%)98.5%

Sample

1st row㈔한국예총제주특별자치도연합회
2nd row㈔한국문인협회제주도지회
3rd row㈔한국사진작가협회제주도협의회
4th row㈔한국연극협회제주도지회
5th row㈔한국미술협회제주도지회
ValueCountFrequency (%)
제주특별자치도 5
 
1.6%
대한불교 3
 
1.0%
기념사업회 3
 
1.0%
제주 3
 
1.0%
탐라미술인협회 2
 
0.6%
탐라양씨건승문화재단 2
 
0.6%
제주민예총 2
 
0.6%
2
 
0.6%
아트 2
 
0.6%
사단법인 2
 
0.6%
Other values (288) 288
91.7%
2023-12-12T14:54:49.226575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
167
 
6.9%
147
 
6.0%
137
 
5.6%
101
 
4.1%
89
 
3.7%
58
 
2.4%
57
 
2.3%
56
 
2.3%
46
 
1.9%
44
 
1.8%
Other values (308) 1532
62.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2343
96.3%
Space Separator 44
 
1.8%
Other Symbol 14
 
0.6%
Close Punctuation 13
 
0.5%
Open Punctuation 13
 
0.5%
Uppercase Letter 3
 
0.1%
Dash Punctuation 2
 
0.1%
Other Punctuation 1
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
167
 
7.1%
147
 
6.3%
137
 
5.8%
101
 
4.3%
89
 
3.8%
58
 
2.5%
57
 
2.4%
56
 
2.4%
46
 
2.0%
44
 
1.9%
Other values (298) 1441
61.5%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
W 1
33.3%
T 1
33.3%
Space Separator
ValueCountFrequency (%)
44
100.0%
Other Symbol
ValueCountFrequency (%)
14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2357
96.8%
Common 74
 
3.0%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
167
 
7.1%
147
 
6.2%
137
 
5.8%
101
 
4.3%
89
 
3.8%
58
 
2.5%
57
 
2.4%
56
 
2.4%
46
 
2.0%
44
 
1.9%
Other values (299) 1455
61.7%
Common
ValueCountFrequency (%)
44
59.5%
) 13
 
17.6%
( 13
 
17.6%
- 2
 
2.7%
: 1
 
1.4%
3 1
 
1.4%
Latin
ValueCountFrequency (%)
I 1
33.3%
W 1
33.3%
T 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2343
96.3%
ASCII 77
 
3.2%
None 14
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
167
 
7.1%
147
 
6.3%
137
 
5.8%
101
 
4.3%
89
 
3.8%
58
 
2.5%
57
 
2.4%
56
 
2.4%
46
 
2.0%
44
 
1.9%
Other values (298) 1441
61.5%
ASCII
ValueCountFrequency (%)
44
57.1%
) 13
 
16.9%
( 13
 
16.9%
- 2
 
2.6%
: 1
 
1.3%
I 1
 
1.3%
3 1
 
1.3%
W 1
 
1.3%
T 1
 
1.3%
None
ValueCountFrequency (%)
14
100.0%

행정시
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
<NA>
263 
제주시
218 
서귀포시
51 
서울특별시
 
1

Length

Max length5
Median length4
Mean length3.5928705
Min length3

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row제주시
2nd row제주시
3rd row제주시
4th row제주시
5th row제주시

Common Values

ValueCountFrequency (%)
<NA> 263
49.3%
제주시 218
40.9%
서귀포시 51
 
9.6%
서울특별시 1
 
0.2%

Length

2023-12-12T14:54:49.395879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:54:49.527174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 263
49.3%
제주시 218
40.9%
서귀포시 51
 
9.6%
서울특별시 1
 
0.2%

읍면동
Categorical

HIGH CARDINALITY  HIGH CORRELATION 

Distinct51
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
<NA>
303 
일도이동
 
18
이도일동
 
17
연동
 
16
이도이동
 
16
Other values (46)
163 

Length

Max length4
Median length4
Mean length3.7410882
Min length2

Unique

Unique15 ?
Unique (%)2.8%

Sample

1st row이도일동
2nd row건입동
3rd row이도일동
4th row오라일동
5th row일도이동

Common Values

ValueCountFrequency (%)
<NA> 303
56.8%
일도이동 18
 
3.4%
이도일동 17
 
3.2%
연동 16
 
3.0%
이도이동 16
 
3.0%
노형동 16
 
3.0%
삼도일동 10
 
1.9%
도남동 8
 
1.5%
삼도이동 8
 
1.5%
아라이동 7
 
1.3%
Other values (41) 114
 
21.4%

Length

2023-12-12T14:54:49.683465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 303
56.8%
일도이동 18
 
3.4%
이도일동 17
 
3.2%
이도이동 16
 
3.0%
노형동 16
 
3.0%
연동 16
 
3.0%
삼도일동 10
 
1.9%
도남동 8
 
1.5%
삼도이동 8
 
1.5%
일도일동 7
 
1.3%
Other values (41) 114
 
21.4%

데이터기준일자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-08-04
270 
<NA>
263 

Length

Max length10
Median length10
Mean length7.0393996
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-04
2nd row2023-08-04
3rd row2023-08-04
4th row2023-08-04
5th row2023-08-04

Common Values

ValueCountFrequency (%)
2023-08-04 270
50.7%
<NA> 263
49.3%

Length

2023-12-12T14:54:49.859502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:54:49.971039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-04 270
50.7%
na 263
49.3%

Correlations

2023-12-12T14:54:50.050380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상세행정시읍면동
상세1.0000.0000.000
행정시0.0001.0000.997
읍면동0.0000.9971.000
2023-12-12T14:54:50.186336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터기준일자상세구분읍면동행정시
데이터기준일자1.0001.0001.0001.0001.000
상세1.0001.0001.0000.0000.000
구분1.0001.0001.0001.0001.000
읍면동1.0000.0001.0001.0000.878
행정시1.0000.0001.0000.8781.000
2023-12-12T14:54:50.328426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분상세행정시읍면동데이터기준일자
구분1.0001.0001.0001.0001.000
상세1.0001.0000.0000.0001.000
행정시1.0000.0001.0000.8781.000
읍면동1.0000.0000.8781.0001.000
데이터기준일자1.0001.0001.0001.0001.000

Missing values

2023-12-12T14:54:47.998196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:54:48.116779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분상세단체명행정시읍면동데이터기준일자
0사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국예총제주특별자치도연합회제주시이도일동2023-08-04
1사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국문인협회제주도지회제주시건입동2023-08-04
2사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국사진작가협회제주도협의회제주시이도일동2023-08-04
3사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국연극협회제주도지회제주시오라일동2023-08-04
4사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국미술협회제주도지회제주시일도이동2023-08-04
5사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국연예예술인협회제주도지회제주시일도이동2023-08-04
6사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국음악협회제주도지회제주시이도일동2023-08-04
7사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국건축가협회제주도지회제주시건입동2023-08-04
8사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국무용협회제주도지회제주시연동2023-08-04
9사단·재단법인총괄한국예총제주도연합회 및 회원단체㈔한국영화인협회제주도지회제주시일도이동2023-08-04
구분상세단체명행정시읍면동데이터기준일자
523<NA><NA><NA><NA><NA><NA>
524<NA><NA><NA><NA><NA><NA>
525<NA><NA><NA><NA><NA><NA>
526<NA><NA><NA><NA><NA><NA>
527<NA><NA><NA><NA><NA><NA>
528<NA><NA><NA><NA><NA><NA>
529<NA><NA><NA><NA><NA><NA>
530<NA><NA><NA><NA><NA><NA>
531<NA><NA><NA><NA><NA><NA>
532<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

구분상세단체명행정시읍면동데이터기준일자# duplicates
0<NA><NA><NA><NA><NA><NA>263