Overview

Dataset statistics

Number of variables5
Number of observations57
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory44.3 B

Variable types

Numeric2
Text2
Categorical1

Dataset

Description인천광역시의 전문예술·법인 단체 지정 현황입니다. 매년 하반기 심의를 통해 지정하여, 단체의 전문성을 인정하고 제도적·간접적 지원을 통하여 자생력과 경쟁력을 갖춘 단체를 육성하고자 합니다.
URLhttps://www.data.go.kr/data/15103892/fileData.do

Alerts

구분 is highly overall correlated with 지정연도 and 1 other fieldsHigh correlation
지정연도 is highly overall correlated with 구분High correlation
단체구분 is highly overall correlated with 구분High correlation
구분 has unique valuesUnique
단체명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:37:54.708366
Analysis finished2023-12-12 07:37:55.435541
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29
Minimum1
Maximum57
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size645.0 B
2023-12-12T16:37:55.499829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.8
Q115
median29
Q343
95-th percentile54.2
Maximum57
Range56
Interquartile range (IQR)28

Descriptive statistics

Standard deviation16.598193
Coefficient of variation (CV)0.57235147
Kurtosis-1.2
Mean29
Median Absolute Deviation (MAD)14
Skewness0
Sum1653
Variance275.5
MonotonicityStrictly increasing
2023-12-12T16:37:55.630727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.8%
44 1
 
1.8%
32 1
 
1.8%
33 1
 
1.8%
34 1
 
1.8%
35 1
 
1.8%
36 1
 
1.8%
37 1
 
1.8%
38 1
 
1.8%
39 1
 
1.8%
Other values (47) 47
82.5%
ValueCountFrequency (%)
1 1
1.8%
2 1
1.8%
3 1
1.8%
4 1
1.8%
5 1
1.8%
6 1
1.8%
7 1
1.8%
8 1
1.8%
9 1
1.8%
10 1
1.8%
ValueCountFrequency (%)
57 1
1.8%
56 1
1.8%
55 1
1.8%
54 1
1.8%
53 1
1.8%
52 1
1.8%
51 1
1.8%
50 1
1.8%
49 1
1.8%
48 1
1.8%

단체명
Text

UNIQUE 

Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-12T16:37:55.884624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length10.754386
Min length3

Characters and Unicode

Total characters613
Distinct characters161
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)100.0%

Sample

1st row(사)인천여성미술비엔날레 조직위원회
2nd row(사)인천음악문화원(인천오페라단)
3rd row갤러리 진
4th row교육극단 보물상자
5th row아트커뮤니티 아비투스
ValueCountFrequency (%)
극단 5
 
5.5%
인천광역시지회 4
 
4.4%
미추홀 2
 
2.2%
인천지회 2
 
2.2%
퓨전국악단 1
 
1.1%
사)한국음악협회 1
 
1.1%
사)인천민예총 1
 
1.1%
서구문화재단 1
 
1.1%
재)인천광역시 1
 
1.1%
은율탈춤보존회 1
 
1.1%
Other values (72) 72
79.1%
2023-12-12T16:37:56.258383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
5.5%
32
 
5.2%
31
 
5.1%
26
 
4.2%
( 25
 
4.1%
) 25
 
4.1%
24
 
3.9%
24
 
3.9%
17
 
2.8%
17
 
2.8%
Other values (151) 358
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 524
85.5%
Space Separator 34
 
5.5%
Open Punctuation 25
 
4.1%
Close Punctuation 25
 
4.1%
Uppercase Letter 4
 
0.7%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
6.1%
31
 
5.9%
26
 
5.0%
24
 
4.6%
24
 
4.6%
17
 
3.2%
17
 
3.2%
13
 
2.5%
12
 
2.3%
11
 
2.1%
Other values (144) 317
60.5%
Uppercase Letter
ValueCountFrequency (%)
I 2
50.0%
R 1
25.0%
M 1
25.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 524
85.5%
Common 85
 
13.9%
Latin 4
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
6.1%
31
 
5.9%
26
 
5.0%
24
 
4.6%
24
 
4.6%
17
 
3.2%
17
 
3.2%
13
 
2.5%
12
 
2.3%
11
 
2.1%
Other values (144) 317
60.5%
Common
ValueCountFrequency (%)
34
40.0%
( 25
29.4%
) 25
29.4%
- 1
 
1.2%
Latin
ValueCountFrequency (%)
I 2
50.0%
R 1
25.0%
M 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 524
85.5%
ASCII 89
 
14.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34
38.2%
( 25
28.1%
) 25
28.1%
I 2
 
2.2%
R 1
 
1.1%
M 1
 
1.1%
- 1
 
1.1%
Hangul
ValueCountFrequency (%)
32
 
6.1%
31
 
5.9%
26
 
5.0%
24
 
4.6%
24
 
4.6%
17
 
3.2%
17
 
3.2%
13
 
2.5%
12
 
2.3%
11
 
2.1%
Other values (144) 317
60.5%

단체구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size588.0 B
임의단체
30 
법인
22 
법 인

Length

Max length4
Median length4
Mean length3.1403509
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row법인
2nd row법인
3rd row임의단체
4th row임의단체
5th row임의단체

Common Values

ValueCountFrequency (%)
임의단체 30
52.6%
법인 22
38.6%
법 인 5
 
8.8%

Length

2023-12-12T16:37:56.412910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:37:56.529431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
임의단체 30
48.4%
법인 22
35.5%
5
 
8.1%
5
 
8.1%
Distinct55
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-12T16:37:56.789074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters171
Distinct characters77
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)93.0%

Sample

1st row오정숙
2nd row황건식
3rd row김진수
4th row오영일
5th row장구보
ValueCountFrequency (%)
최병국 2
 
3.5%
김진수 2
 
3.5%
지운하 1
 
1.8%
김원범 1
 
1.8%
이재호 1
 
1.8%
서성식 1
 
1.8%
오정숙 1
 
1.8%
김경아 1
 
1.8%
김주성 1
 
1.8%
한규식 1
 
1.8%
Other values (45) 45
78.9%
2023-12-12T16:37:57.191999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
9.4%
8
 
4.7%
8
 
4.7%
7
 
4.1%
6
 
3.5%
5
 
2.9%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (67) 104
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 171
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
9.4%
8
 
4.7%
8
 
4.7%
7
 
4.1%
6
 
3.5%
5
 
2.9%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (67) 104
60.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 171
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
9.4%
8
 
4.7%
8
 
4.7%
7
 
4.1%
6
 
3.5%
5
 
2.9%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (67) 104
60.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 171
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
9.4%
8
 
4.7%
8
 
4.7%
7
 
4.1%
6
 
3.5%
5
 
2.9%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (67) 104
60.8%

지정연도
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)19.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2015.4561
Minimum2011
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size645.0 B
2023-12-12T16:37:57.335003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2011
5-th percentile2011
Q12012
median2015
Q32019
95-th percentile2021.2
Maximum2022
Range11
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.9054457
Coefficient of variation (CV)0.0019377478
Kurtosis-1.4457195
Mean2015.4561
Median Absolute Deviation (MAD)3
Skewness0.31806312
Sum114881
Variance15.252506
MonotonicityIncreasing
2023-12-12T16:37:57.472110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2011 12
21.1%
2012 8
14.0%
2013 7
12.3%
2021 7
12.3%
2017 6
10.5%
2015 4
 
7.0%
2018 3
 
5.3%
2019 3
 
5.3%
2020 3
 
5.3%
2022 3
 
5.3%
ValueCountFrequency (%)
2011 12
21.1%
2012 8
14.0%
2013 7
12.3%
2015 4
 
7.0%
2016 1
 
1.8%
2017 6
10.5%
2018 3
 
5.3%
2019 3
 
5.3%
2020 3
 
5.3%
2021 7
12.3%
ValueCountFrequency (%)
2022 3
 
5.3%
2021 7
12.3%
2020 3
 
5.3%
2019 3
 
5.3%
2018 3
 
5.3%
2017 6
10.5%
2016 1
 
1.8%
2015 4
7.0%
2013 7
12.3%
2012 8
14.0%

Interactions

2023-12-12T16:37:55.136400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:37:54.996944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:37:55.205142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:37:55.068647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:37:57.556943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분단체명단체구분대 표 자지정연도
구분1.0001.0000.7150.9700.859
단체명1.0001.0001.0001.0001.000
단체구분0.7151.0001.0000.9360.723
대 표 자0.9701.0000.9361.0001.000
지정연도0.8591.0000.7231.0001.000
2023-12-12T16:37:57.658993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분지정연도단체구분
구분1.0000.9910.516
지정연도0.9911.0000.429
단체구분0.5160.4291.000

Missing values

2023-12-12T16:37:55.306398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:37:55.401324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분단체명단체구분대 표 자지정연도
01(사)인천여성미술비엔날레 조직위원회법인오정숙2011
12(사)인천음악문화원(인천오페라단)법인황건식2011
23갤러리 진임의단체김진수2011
34교육극단 보물상자임의단체오영일2011
45아트커뮤니티 아비투스임의단체장구보2011
56극단 마임임의단체최규호2011
67극단 십년후임의단체최원영2011
78미추홀 오페라단임의단체이도형2011
89미추홀 요들단임의단체김진구2011
910사랑극단 꼬마세상임의단체김일준2011
구분단체명단체구분대 표 자지정연도
4748극단MIR 레퍼토리임의단체이재상2021
4849위드어스 예술단임의단체김성민2021
4950(사)한국캘리그라피창작협회법 인박혁남2021
5051극단 아토임의단체김정열2021
5152이보비태 챔버 오케스트라임의단체민경욱2021
5253극단 아트팩토리임의단체김원범2021
5354(사)인천남사당놀이보존회법 인지운하2021
5455(사)국제장애인문화교류 인천광역시협회법 인서성식2022
5556재단법인 연수문화재단법 인이재호2022
5657(사)인천음악콘텐츠협회법 인우정주2022