Overview

Dataset statistics

Number of variables4
Number of observations22
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory858.0 B
Average record size in memory39.0 B

Variable types

Text2
Numeric1
Categorical1

Dataset

Description경상남도_남해군 자원봉사단체에 대한 데이터로 단체명, 단체원수, 대표자 이름 등 항목에 대한 정보를 제공합니다.
Author경상남도 남해군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3063222

Alerts

인원 is highly overall correlated with 직위High correlation
직위 is highly overall correlated with 인원High correlation
직위 is highly imbalanced (56.1%)Imbalance
단체명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:16:21.624880
Analysis finished2023-12-10 23:16:21.993705
Duration0.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

단체명
Text

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-11T08:16:22.116804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length11.5
Mean length9.5909091
Min length5

Characters and Unicode

Total characters211
Distinct characters98
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row남해군자원봉사협의회임원단
2nd row남해기독신우회
3rd row남해청실회
4th row새남해로타리클럽
5th row천도교봉사회
ValueCountFrequency (%)
남해군자원봉사협의회임원단 1
 
3.8%
남해기독신우회 1
 
3.8%
등불남해지부 1
 
3.8%
남해군새마을부녀회 1
 
3.8%
보물섬힐링공연단 1
 
3.8%
남해군지부 1
 
3.8%
경남 1
 
3.8%
사)대한민국건국 1
 
3.8%
새남해라이온스클럽 1
 
3.8%
남해군화전농악보존회 1
 
3.8%
Other values (16) 16
61.5%
2023-12-11T08:16:22.406916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
7.6%
15
 
7.1%
15
 
7.1%
9
 
4.3%
8
 
3.8%
6
 
2.8%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (88) 126
59.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 200
94.8%
Space Separator 4
 
1.9%
Uppercase Letter 3
 
1.4%
Open Punctuation 2
 
0.9%
Close Punctuation 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
8.0%
15
 
7.5%
15
 
7.5%
9
 
4.5%
8
 
4.0%
6
 
3.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (82) 115
57.5%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
G 1
33.3%
S 1
33.3%
Space Separator
ValueCountFrequency (%)
4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 200
94.8%
Common 8
 
3.8%
Latin 3
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
8.0%
15
 
7.5%
15
 
7.5%
9
 
4.5%
8
 
4.0%
6
 
3.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (82) 115
57.5%
Common
ValueCountFrequency (%)
4
50.0%
( 2
25.0%
) 2
25.0%
Latin
ValueCountFrequency (%)
I 1
33.3%
G 1
33.3%
S 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 200
94.8%
ASCII 11
 
5.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
8.0%
15
 
7.5%
15
 
7.5%
9
 
4.5%
8
 
4.0%
6
 
3.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (82) 115
57.5%
ASCII
ValueCountFrequency (%)
4
36.4%
( 2
18.2%
) 2
18.2%
I 1
 
9.1%
G 1
 
9.1%
S 1
 
9.1%

인원
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)81.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean91.136364
Minimum12
Maximum504
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size330.0 B
2023-12-11T08:16:22.509892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile13.15
Q120
median38
Q369.75
95-th percentile318.95
Maximum504
Range492
Interquartile range (IQR)49.75

Descriptive statistics

Standard deviation128.15255
Coefficient of variation (CV)1.4061626
Kurtosis4.5701485
Mean91.136364
Median Absolute Deviation (MAD)22
Skewness2.2256691
Sum2005
Variance16423.076
MonotonicityNot monotonic
2023-12-11T08:16:22.602162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
20 3
 
13.6%
60 3
 
13.6%
13 1
 
4.5%
25 1
 
4.5%
16 1
 
4.5%
32 1
 
4.5%
222 1
 
4.5%
12 1
 
4.5%
23 1
 
4.5%
43 1
 
4.5%
Other values (8) 8
36.4%
ValueCountFrequency (%)
12 1
 
4.5%
13 1
 
4.5%
16 1
 
4.5%
17 1
 
4.5%
20 3
13.6%
23 1
 
4.5%
25 1
 
4.5%
32 1
 
4.5%
33 1
 
4.5%
43 1
 
4.5%
ValueCountFrequency (%)
504 1
 
4.5%
320 1
 
4.5%
299 1
 
4.5%
222 1
 
4.5%
80 1
 
4.5%
73 1
 
4.5%
60 3
13.6%
53 1
 
4.5%
43 1
 
4.5%
33 1
 
4.5%

직위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size308.0 B
회장
20 
지회장
 
2

Length

Max length3
Median length2
Mean length2.0909091
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row회장
2nd row회장
3rd row회장
4th row회장
5th row회장

Common Values

ValueCountFrequency (%)
회장 20
90.9%
지회장 2
 
9.1%

Length

2023-12-11T08:16:22.700490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:16:22.785033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
회장 20
90.9%
지회장 2
 
9.1%
Distinct21
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-11T08:16:22.913677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters66
Distinct characters40
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)90.9%

Sample

1st row양태종
2nd row박종철
3rd row윤지현
4th row류옥근
5th row이영미
ValueCountFrequency (%)
양태종 2
 
9.1%
최윤수 1
 
4.5%
박미선 1
 
4.5%
곽영순 1
 
4.5%
임양심 1
 
4.5%
한일균 1
 
4.5%
김경성 1
 
4.5%
이나경 1
 
4.5%
송홍주 1
 
4.5%
김순덕 1
 
4.5%
Other values (11) 11
50.0%
2023-12-11T08:16:23.199261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
 
7.6%
4
 
6.1%
4
 
6.1%
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (30) 34
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
7.6%
4
 
6.1%
4
 
6.1%
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (30) 34
51.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
7.6%
4
 
6.1%
4
 
6.1%
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (30) 34
51.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
 
7.6%
4
 
6.1%
4
 
6.1%
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (30) 34
51.5%

Interactions

2023-12-11T08:16:21.778647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:16:23.304097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단체명인원직위성 명
단체명1.0001.0001.0001.000
인원1.0001.0000.7710.000
직위1.0000.7711.0000.000
성 명1.0000.0000.0001.000
2023-12-11T08:16:23.386477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인원직위
인원1.0000.509
직위0.5091.000

Missing values

2023-12-11T08:16:21.898276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:16:21.965163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

단체명인원직위성 명
0남해군자원봉사협의회임원단13회장양태종
1남해기독신우회33회장박종철
2남해청실회53회장윤지현
3새남해로타리클럽80회장류옥근
4천도교봉사회17회장이영미
5한국자유총연맹남해군지회320지회장양태종
6손사랑서금요법봉사단20회장김두엽
7남해로타리클럽73회장이양기
8재향군인여성회299회장이양옥
9바르게살기운동남해군협의회504회장최태정
단체명인원직위성 명
12(사)한국연예예술인총연합회 남해지회43지회장최윤수
13SGI 행복드림봉사회60회장김순덕
14남해신협두손모아봉사단23회장송홍주
15남해군화전농악보존회20회장이나경
16새남해라이온스클럽60회장김경성
17(사)대한민국건국 경남 남해군지부60회장한일균
18보물섬힐링공연단12회장임양심
19남해군새마을부녀회222회장곽영순
20등불남해지부32회장박미선
21남해군그린리더협의회16회장정준순