Overview

Dataset statistics

Number of variables7
Number of observations630
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory35.8 KiB
Average record size in memory58.2 B

Variable types

Numeric2
Text1
Categorical4

Dataset

Description대구광역시 북구_자원봉사단체_20230126
Author대구광역시 북구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15030609&dataSetDetailId=150306091a6e15bd627cd_202001212111&provdMethod=FILE

Alerts

소속기관 has constant value ""Constant
데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 회원수High correlation
회원수 is highly overall correlated with 연번High correlation
상태 is highly imbalanced (96.9%)Imbalance
연번 has unique valuesUnique
단체명 has unique valuesUnique

Reproduction

Analysis started2024-04-20 18:47:00.180629
Analysis finished2024-04-20 18:47:02.464815
Duration2.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct630
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean315.5
Minimum1
Maximum630
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.7 KiB
2024-04-21T03:47:02.668475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile32.45
Q1158.25
median315.5
Q3472.75
95-th percentile598.55
Maximum630
Range629
Interquartile range (IQR)314.5

Descriptive statistics

Standard deviation182.00962
Coefficient of variation (CV)0.5768926
Kurtosis-1.2
Mean315.5
Median Absolute Deviation (MAD)157.5
Skewness0
Sum198765
Variance33127.5
MonotonicityStrictly increasing
2024-04-21T03:47:03.110848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
425 1
 
0.2%
418 1
 
0.2%
419 1
 
0.2%
420 1
 
0.2%
421 1
 
0.2%
422 1
 
0.2%
423 1
 
0.2%
424 1
 
0.2%
426 1
 
0.2%
Other values (620) 620
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
630 1
0.2%
629 1
0.2%
628 1
0.2%
627 1
0.2%
626 1
0.2%
625 1
0.2%
624 1
0.2%
623 1
0.2%
622 1
0.2%
621 1
0.2%

단체명
Text

UNIQUE 

Distinct630
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2024-04-21T03:47:03.992047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length10.24127
Min length2

Characters and Unicode

Total characters6452
Distinct characters447
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique630 ?
Unique (%)100.0%

Sample

1st row경북대학교 미담장학회
2nd row대구과학대학교 사회봉사센터
3rd row클린월드봉사단(대구북구하나님의교회)
4th row한국자유총연맹 대구북구지회
5th row한국전력공사
ValueCountFrequency (%)
지역자율방재단 23
 
2.6%
자율방범대 22
 
2.5%
새마을부녀회 14
 
1.6%
봉사단 13
 
1.5%
새마을협의회 12
 
1.3%
바르게 9
 
1.0%
북부경찰서 8
 
0.9%
자유총연맹 7
 
0.8%
침산1동 5
 
0.6%
무태조야동 5
 
0.6%
Other values (686) 773
86.8%
2024-04-21T03:47:04.988186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
295
 
4.6%
261
 
4.0%
217
 
3.4%
217
 
3.4%
206
 
3.2%
204
 
3.2%
189
 
2.9%
178
 
2.8%
131
 
2.0%
116
 
1.8%
Other values (437) 4438
68.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5625
87.2%
Space Separator 261
 
4.0%
Decimal Number 144
 
2.2%
Close Punctuation 110
 
1.7%
Open Punctuation 106
 
1.6%
Uppercase Letter 95
 
1.5%
Lowercase Letter 88
 
1.4%
Other Punctuation 19
 
0.3%
Dash Punctuation 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
295
 
5.2%
217
 
3.9%
217
 
3.9%
206
 
3.7%
204
 
3.6%
189
 
3.4%
178
 
3.2%
131
 
2.3%
116
 
2.1%
113
 
2.0%
Other values (375) 3759
66.8%
Uppercase Letter
ValueCountFrequency (%)
S 10
 
10.5%
C 9
 
9.5%
G 9
 
9.5%
H 7
 
7.4%
A 6
 
6.3%
D 5
 
5.3%
K 5
 
5.3%
R 5
 
5.3%
O 5
 
5.3%
M 4
 
4.2%
Other values (12) 30
31.6%
Lowercase Letter
ValueCountFrequency (%)
e 12
13.6%
i 8
 
9.1%
l 7
 
8.0%
a 6
 
6.8%
o 6
 
6.8%
n 5
 
5.7%
t 5
 
5.7%
w 5
 
5.7%
r 5
 
5.7%
u 4
 
4.5%
Other values (10) 25
28.4%
Decimal Number
ValueCountFrequency (%)
1 50
34.7%
2 46
31.9%
3 17
 
11.8%
0 11
 
7.6%
4 6
 
4.2%
9 5
 
3.5%
5 3
 
2.1%
6 2
 
1.4%
8 2
 
1.4%
7 2
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 10
52.6%
& 5
26.3%
; 2
 
10.5%
% 1
 
5.3%
/ 1
 
5.3%
Space Separator
ValueCountFrequency (%)
261
100.0%
Close Punctuation
ValueCountFrequency (%)
) 110
100.0%
Open Punctuation
ValueCountFrequency (%)
( 106
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5622
87.1%
Common 644
 
10.0%
Latin 183
 
2.8%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
295
 
5.2%
217
 
3.9%
217
 
3.9%
206
 
3.7%
204
 
3.6%
189
 
3.4%
178
 
3.2%
131
 
2.3%
116
 
2.1%
113
 
2.0%
Other values (372) 3756
66.8%
Latin
ValueCountFrequency (%)
e 12
 
6.6%
S 10
 
5.5%
C 9
 
4.9%
G 9
 
4.9%
i 8
 
4.4%
l 7
 
3.8%
H 7
 
3.8%
a 6
 
3.3%
o 6
 
3.3%
A 6
 
3.3%
Other values (32) 103
56.3%
Common
ValueCountFrequency (%)
261
40.5%
) 110
17.1%
( 106
16.5%
1 50
 
7.8%
2 46
 
7.1%
3 17
 
2.6%
0 11
 
1.7%
. 10
 
1.6%
4 6
 
0.9%
9 5
 
0.8%
Other values (10) 22
 
3.4%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5622
87.1%
ASCII 827
 
12.8%
CJK 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
295
 
5.2%
217
 
3.9%
217
 
3.9%
206
 
3.7%
204
 
3.6%
189
 
3.4%
178
 
3.2%
131
 
2.3%
116
 
2.1%
113
 
2.0%
Other values (372) 3756
66.8%
ASCII
ValueCountFrequency (%)
261
31.6%
) 110
13.3%
( 106
12.8%
1 50
 
6.0%
2 46
 
5.6%
3 17
 
2.1%
e 12
 
1.5%
0 11
 
1.3%
S 10
 
1.2%
. 10
 
1.2%
Other values (52) 194
23.5%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

회원수
Real number (ℝ)

HIGH CORRELATION 

Distinct111
Distinct (%)17.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.257143
Minimum1
Maximum833
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.7 KiB
2024-04-21T03:47:05.227338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6
Q114
median21
Q334
95-th percentile96.55
Maximum833
Range832
Interquartile range (IQR)20

Descriptive statistics

Standard deviation56.230877
Coefficient of variation (CV)1.6414351
Kurtosis97.736903
Mean34.257143
Median Absolute Deviation (MAD)9
Skewness8.4533148
Sum21582
Variance3161.9115
MonotonicityDecreasing
2024-04-21T03:47:05.467976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18 27
 
4.3%
25 25
 
4.0%
13 25
 
4.0%
20 24
 
3.8%
17 22
 
3.5%
9 21
 
3.3%
22 21
 
3.3%
14 20
 
3.2%
19 18
 
2.9%
21 18
 
2.9%
Other values (101) 409
64.9%
ValueCountFrequency (%)
1 2
 
0.3%
2 3
 
0.5%
3 8
 
1.3%
4 5
 
0.8%
5 7
 
1.1%
6 10
1.6%
7 13
2.1%
8 16
2.5%
9 21
3.3%
10 17
2.7%
ValueCountFrequency (%)
833 1
0.2%
678 1
0.2%
438 1
0.2%
326 1
0.2%
288 1
0.2%
273 1
0.2%
269 1
0.2%
233 1
0.2%
180 1
0.2%
168 1
0.2%

소속기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
대구광역시 북구
630 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시 북구
2nd row대구광역시 북구
3rd row대구광역시 북구
4th row대구광역시 북구
5th row대구광역시 북구

Common Values

ValueCountFrequency (%)
대구광역시 북구 630
100.0%

Length

2024-04-21T03:47:05.686915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:47:05.861909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 630
50.0%
북구 630
50.0%

상태
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
활동
628 
비활동
 
2

Length

Max length3
Median length2
Mean length2.0031746
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row활동
2nd row활동
3rd row활동
4th row활동
5th row활동

Common Values

ValueCountFrequency (%)
활동 628
99.7%
비활동 2
 
0.3%

Length

2024-04-21T03:47:06.074663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:47:06.261799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
활동 628
99.7%
비활동 2
 
0.3%

단체구분
Categorical

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
법인
365 
나눔단체
257 
가상단체
 
7
학교
 
1

Length

Max length4
Median length2
Mean length2.8380952
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row나눔단체
2nd row나눔단체
3rd row나눔단체
4th row나눔단체
5th row법인

Common Values

ValueCountFrequency (%)
법인 365
57.9%
나눔단체 257
40.8%
가상단체 7
 
1.1%
학교 1
 
0.2%

Length

2024-04-21T03:47:06.630317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:47:06.983324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
법인 365
57.9%
나눔단체 257
40.8%
가상단체 7
 
1.1%
학교 1
 
0.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2023-01-26
630 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-01-26
2nd row2023-01-26
3rd row2023-01-26
4th row2023-01-26
5th row2023-01-26

Common Values

ValueCountFrequency (%)
2023-01-26 630
100.0%

Length

2024-04-21T03:47:07.344537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:47:07.648436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-01-26 630
100.0%

Interactions

2024-04-21T03:47:01.167654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:47:00.654964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:47:01.610819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:47:00.914046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:47:07.828719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번회원수상태단체구분
연번1.0000.5170.0000.000
회원수0.5171.0000.0000.135
상태0.0000.0001.0000.000
단체구분0.0000.1350.0001.000
2024-04-21T03:47:08.072090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단체구분상태
단체구분1.0000.000
상태0.0001.000
2024-04-21T03:47:08.309426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번회원수상태단체구분
연번1.000-1.0000.0000.000
회원수-1.0001.0000.0000.093
상태0.0000.0001.0000.000
단체구분0.0000.0930.0001.000

Missing values

2024-04-21T03:47:01.939852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:47:02.317184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번단체명회원수소속기관상태단체구분데이터기준일자
01경북대학교 미담장학회833대구광역시 북구활동나눔단체2023-01-26
12대구과학대학교 사회봉사센터678대구광역시 북구활동나눔단체2023-01-26
23클린월드봉사단(대구북구하나님의교회)438대구광역시 북구활동나눔단체2023-01-26
34한국자유총연맹 대구북구지회326대구광역시 북구활동나눔단체2023-01-26
45한국전력공사288대구광역시 북구활동법인2023-01-26
56칠곡가톨릭병원273대구광역시 북구활동법인2023-01-26
67성광고 샤프론봉사단(학부모/학생)269대구광역시 북구활동법인2023-01-26
78칠곡경대병원자원봉사자(실적등록)233대구광역시 북구활동가상단체2023-01-26
89월드컵한마음봉사단180대구광역시 북구활동나눔단체2023-01-26
910북대구세무서 봉사단168대구광역시 북구활동법인2023-01-26
연번단체명회원수소속기관상태단체구분데이터기준일자
620621강북지구대 조야자율방범대3대구광역시 북구활동법인2023-01-26
621622공직재능나눔봉사단(대구북구)3대구광역시 북구활동나눔단체2023-01-26
622623한국 팔공나눔 장학회3대구광역시 북구활동나눔단체2023-01-26
623624한국노년봉사단3대구광역시 북구활동나눔단체2023-01-26
624625관문동 지역사회보장협의체3대구광역시 북구활동나눔단체2023-01-26
625626경상외식산업 봉사단2대구광역시 북구활동법인2023-01-26
626627소리새색소폰봉사단2대구광역시 북구활동법인2023-01-26
627628오삼오봉사단2대구광역시 북구활동나눔단체2023-01-26
628629매천시장여성봉사회1대구광역시 북구활동나눔단체2023-01-26
629630D.U.C1대구광역시 북구활동나눔단체2023-01-26