Overview

Dataset statistics

Number of variables: 5
Number of observations: 34
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 45.9 B

Variable types

Text: 1
Numeric: 2
Boolean: 1
DateTime: 1

Dataset

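Description: Provides data on club room usage applications in Yongsan-gu, Seoul (club name, total number of members, number of actual users, whether a projector is used, and application date).
Author: 서울특별시 용산구 (Yongsan-gu, Seoul)
URL: https://www.data.go.kr/data/15071289/fileData.do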

Alerts

전체회원수 is highly overall correlated with 실사용인원수 (High correlation)
실사용인원수 is highly overall correlated with 전체회원수 (High correlation)
신청일 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 03:39:24.122738
Analysis finished: 2023-12-12 03:39:25.087340
Duration: 0.96 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

동아리명
Text

Distinct: 17
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 12
Median length: 9.5
Mean length: 7.5588235
Min length: 3

Characters and Unicode

Total characters: 257
Distinct characters: 64
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
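
As a rough illustration of how such character properties can be read off, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module for two of the sample values listed in this report. The helper name `category_counts` and the choice of sample strings are assumptions made for illustration, not part of the profiling tool. The standard library exposes general categories but not scripts or blocks, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Long names for the general categories that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code for categories not named above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Two club names taken from the sample rows of this report.
samples = ["한울림 클래식 기타", "이야기나라&독서토론"]
counts = category_counts(samples)
for name, n in counts.most_common():
    print(name, n)
```

Hangul syllables fall under Other Letter, the ASCII space under Space Separator, and `&` under Other Punctuation, which are the three categories the report lists.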

Unique

Unique: 8
Unique (%): 23.5%

Sample

1st row: 이공일팔구민강사
2nd row: 글샘누리
3rd row: 한울림 클래식 기타
4th row: 우리옛놀이연구소
5th row: 이야기나라&독서토론

Words

Value | Count | Frequency (%)
이야기나라&독서토론 | 6 | 12.0%
용산규방 | 4 | 8.0%
한울림 | 4 | 8.0%
용산걷지모 | 3 | 6.0%
우쿨렐레 | 3 | 6.0%
(value not recovered) | 3 | 6.0%
아미고 | 3 | 6.0%
클래식 | 2 | 4.0%
기타 | 2 | 4.0%
이공일팔구민강사 | 2 | 4.0%
Other values (15) | 18 | 36.0%

Most occurring characters

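Value | Count | Frequency (%)
(space) | 16 | 6.2%
(character not recovered) | 13 | 5.1%
(character not recovered) | 10 | 3.9%
(character not recovered) | 9 | 3.5%
(character not recovered) | 9 | 3.5%
(character not recovered) | 8 | 3.1%
(character not recovered) | 7 | 2.7%
(character not recovered) | 6 | 2.3%
(character not recovered) | 6 | 2.3%
(character not recovered) | 6 | 2.3%
Other values (54) | 167 | 65.0%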

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 235 | 91.4%
Space Separator | 16 | 6.2%
Other Punctuation | 6 | 2.3%

Most frequent character per category

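Other Letter
Value | Count | Frequency (%)
(character not recovered) | 13 | 5.5%
(character not recovered) | 10 | 4.3%
(character not recovered) | 9 | 3.8%
(character not recovered) | 9 | 3.8%
(character not recovered) | 8 | 3.4%
(character not recovered) | 7 | 3.0%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
Other values (52) | 155 | 66.0%

Space Separator
Value | Count | Frequency (%)
(space) | 16 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 6 | 100.0%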

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 235 | 91.4%
Common | 22 | 8.6%

Most frequent character per script

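Hangul
Value | Count | Frequency (%)
(character not recovered) | 13 | 5.5%
(character not recovered) | 10 | 4.3%
(character not recovered) | 9 | 3.8%
(character not recovered) | 9 | 3.8%
(character not recovered) | 8 | 3.4%
(character not recovered) | 7 | 3.0%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
Other values (52) | 155 | 66.0%

Common
Value | Count | Frequency (%)
(space) | 16 | 72.7%
& | 6 | 27.3%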

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 235 | 91.4%
ASCII | 22 | 8.6%

Most frequent character per block

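ASCII
Value | Count | Frequency (%)
(space) | 16 | 72.7%
& | 6 | 27.3%

Hangul
Value | Count | Frequency (%)
(character not recovered) | 13 | 5.5%
(character not recovered) | 10 | 4.3%
(character not recovered) | 9 | 3.8%
(character not recovered) | 9 | 3.8%
(character not recovered) | 8 | 3.4%
(character not recovered) | 7 | 3.0%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
(character not recovered) | 6 | 2.6%
Other values (52) | 155 | 66.0%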

전체회원수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 13
Distinct (%): 38.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.029412
Minimum: 6
Maximum: 26
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 6
5th percentile: 6
Q1: 10
Median: 10
Q3: 14
95th percentile: 20.45
Maximum: 26
Range: 20
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 4.8020532
Coefficient of variation (CV): 0.39919269
Kurtosis: 1.9367416
Mean: 12.029412
Median Absolute Deviation (MAD): 2.5
Skewness: 1.292675
Sum: 409
Variance: 23.059715
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=13)
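
Values by frequency
Value | Count | Frequency (%)
10 | 10 | 29.4%
6 | 4 | 11.8%
14 | 3 | 8.8%
8 | 3 | 8.8%
11 | 3 | 8.8%
13 | 2 | 5.9%
16 | 2 | 5.9%
18 | 2 | 5.9%
15 | 1 | 2.9%
17 | 1 | 2.9%
Other values (3) | 3 | 8.8%

10 smallest values
Value | Count | Frequency (%)
6 | 4 | 11.8%
8 | 3 | 8.8%
9 | 1 | 2.9%
10 | 10 | 29.4%
11 | 3 | 8.8%
13 | 2 | 5.9%
14 | 3 | 8.8%
15 | 1 | 2.9%
16 | 2 | 5.9%
17 | 1 | 2.9%

10 largest values
Value | Count | Frequency (%)
26 | 1 | 2.9%
25 | 1 | 2.9%
18 | 2 | 5.9%
17 | 1 | 2.9%
16 | 2 | 5.9%
15 | 1 | 2.9%
14 | 3 | 8.8%
13 | 2 | 5.9%
11 | 3 | 8.8%
10 | 10 | 29.4%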
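
The quantile and descriptive statistics reported for 전체회원수 can be cross-checked from the frequency listings alone. The sketch below rebuilds the 34 observations (the three "Other values" are 9, 25 and 26, which appear in the smallest and largest listings) and recomputes several figures with the Python standard library. It assumes sample variance with an n - 1 denominator and linearly interpolated percentiles; those choices reproduce the reported numbers but are not confirmed by the report itself, and the helper `percentile` is written here for illustration.

```python
import math
import statistics

# Value -> count for 전체회원수, transcribed from the frequency listings.
freq = {6: 4, 8: 3, 9: 1, 10: 10, 11: 3, 13: 2, 14: 3,
        15: 1, 16: 2, 17: 1, 18: 2, 25: 1, 26: 1}
data = sorted(v for v, c in freq.items() for _ in range(c))

def percentile(sorted_values, p):
    """Percentile with linear interpolation between the two closest ranks."""
    pos = (len(sorted_values) - 1) * p / 100
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)

mean = statistics.fmean(data)
variance = statistics.variance(data)   # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                        # coefficient of variation
median = statistics.median(data)
mad = statistics.median(abs(x - median) for x in data)
q1, q3, p95 = (percentile(data, p) for p in (25, 75, 95))

print(len(data), sum(data), round(mean, 6), round(variance, 6))
print(q1, median, q3, round(p95, 2), q3 - q1, mad)
```

Skewness and kurtosis are left out because the report does not state which estimator it uses.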

실사용인원수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 13
Distinct (%): 38.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.941176
Minimum: 5
Maximum: 26
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 5
5th percentile: 6
Q1: 8
Median: 10
Q3: 13
95th percentile: 19.15
Maximum: 26
Range: 21
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 4.6511072
Coefficient of variation (CV): 0.4251012
Kurtosis: 4.1573355
Mean: 10.941176
Median Absolute Deviation (MAD): 2
Skewness: 1.8098987
Sum: 372
Variance: 21.632799
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=13)
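
Values by frequency
Value | Count | Frequency (%)
10 | 9 | 26.5%
8 | 6 | 17.6%
6 | 4 | 11.8%
14 | 4 | 11.8%
13 | 2 | 5.9%
9 | 2 | 5.9%
12 | 1 | 2.9%
15 | 1 | 2.9%
5 | 1 | 2.9%
25 | 1 | 2.9%
Other values (3) | 3 | 8.8%

10 smallest values
Value | Count | Frequency (%)
5 | 1 | 2.9%
6 | 4 | 11.8%
8 | 6 | 17.6%
9 | 2 | 5.9%
10 | 9 | 26.5%
11 | 1 | 2.9%
12 | 1 | 2.9%
13 | 2 | 5.9%
14 | 4 | 11.8%
15 | 1 | 2.9%

10 largest values
Value | Count | Frequency (%)
26 | 1 | 2.9%
25 | 1 | 2.9%
16 | 1 | 2.9%
15 | 1 | 2.9%
14 | 4 | 11.8%
13 | 2 | 5.9%
12 | 1 | 2.9%
11 | 1 | 2.9%
10 | 9 | 26.5%
9 | 2 | 5.9%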
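
The histogram for 실사용인원수 is described as using 13 fixed-size bins, but the image itself is not preserved. As a hedged sketch of what such binning looks like, the code below splits the range from the minimum to the maximum into 13 equal-width bins and counts the observations rebuilt from the frequency listings. The function name `fixed_width_histogram` and the exact edge placement are assumptions; the profiling tool may place its bin edges differently.

```python
# Value -> count for 실사용인원수, transcribed from the frequency listings.
freq = {5: 1, 6: 4, 8: 6, 9: 2, 10: 9, 11: 1, 12: 1, 13: 2,
        14: 4, 15: 1, 16: 1, 25: 1, 26: 1}

def fixed_width_histogram(freq, bins):
    """Return (bin_edges, counts) for equal-width bins spanning min..max."""
    lo, hi = min(freq), max(freq)
    width = (hi - lo) / bins
    counts = [0] * bins
    for value, n in freq.items():
        # The maximum sits exactly on the last edge, so clamp it into the last bin.
        index = min(int((value - lo) / width), bins - 1)
        counts[index] += n
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts

edges, counts = fixed_width_histogram(freq, bins=13)
for left, right, n in zip(edges, edges[1:], counts):
    print(f"{left:5.2f} to {right:5.2f}  {'#' * n}")
```

With these edges, the two largest observations (25 and 26) land alone in the last bin, separated from the rest by five empty bins, which is consistent with the positive skewness reported above.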
빔사용여부
Boolean

Distinct: 2
Distinct (%): 5.9%
Missing: 0
Missing (%): 0.0%
Memory size: 166.0 B

Value | Count | Frequency (%)
False | 20 | 58.8%
True | 14 | 41.2%

신청일
Date

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
Minimum: 2018-01-16 20:48:00
Maximum: 2020-08-11 10:14:00

Histogram with fixed size bins (bins=34)

Interactions

[Four interaction plots; images not preserved]

Correlations

Variable | 동아리명 | 전체회원수 | 실사용인원수 | 빔사용여부 | 신청일
동아리명 | 1.000 | 0.866 | 0.876 | 0.836 | 1.000
전체회원수 | 0.866 | 1.000 | 0.750 | 0.501 | 1.000
실사용인원수 | 0.876 | 0.750 | 1.000 | 0.508 | 1.000
빔사용여부 | 0.836 | 0.501 | 0.508 | 1.000 | 1.000
신청일 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 전체회원수 | 실사용인원수 | 빔사용여부
전체회원수 | 1.000 | 0.685 | 0.307
실사용인원수 | 0.685 | 1.000 | 0.497
빔사용여부 | 0.307 | 0.497 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

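First rows
Index | 동아리명 | 전체회원수 | 실사용인원수 | 빔사용여부 | 신청일
0 | 이공일팔구민강사 | 10 | 8 | N | 2020-08-11 10:14
1 | 글샘누리 | 10 | 8 | Y | 2020-08-10 15:43
2 | 한울림 클래식 기타 | 6 | 6 | N | 2020-08-04 08:02
3 | 우리옛놀이연구소 | 13 | 13 | N | 2020-08-03 21:07
4 | 이야기나라&독서토론 | 15 | 12 | Y | 2020-08-03 17:44
5 | 이야기나라&독서토론 | 17 | 15 | Y | 2020-01-23 17:43
6 | 이공일팔 구민강사 | 10 | 8 | N | 2020-01-23 12:49
7 | 손 꽃 | 16 | 5 | Y | 2020-01-21 19:13
8 | 우리 옛놀이 연구소 | 10 | 10 | N | 2020-01-16 19:44
9 | 한울림 클래식 기타 | 6 | 6 | Y | 2020-01-14 15:28

Last rows
Index | 동아리명 | 전체회원수 | 실사용인원수 | 빔사용여부 | 신청일
24 | 우쿨렐레 미 아미고 | 8 | 8 | N | 2018-05-17 21:36
25 | 이야기나라&독서토론 | 14 | 14 | Y | 2018-05-16 13:11
26 | 이공일팔구민강사 | 11 | 11 | N | 2018-05-16 10:42
27 | 플라워한마음 | 9 | 9 | N | 2018-05-15 00:05
28 | 한울림 클래식기타동아리 | 8 | 8 | N | 2018-05-14 15:45
29 | 용산걷지모 | 26 | 26 | Y | 2018-01-24 14:58
30 | 이야기나라&독서토론 | 14 | 14 | Y | 2018-01-23 01:56
31 | 그린러브용산 | 13 | 10 | Y | 2018-01-22 15:59
32 | 우쿨렐레 미 아미고 | 6 | 6 | N | 2018-01-22 09:47
33 | 용산규방 | 11 | 10 | N | 2018-01-16 20:48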
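
The sample rows show the five columns as raw text, while the report treats them as Text, Numeric, Boolean and DateTime. The sketch below shows one way such rows could be converted into typed records using only the standard library. The `Application` class, its English field names, and the `parse_row` helper are hypothetical names introduced for illustration, and the mapping of "Y"/"N" to True/False is an assumption based on the sample values and the Boolean counts in this report.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class Application:
    club_name: str        # 동아리명
    total_members: int    # 전체회원수
    actual_users: int     # 실사용인원수
    uses_projector: bool  # 빔사용여부 ("Y" or "N" in the sample)
    applied_at: datetime  # 신청일

def parse_row(fields):
    """Convert one row of five string fields into a typed record."""
    name, members, users, projector, applied = fields
    return Application(
        club_name=name.strip(),
        total_members=int(members),
        actual_users=int(users),
        uses_projector=(projector == "Y"),
        applied_at=datetime.strptime(applied, "%Y-%m-%d %H:%M"),
    )

# Three rows copied from the sample listing of this report.
rows = [
    ["이공일팔구민강사", "10", "8", "N", "2020-08-11 10:14"],
    ["손 꽃", "16", "5", "Y", "2020-01-21 19:13"],
    ["용산규방", "11", "10", "N", "2018-01-16 20:48"],
]
records = [parse_row(r) for r in rows]
for rec in records:
    # Share of members who actually used the room for this application.
    print(rec.club_name, rec.actual_users / rec.total_members, rec.uses_projector)
```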