Overview

Dataset statistics

Number of variables: 4
Number of observations: 89
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.1 KiB
Average record size in memory: 35.5 B

Variable types

Numeric: 2
Text: 1
Categorical: 1

Alerts

sn is highly overall correlated with h_co (High correlation)
h_co is highly overall correlated with sn (High correlation)
sn has unique values (Unique)
h_co has 3 (3.4%) zeros (Zeros)
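
The Unique and Zeros alerts come down to simple counts. The following is a minimal sketch, assuming an alert is raised when every value is distinct or when at least one zero is present; the thresholds ydata-profiling actually applies are configurable and are not stated in this report. The helper names `zero_share` and `all_unique` are hypothetical.

```python
def zero_share(values):
    """Return (count, fraction) of exact zeros in a sequence of numbers."""
    zeros = sum(1 for v in values if v == 0)
    return zeros, zeros / len(values)


def all_unique(values):
    """True when no value repeats, i.e. the distinct count equals the row count."""
    return len(set(values)) == len(values)


# Toy column with the same shape as h_co in this report: 89 rows, 3 zeros.
# The non-zero values are placeholders, not the real data.
toy_h_co = [0, 0, 0] + list(range(1, 87))
count, share = zero_share(toy_h_co)
print(count, f"{share:.1%}")  # 3 3.4%
```

The same fraction, 3 / 89, is what the report rounds to 3.4%.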

Reproduction

Analysis started: 2023-12-10 09:54:05.930308
Analysis finished: 2023-12-10 09:54:07.645474
Duration: 1.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

sn
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 89
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 51.47191
Minimum: 1
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 933.0 B

Quantile statistics

Minimum: 1
5-th percentile: 6.4
Q1: 26
Median: 54
Q3: 77
95-th percentile: 95.6
Maximum: 100
Range: 99
Interquartile range (IQR): 51

Descriptive statistics

Standard deviation: 29.397777
Coefficient of variation (CV): 0.57114215
Kurtosis: -1.255344
Mean: 51.47191
Median Absolute Deviation (MAD): 26
Skewness: -0.03434543
Sum: 4581
Variance: 864.22932
Monotonicity: Strictly increasing
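
Several of these figures are tied together by simple identities, which makes a quick consistency check possible. The sketch below copies the sn values from this report and verifies three relationships; the tolerance is an assumption chosen to absorb the rounding of the printed figures.

```python
import math

# Figures copied from the sn statistics in this report.
n = 89
total = 4581
mean = 51.47191
std = 29.397777
variance = 864.22932
cv = 0.57114215

# Mean is the sum divided by the number of observations.
assert math.isclose(total / n, mean, rel_tol=1e-5)
# Variance is the square of the standard deviation.
assert math.isclose(std ** 2, variance, rel_tol=1e-5)
# Coefficient of variation is the standard deviation divided by the mean.
assert math.isclose(std / mean, cv, rel_tol=1e-5)
print("sn statistics are mutually consistent")
```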
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | 1.1%
78 | 1 | 1.1%
76 | 1 | 1.1%
75 | 1 | 1.1%
73 | 1 | 1.1%
72 | 1 | 1.1%
71 | 1 | 1.1%
70 | 1 | 1.1%
69 | 1 | 1.1%
68 | 1 | 1.1%
Other values (79) | 79 | 88.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.1%
2 | 1 | 1.1%
4 | 1 | 1.1%
5 | 1 | 1.1%
6 | 1 | 1.1%
7 | 1 | 1.1%
8 | 1 | 1.1%
9 | 1 | 1.1%
10 | 1 | 1.1%
12 | 1 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
100 | 1 | 1.1%
99 | 1 | 1.1%
98 | 1 | 1.1%
97 | 1 | 1.1%
96 | 1 | 1.1%
95 | 1 | 1.1%
94 | 1 | 1.1%
93 | 1 | 1.1%
92 | 1 | 1.1%
91 | 1 | 1.1%

act_progrm_nm
Text

Distinct: 88
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 52
Median length: 35
Mean length: 25.449438
Min length: 10

Characters and Unicode

Total characters: 2265
Distinct characters: 336
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
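
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. This is a minimal sketch and not necessarily how the report itself computes its counts; `category_counts` is a hypothetical helper name. The two-letter codes correspond to the category names used in this report, for example `Lo` for Other Letter, `Nd` for Decimal Number and `Zs` for Space Separator.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# Third sample row of this variable, as listed in the Sample section.
row = "원탁의 기사 참가자 모집"
print(category_counts(row))
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category counts for this variable.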

Unique

Unique: 87
Unique (%): 97.8%
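
Distinct and Unique measure different things: Distinct counts every value that occurs at least once, while Unique counts only the values that occur exactly once. With 89 rows, 88 distinct and 87 unique values, exactly one title must appear twice. The sketch below reproduces that arithmetic on made-up placeholder strings; `distinct_and_unique` is a hypothetical helper name.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): values seen at all vs. values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Placeholder column with the same shape as this variable: 89 rows in which
# one title occurs twice and the other 87 occur once. The strings are made up.
titles = [f"title-{i}" for i in range(87)] + ["repeated", "repeated"]
print(distinct_and_unique(titles))  # (88, 87)
```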

Sample

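1st row: 2019 논산시청소년어울림마당 5차 "논산시청소년활동발표회" (2019 Nonsan City Youth Eoullim Madang, 5th session: "Nonsan City Youth Activity Presentation")
2nd row: 특수청소년 난타교실 '더디가도 함께가요' (Nanta class for youth with special needs, 'Even if slowly, let's go together')
3rd row: 원탁의 기사 참가자 모집 (Recruiting participants for Knights of the Round Table)
4th row: 전주 예술중학교 진로직업체험 프로그램 (Jeonju Arts Middle School career and job experience program)
5th row: 제 9회 청소년 연말연시 희망나눔캠프 '사랑 나누고 마음 더하기' (9th Youth Year-End Hope Sharing Camp, 'Sharing love, adding heart')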
Value | Count | Frequency (%)
모집 (recruitment) | 17 | 3.9%
2020 | 12 | 2.7%
청소년 (youth) | 10 | 2.3%
2019 | 10 | 2.3%
동아리 (club) | 9 | 2.1%
신규회원 (new members) | 9 | 2.1%
안내 (notice) | 7 | 1.6%
2020년 (year 2020) | 7 | 1.6%
프로그램 (program) | 5 | 1.1%
양양군청소년수련관 (Yangyang County Youth Training Center) | 4 | 0.9%
Other values (289) | 348 | 79.5%

Most occurring characters

Value | Count | Frequency (%)
(space) | 359 | 15.8%
2 | 68 | 3.0%
 | 67 | 3.0%
0 | 65 | 2.9%
 | 58 | 2.6%
 | 51 | 2.3%
1 | 40 | 1.8%
 | 39 | 1.7%
 | 36 | 1.6%
' | 36 | 1.6%
Other values (326) | 1446 | 63.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1543 | 68.1%
Space Separator | 359 | 15.8%
Decimal Number | 213 | 9.4%
Other Punctuation | 65 | 2.9%
Uppercase Letter | 21 | 0.9%
Close Punctuation | 19 | 0.8%
Open Punctuation | 19 | 0.8%
Lowercase Letter | 17 | 0.8%
Dash Punctuation | 7 | 0.3%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 67 | 4.3%
 | 58 | 3.8%
 | 51 | 3.3%
 | 39 | 2.5%
 | 36 | 2.3%
 | 32 | 2.1%
 | 30 | 1.9%
 | 30 | 1.9%
 | 27 | 1.7%
 | 27 | 1.7%
Other values (270) | 1146 | 74.3%

Uppercase Letter
Value | Count | Frequency (%)
O | 3 | 14.3%
P | 2 | 9.5%
D | 2 | 9.5%
Y | 2 | 9.5%
G | 1 | 4.8%
A | 1 | 4.8%
M | 1 | 4.8%
N | 1 | 4.8%
J | 1 | 4.8%
K | 1 | 4.8%
Other values (6) | 6 | 28.6%

Lowercase Letter
Value | Count | Frequency (%)
e | 3 | 17.6%
n | 2 | 11.8%
s | 2 | 11.8%
t | 2 | 11.8%
i | 2 | 11.8%
h | 1 | 5.9%
u | 1 | 5.9%
o | 1 | 5.9%
l | 1 | 5.9%
a | 1 | 5.9%

Decimal Number
Value | Count | Frequency (%)
2 | 68 | 31.9%
0 | 65 | 30.5%
1 | 40 | 18.8%
9 | 21 | 9.9%
3 | 9 | 4.2%
5 | 4 | 1.9%
7 | 3 | 1.4%
4 | 2 | 0.9%
6 | 1 | 0.5%

Other Punctuation
Value | Count | Frequency (%)
' | 36 | 55.4%
" | 16 | 24.6%
! | 6 | 9.2%
/ | 2 | 3.1%
· | 2 | 3.1%
& | 1 | 1.5%
# | 1 | 1.5%
. | 1 | 1.5%

Close Punctuation
Value | Count | Frequency (%)
] | 8 | 42.1%
) | 7 | 36.8%
 | 3 | 15.8%
 | 1 | 5.3%

Open Punctuation
Value | Count | Frequency (%)
[ | 8 | 42.1%
( | 7 | 36.8%
 | 3 | 15.8%
 | 1 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 359 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 7 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 1 | 100.0%

Other Symbol
Value | Count | Frequency (%)
 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1542 | 68.1%
Common | 684 | 30.2%
Latin | 38 | 1.7%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 67 | 4.3%
 | 58 | 3.8%
 | 51 | 3.3%
 | 39 | 2.5%
 | 36 | 2.3%
 | 32 | 2.1%
 | 30 | 1.9%
 | 30 | 1.9%
 | 27 | 1.8%
 | 27 | 1.8%
Other values (269) | 1145 | 74.3%

Common
Value | Count | Frequency (%)
(space) | 359 | 52.5%
2 | 68 | 9.9%
0 | 65 | 9.5%
1 | 40 | 5.8%
' | 36 | 5.3%
9 | 21 | 3.1%
" | 16 | 2.3%
3 | 9 | 1.3%
] | 8 | 1.2%
[ | 8 | 1.2%
Other values (19) | 54 | 7.9%

Latin
Value | Count | Frequency (%)
e | 3 | 7.9%
O | 3 | 7.9%
n | 2 | 5.3%
s | 2 | 5.3%
P | 2 | 5.3%
t | 2 | 5.3%
i | 2 | 5.3%
D | 2 | 5.3%
Y | 2 | 5.3%
h | 1 | 2.6%
Other values (17) | 17 | 44.7%

Han
Value | Count | Frequency (%)
 | 1 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1542 | 68.1%
ASCII | 711 | 31.4%
None | 10 | 0.4%
CJK | 1 | < 0.1%
Misc Symbols | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 359 | 50.5%
2 | 68 | 9.6%
0 | 65 | 9.1%
1 | 40 | 5.6%
' | 36 | 5.1%
9 | 21 | 3.0%
" | 16 | 2.3%
3 | 9 | 1.3%
] | 8 | 1.1%
[ | 8 | 1.1%
Other values (40) | 81 | 11.4%

Hangul
Value | Count | Frequency (%)
 | 67 | 4.3%
 | 58 | 3.8%
 | 51 | 3.3%
 | 39 | 2.5%
 | 36 | 2.3%
 | 32 | 2.1%
 | 30 | 1.9%
 | 30 | 1.9%
 | 27 | 1.8%
 | 27 | 1.8%
Other values (269) | 1145 | 74.3%

None
Value | Count | Frequency (%)
 | 3 | 30.0%
 | 3 | 30.0%
· | 2 | 20.0%
 | 1 | 10.0%
 | 1 | 10.0%

CJK
Value | Count | Frequency (%)
 | 1 | 100.0%

Misc Symbols
Value | Count | Frequency (%)
 | 1 | 100.0%

act_se_nm
Categorical

Distinct: 8
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 6
Median length: 4
Mean length: 3.1460674
Min length: 2

Unique

Unique: 1
Unique (%): 1.1%

Sample

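1st row: 문화예술 (Culture and arts)
2nd row: 문화예술 (Culture and arts)
3rd row: 자기개발 (Self-development)
4th row: 진로탐구 (Career exploration)
5th row: 봉사협력 (Volunteering and cooperation)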

Common Values

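Value | Count | Frequency (%)
기타 (Other) | 35 | 39.3%
문화예술 (Culture and arts) | 20 | 22.5%
자기개발 (Self-development) | 10 | 11.2%
진로탐구 (Career exploration) | 10 | 11.2%
봉사협력 (Volunteering and cooperation) | 7 | 7.9%
교류 (Exchange) | 4 | 4.5%
역사탐방 (Historical site visits) | 2 | 2.2%
건강/스포츠 (Health/Sports) | 1 | 1.1%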

Length

Histogram of lengths of the category


h_co
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 63
Distinct (%): 70.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 78.404494
Minimum: 0
Maximum: 374
Zeros: 3
Zeros (%): 3.4%
Negative: 0
Negative (%): 0.0%
Memory size: 933.0 B

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 11
Median: 42
Q3: 110
95-th percentile: 256.6
Maximum: 374
Range: 374
Interquartile range (IQR): 99
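
The quartiles can also be used to gauge how far the upper tail of h_co extends. The sketch below applies Tukey's 1.5 × IQR fences; this rule is an assumption introduced here for illustration and is not something the report itself computes. The quartiles and the ten largest values are copied from this report.

```python
# Quartiles copied from the h_co quantile statistics in this report.
q1, q3 = 11, 110
iqr = q3 - q1                    # 99, matching the reported IQR
lower_fence = q1 - 1.5 * iqr     # -137.5
upper_fence = q3 + 1.5 * iqr     # 258.5

# The ten largest h_co values as listed in this report.
largest = [374, 347, 342, 317, 267, 241, 240, 234, 233, 228]
flagged = [v for v in largest if v > upper_fence]
print(flagged)  # [374, 347, 342, 317, 267]
```

Because the minimum is 0, nothing falls below the lower fence; under this rule only the five largest values would count as outliers.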

Descriptive statistics

Standard deviation: 90.311043
Coefficient of variation (CV): 1.1518605
Kurtosis: 1.8565435
Mean: 78.404494
Median Absolute Deviation (MAD): 38
Skewness: 1.5377043
Sum: 6978
Variance: 8156.0845
Monotonicity: Not monotonic
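
The positive skewness agrees with the gap between the mean (78.4) and the median (42). As a rough cross-check, the sketch below computes Pearson's second (median) skewness coefficient from the reported figures. This is a different measure from the moment-based skewness shown above, so only the sign and rough magnitude are comparable.

```python
# Figures copied from the h_co statistics in this report.
mean = 78.404494
median = 42
std = 90.311043

# Pearson's second (median) skewness coefficient: 3 * (mean - median) / std.
median_skewness = 3 * (mean - median) / std
print(round(median_skewness, 3))  # 1.209
```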
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
7 | 4 | 4.5%
42 | 4 | 4.5%
2 | 3 | 3.4%
121 | 3 | 3.4%
0 | 3 | 3.4%
12 | 3 | 3.4%
6 | 3 | 3.4%
13 | 3 | 3.4%
5 | 3 | 3.4%
15 | 2 | 2.2%
Other values (53) | 58 | 65.2%

Minimum 10 values

Value | Count | Frequency (%)
0 | 3 | 3.4%
1 | 1 | 1.1%
2 | 3 | 3.4%
3 | 1 | 1.1%
4 | 1 | 1.1%
5 | 3 | 3.4%
6 | 3 | 3.4%
7 | 4 | 4.5%
9 | 1 | 1.1%
10 | 2 | 2.2%

Maximum 10 values

Value | Count | Frequency (%)
374 | 1 | 1.1%
347 | 1 | 1.1%
342 | 1 | 1.1%
317 | 1 | 1.1%
267 | 1 | 1.1%
241 | 1 | 1.1%
240 | 1 | 1.1%
234 | 1 | 1.1%
233 | 1 | 1.1%
228 | 1 | 1.1%

Interactions


Correlations

Variable | sn | act_progrm_nm | act_se_nm | h_co
sn | 1.000 | 1.000 | 0.498 | 0.800
act_progrm_nm | 1.000 | 1.000 | 1.000 | 1.000
act_se_nm | 0.498 | 1.000 | 1.000 | 0.219
h_co | 0.800 | 1.000 | 0.219 | 1.000

Variable | sn | h_co | act_se_nm
sn | 1.000 | -0.588 | 0.259
h_co | -0.588 | 1.000 | 0.099
act_se_nm | 0.259 | 0.099 | 1.000
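
The report does not say which coefficient each matrix uses, so the figures cannot be reproduced exactly from the text alone. As a rough illustration of the direction of the sn and h_co relationship, the sketch below computes a plain Pearson correlation over the 20 rows listed in the Sample section. Both the choice of Pearson and the restriction to those 20 rows are assumptions, so the result is not expected to match either matrix; `pearson` is a hypothetical helper name.

```python
import math


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# sn and h_co for the 20 rows shown in the Sample section (first 10, last 10).
sn = [1, 2, 4, 5, 6, 7, 8, 9, 10, 12, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100]
h_co = [115, 228, 42, 240, 233, 241, 162, 267, 234, 95,
        13, 2, 13, 9, 12, 7, 23, 10, 42, 2]
r = pearson(sn, h_co)
print(r < 0)  # True: on these rows, larger sn goes with smaller h_co
```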

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

index | sn | act_progrm_nm | act_se_nm | h_co
0 | 1 | 2019 논산시청소년어울림마당 5차 "논산시청소년활동발표회" | 문화예술 | 115
1 | 2 | 특수청소년 난타교실 '더디가도 함께가요' | 문화예술 | 228
2 | 4 | 원탁의 기사 참가자 모집 | 자기개발 | 42
3 | 5 | 전주 예술중학교 진로직업체험 프로그램 | 진로탐구 | 240
4 | 6 | 제 9회 청소년 연말연시 희망나눔캠프 '사랑 나누고 마음 더하기' | 봉사협력 | 233
5 | 7 | 제2회 우리들의 확실한 행복캠프 | 자기개발 | 241
6 | 8 | 지역사회서비스투자사업 '글로벌 마인드 형성 서비스' | 자기개발 | 162
7 | 9 | 제3회 우리들의 확실한 행복캠프 | 자기개발 | 267
8 | 10 | 지역사회서비스투자사업 '아동청소년 비전형성서비스' | 자기개발 | 234
9 | 12 | 꿈꾸는 찰칵이 참가자(중학생)를 모집합니다. | 기타 | 95

Last rows

index | sn | act_progrm_nm | act_se_nm | h_co
79 | 91 | 손으로 만드는 함께하는 세상 "꼼지락 교실" | 문화예술 | 13
80 | 92 | 진해청소년전당 동아리연합회 '하랑' 임원 12월 정기회의 | 교류 | 2
81 | 93 | 손으로 만드는 함께하는 세상 "꼼지락 교실" | 문화예술 | 13
82 | 94 | 진해청소년전당 청소년동아리 4분기 간담회 | 교류 | 9
83 | 95 | 수련관 11월 과학무료특강 참여자 모집 | 기타 | 12
84 | 96 | 문화예술아카데미 관심 - 트럼펫 | 문화예술 | 7
85 | 97 | 2019년 꿈드림「학습멘토단 울타리」를 모집합니다!! [학습지도 활동] | 기타 | 23
86 | 98 | 환경보호 서명운동 및 EM흙공 던지기 활동 | 기타 | 10
87 | 99 | 제20회 청소년만화축제 작품공모전 | 기타 | 42
88 | 100 | 대한민국 미래의 100년 [2019 다시 청소년이다!] "어찌 잊으오" 5차 태극기 휘날리며2 | 자기개발 | 2