Overview

Dataset statistics

Number of variables: 10
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.1 KiB
Average record size in memory: 83.3 B
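
The two memory figures are tied together by the row count. As a rough consistency check (a sketch, not the profiler's own code), multiplying the average record size by the number of observations should reproduce the total size once it is rounded to one decimal in KiB. The helper name `format_kib` is made up for this illustration.

```python
# Figures copied from the dataset statistics.
n_observations = 100
avg_record_bytes = 83.3


def format_kib(n_bytes: float) -> str:
    """Format a byte count in KiB with one decimal (hypothetical helper)."""
    return f"{n_bytes / 1024:.1f} KiB"


total_bytes = avg_record_bytes * n_observations  # roughly 8330 bytes
print(format_kib(total_bytes))
```

Going the other way (8.1 KiB divided by 100 rows) gives about 82.9 B, because the total has already been rounded to one decimal.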

Variable types

Numeric: 1
Categorical: 9

Alerts

partcpt_se_nm is highly overall correlated with partcpt_reqst_no and 4 other fields (High correlation)
partcpt_hope_nation_nm is highly overall correlated with partcpt_reqst_no and 4 other fields (High correlation)
event_info_sn is highly overall correlated with partcpt_reqst_no and 4 other fields (High correlation)
adhrnc_ty_nm is highly overall correlated with partcpt_reqst_no and 3 other fields (High correlation)
partcpt_reqst_no is highly overall correlated with event_info_sn and 3 other fields (High correlation)
adhrnc_se_nm is highly overall correlated with event_info_sn and 2 other fields (High correlation)
adhrnc_se_nm is highly imbalanced (51.3%) (Imbalance)
partcpt_reqst_no has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 10:00:30.702587
Analysis finished: 2023-12-10 10:00:32.670419
Duration: 1.97 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
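
The reported duration follows directly from the two timestamps. A minimal sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-10 10:00:30.702587")
finished = datetime.fromisoformat("2023-12-10 10:00:32.670419")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```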

Variables

partcpt_reqst_no
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 64765.37
Minimum: 60601
Maximum: 74003
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 60601
5th percentile: 60854.6
Q1: 61457.25
Median: 61961.5
Q3: 65422
95th percentile: 73997.05
Maximum: 74003
Range: 13402
Interquartile range (IQR): 3964.75

Descriptive statistics

Standard deviation: 5373.3526
Coefficient of variation (CV): 0.082966446
Kurtosis: -0.65956939
Mean: 64765.37
Median Absolute Deviation (MAD): 590
Skewness: 1.1453541
Sum: 6476537
Variance: 28872918
Monotonicity: Not monotonic
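
Several of these figures are derived from others in the report, which gives a cheap way to sanity-check them. The sketch below recomputes the range, IQR, coefficient of variation, variance and sum from the reported minimum, maximum, quartiles, mean and standard deviation. It works on the rounded values printed above, so small rounding differences are expected.

```python
# Values copied from the quantile and descriptive statistics of partcpt_reqst_no.
n = 100
minimum, maximum = 60601, 74003
q1, q3 = 61457.25, 65422
mean = 64765.37
std = 5373.3526

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance is the squared standard deviation
total = mean * n                 # Sum is mean times number of observations

print(value_range, iqr, round(cv, 6), round(variance), round(total))
```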
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
73983 | 1 | 1.0%
62090 | 1 | 1.0%
61370 | 1 | 1.0%
62102 | 1 | 1.0%
60601 | 1 | 1.0%
60847 | 1 | 1.0%
61924 | 1 | 1.0%
61591 | 1 | 1.0%
61562 | 1 | 1.0%
62493 | 1 | 1.0%
Other values (90) | 90 | 90.0%

Minimum 10 values

Value | Count | Frequency (%)
60601 | 1 | 1.0%
60657 | 1 | 1.0%
60717 | 1 | 1.0%
60787 | 1 | 1.0%
60847 | 1 | 1.0%
60855 | 1 | 1.0%
60890 | 1 | 1.0%
60908 | 1 | 1.0%
60929 | 1 | 1.0%
60964 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
74003 | 1 | 1.0%
74002 | 1 | 1.0%
74001 | 1 | 1.0%
74000 | 1 | 1.0%
73998 | 1 | 1.0%
73997 | 1 | 1.0%
73996 | 1 | 1.0%
73995 | 1 | 1.0%
73994 | 1 | 1.0%
73993 | 1 | 1.0%
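
The ten smallest and ten largest values are enough to reproduce the 5th and 95th percentiles in the quantile statistics, assuming the usual linear interpolation between neighbouring order statistics (the default in NumPy and pandas). This is a sketch under that assumption; the 80 middle values are unknown here and are filled with `None`, which is fine because neither percentile touches them.

```python
import math

smallest = [60601, 60657, 60717, 60787, 60847, 60855, 60890, 60908, 60929, 60964]
largest = [73993, 73994, 73995, 73996, 73997, 73998, 74000, 74001, 74002, 74003]
n = 100

# Sorted sample with the unknown middle part left as None placeholders.
values = smallest + [None] * (n - len(smallest) - len(largest)) + largest


def percentile_linear(sorted_values, q):
    """Percentile by linear interpolation at position q * (n - 1)."""
    pos = q * (len(sorted_values) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


p05 = percentile_linear(values, 0.05)
p95 = percentile_linear(values, 0.95)
print(round(p05, 2), round(p95, 2))
```

With 100 observations the 5th percentile sits at position 4.95 (between 60847 and 60855) and the 95th at position 94.05 (between 73997 and 73998).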

event_info_sn
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
1506 | 75
3020 | 25

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3020
2nd row: 3020
3rd row: 3020
4th row: 3020
5th row: 3020

Common Values

Value | Count | Frequency (%)
1506 | 75 | 75.0%
3020 | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1506 | 75 | 75.0%
3020 | 25 | 25.0%

partcpt_se_nm
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
한중 청소년교류 [Korea-China youth exchange] | 75
국가간 청소년교류 [inter-country youth exchange] | 25

Length

Max length: 9
Median length: 8
Mean length: 8.25
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국가간 청소년교류
2nd row: 국가간 청소년교류
3rd row: 국가간 청소년교류
4th row: 국가간 청소년교류
5th row: 국가간 청소년교류

Common Values

Value [English gloss] | Count | Frequency (%)
한중 청소년교류 [Korea-China youth exchange] | 75 | 75.0%
국가간 청소년교류 [inter-country youth exchange] | 25 | 25.0%
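
The length statistics for this variable can be recovered from the value counts alone, assuming length means the number of characters in each string (spaces included). A small sketch with the standard library:

```python
import unicodedata
from statistics import mean, median

# Value counts for partcpt_se_nm, copied from the table above.
counts = {"한중 청소년교류": 75, "국가간 청소년교류": 25}

# One length per row; NFC normalisation keeps each Hangul syllable as one character.
lengths = [
    len(unicodedata.normalize("NFC", value))
    for value, count in counts.items()
    for _ in range(count)
]

print(max(lengths), median(lengths), mean(lengths), min(lengths))
```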

Length

Histogram of lengths of the category

Common Values (Plot)

Value [English gloss] | Count | Frequency (%)
청소년교류 [youth exchange] | 100 | 50.0%
한중 [Korea-China] | 75 | 37.5%
국가간 [inter-country] | 25 | 12.5%
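
This last table counts individual words rather than whole category values, which is why its frequencies are relative to 200 tokens instead of 100 rows. The sketch below reproduces it by splitting each value on whitespace and weighting by the row count; splitting on whitespace is an assumption about how the profiler tokenises, not something the report states.

```python
from collections import Counter

# Value counts for partcpt_se_nm.
counts = {"한중 청소년교류": 75, "국가간 청소년교류": 25}

word_counts = Counter()
for value, count in counts.items():
    for word in value.split():  # assumed tokenisation: split on whitespace
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(word, count, f"{100 * count / total_words:.1f}%")
```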

partcpt_hope_nation_nm
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
중국 [China] | 75
인도네시아 [Indonesia] | 25

Length

Max length: 5
Median length: 2
Mean length: 2.75
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인도네시아
2nd row: 인도네시아
3rd row: 인도네시아
4th row: 인도네시아
5th row: 인도네시아

Common Values

Value [English gloss] | Count | Frequency (%)
중국 [China] | 75 | 75.0%
인도네시아 [Indonesia] | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value [English gloss] | Count | Frequency (%)
중국 [China] | 75 | 75.0%
인도네시아 [Indonesia] | 25 | 25.0%

adhrnc_ty_nm
Categorical

HIGH CORRELATION 

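Distinct: 6
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
청소년 [youth] | 66
청소년(전문파견) [youth (specialist dispatch)] | 25
청소년관련학과 청소년 [youth from a youth-related academic department] | 5
촬영팀 [filming team] | 2
공연팀 [performance team] | 1

Length

Max length: 11
Median length: 3
Mean length: 4.93
Min length: 3

Unique

Unique: 2
Unique (%): 2.0%

Sample

1st row: 청소년(전문파견)
2nd row: 청소년(전문파견)
3rd row: 청소년(전문파견)
4th row: 청소년(전문파견)
5th row: 청소년(전문파견)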

Common Values

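Value [English gloss] | Count | Frequency (%)
청소년 [youth] | 66 | 66.0%
청소년(전문파견) [youth (specialist dispatch)] | 25 | 25.0%
청소년관련학과 청소년 [youth from a youth-related academic department] | 5 | 5.0%
촬영팀 [filming team] | 2 | 2.0%
공연팀 [performance team] | 1 | 1.0%
블로그기자단 [blog reporter corps] | 1 | 1.0%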
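
The report distinguishes between distinct values (how many different categories occur) and unique values (categories that occur exactly once). Both can be read off the counts above; a minimal sketch:

```python
# Value counts for adhrnc_ty_nm, copied from the Common Values table.
counts = {
    "청소년": 66,
    "청소년(전문파견)": 25,
    "청소년관련학과 청소년": 5,
    "촬영팀": 2,
    "공연팀": 1,
    "블로그기자단": 1,
}

n_rows = sum(counts.values())
n_distinct = len(counts)                             # different categories
n_unique = sum(1 for c in counts.values() if c == 1)  # categories seen exactly once

print(n_distinct, f"{100 * n_distinct / n_rows:.1f}%")
print(n_unique, f"{100 * n_unique / n_rows:.1f}%")
```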

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
청소년 | 71 | 67.6%
청소년(전문파견 | 25 | 23.8%
청소년관련학과 | 5 | 4.8%
촬영팀 | 2 | 1.9%
공연팀 | 1 | 1.0%
블로그기자단 | 1 | 1.0%

sexdstn_se
Categorical

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
여자 [female] | 67
남자 [male] | 33

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 여자
2nd row: 남자
3rd row: 남자
4th row: 여자
5th row: 여자

Common Values

Value [English gloss] | Count | Frequency (%)
여자 [female] | 67 | 67.0%
남자 [male] | 33 | 33.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value [English gloss] | Count | Frequency (%)
여자 [female] | 67 | 67.0%
남자 [male] | 33 | 33.0%

area_nm
Categorical

Distinct: 15
Distinct (%): 15.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
경기 [Gyeonggi] | 28
서울 [Seoul] | 22
대구 [Daegu] | 7
경남 [Gyeongnam] | 7
부산 [Busan] | 6
Other values (10) | 30

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 3
Unique (%): 3.0%

Sample

1st row: 경기
2nd row: 서울
3rd row: 전남
4th row: 경기
5th row: 서울

Common Values

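Value [English gloss] | Count | Frequency (%)
경기 [Gyeonggi] | 28 | 28.0%
서울 [Seoul] | 22 | 22.0%
대구 [Daegu] | 7 | 7.0%
경남 [Gyeongnam] | 7 | 7.0%
부산 [Busan] | 6 | 6.0%
충남 [Chungnam] | 5 | 5.0%
강원 [Gangwon] | 5 | 5.0%
인천 [Incheon] | 5 | 5.0%
전북 [Jeonbuk] | 4 | 4.0%
경북 [Gyeongbuk] | 4 | 4.0%
Other values (5) | 7 | 7.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value [English gloss] | Count | Frequency (%)
경기 [Gyeonggi] | 28 | 28.0%
서울 [Seoul] | 22 | 22.0%
대구 [Daegu] | 7 | 7.0%
경남 [Gyeongnam] | 7 | 7.0%
부산 [Busan] | 6 | 6.0%
충남 [Chungnam] | 5 | 5.0%
강원 [Gangwon] | 5 | 5.0%
인천 [Incheon] | 5 | 5.0%
전북 [Jeonbuk] | 4 | 4.0%
경북 [Gyeongbuk] | 4 | 4.0%
Other values (5) | 7 | 7.0%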

adhrnc_se_nm
Categorical

HIGH CORRELATION  IMBALANCE 

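Distinct: 6
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
고등학생 [high school student] | 57
대학생 [university student] | 39
대학원생 [graduate student] | 1
중학생 [middle school student] | 1
초등학생 [elementary school student] | 1

Length

Max length: 4
Median length: 4
Mean length: 3.58
Min length: 2

Unique

Unique: 4
Unique (%): 4.0%

Sample

1st row: 대학생
2nd row: 대학생
3rd row: 대학생
4th row: 대학생
5th row: 대학생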

Common Values

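Value [English gloss] | Count | Frequency (%)
고등학생 [high school student] | 57 | 57.0%
대학생 [university student] | 39 | 39.0%
대학원생 [graduate student] | 1 | 1.0%
중학생 [middle school student] | 1 | 1.0%
초등학생 [elementary school student] | 1 | 1.0%
일반 [general public] | 1 | 1.0%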
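
The 51.3% in the imbalance alert for this variable is consistent with a score of one minus the normalised Shannon entropy of the category frequencies. The sketch below computes that quantity from the counts above. Treating this as the profiler's formula is an assumption made here, supported only by the fact that the numbers agree.

```python
import math


def imbalance_score(counts):
    """1 - H / log2(k): 0 for a uniform distribution, 1 for a single dominant class."""
    n = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / n) * math.log2(c / n) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Value counts for adhrnc_se_nm.
counts = [57, 39, 1, 1, 1, 1]
score = imbalance_score(counts)
print(f"{100 * score:.1f}%")
```

Two categories cover 96 of the 100 rows, so the entropy is roughly half of what six equally frequent categories would give.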

Length

Histogram of lengths of the category

Common Values (Plot)

Value [English gloss] | Count | Frequency (%)
고등학생 [high school student] | 57 | 57.0%
대학생 [university student] | 39 | 39.0%
대학원생 [graduate student] | 1 | 1.0%
중학생 [middle school student] | 1 | 1.0%
초등학생 [elementary school student] | 1 | 1.0%
일반 [general public] | 1 | 1.0%
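
visit_nation_lang_ablty_value
Categorical

Distinct: 6
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
<NA> | 33
(not shown) | 24
중하 [lower-intermediate] | 19
(not shown) | 16
중상 [upper-intermediate] | 6

Length

Max length: 4
Median length: 2
Mean length: 2.24
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (not shown)
2nd row: 중하
3rd row: <NA>
4th row: (not shown)
5th row: <NA>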

Common Values

Value [English gloss] | Count | Frequency (%)
<NA> | 33 | 33.0%
(not shown) | 24 | 24.0%
중하 [lower-intermediate] | 19 | 19.0%
(not shown) | 16 | 16.0%
중상 [upper-intermediate] | 6 | 6.0%
(not shown) | 2 | 2.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value [English gloss] | Count | Frequency (%)
na | 33 | 33.0%
(not shown) | 24 | 24.0%
중하 [lower-intermediate] | 19 | 19.0%
(not shown) | 16 | 16.0%
중상 [upper-intermediate] | 6 | 6.0%
(not shown) | 2 | 2.0%
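
engl_convrs_ablty_value
Categorical

Distinct: 7
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value [English gloss] | Count
<NA> | 30
(not shown) | 25
중하 [lower-intermediate] | 14
중상 [upper-intermediate] | 12
(not shown) | 10
Other values (2) | 9

Length

Max length: 4
Median length: 3
Mean length: 2.18
Min length: 1

Unique

Unique: 1
Unique (%): 1.0%

Sample

1st row: (not shown)
2nd row: 중상
3rd row: (not shown)
4th row: (not shown)
5th row: <NA>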

Common Values

Value [English gloss] | Count | Frequency (%)
<NA> | 30 | 30.0%
(not shown) | 25 | 25.0%
중하 [lower-intermediate] | 14 | 14.0%
중상 [upper-intermediate] | 12 | 12.0%
(not shown) | 10 | 10.0%
(not shown) | 8 | 8.0%
원어민 [native speaker] | 1 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value [English gloss] | Count | Frequency (%)
na | 30 | 30.0%
(not shown) | 25 | 25.0%
중하 [lower-intermediate] | 14 | 14.0%
중상 [upper-intermediate] | 12 | 12.0%
(not shown) | 10 | 10.0%
(not shown) | 8 | 8.0%
원어민 [native speaker] | 1 | 1.0%

Interactions


Correlations

Variable | partcpt_reqst_no | event_info_sn | partcpt_se_nm | partcpt_hope_nation_nm | adhrnc_ty_nm | sexdstn_se | area_nm | adhrnc_se_nm | visit_nation_lang_ablty_value | engl_convrs_ablty_value
partcpt_reqst_no | 1.000 | 1.000 | 1.000 | 1.000 | 0.938 | 0.101 | 0.389 | 0.711 | 0.345 | 0.000
event_info_sn | 1.000 | 1.000 | 0.999 | 0.999 | 1.000 | 0.000 | 0.284 | 0.784 | 0.297 | 0.323
partcpt_se_nm | 1.000 | 0.999 | 1.000 | 0.999 | 1.000 | 0.000 | 0.284 | 0.784 | 0.297 | 0.323
partcpt_hope_nation_nm | 1.000 | 0.999 | 0.999 | 1.000 | 1.000 | 0.000 | 0.284 | 0.784 | 0.297 | 0.323
adhrnc_ty_nm | 0.938 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.238 | 0.572 | 0.401 | 0.000
sexdstn_se | 0.101 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.171
area_nm | 0.389 | 0.284 | 0.284 | 0.284 | 0.238 | 0.000 | 1.000 | 0.634 | 0.000 | 0.000
adhrnc_se_nm | 0.711 | 0.784 | 0.784 | 0.784 | 0.572 | 0.000 | 0.634 | 1.000 | 0.000 | 0.000
visit_nation_lang_ablty_value | 0.345 | 0.297 | 0.297 | 0.297 | 0.401 | 0.000 | 0.000 | 0.000 | 1.000 | 0.168
engl_convrs_ablty_value | 0.000 | 0.323 | 0.323 | 0.323 | 0.000 | 0.171 | 0.000 | 0.000 | 0.168 | 1.000

Variable | adhrnc_se_nm | partcpt_se_nm | sexdstn_se | area_nm | engl_convrs_ablty_value | partcpt_hope_nation_nm | visit_nation_lang_ablty_value | event_info_sn | adhrnc_ty_nm
adhrnc_se_nm | 1.000 | 0.577 | 0.000 | 0.338 | 0.000 | 0.577 | 0.000 | 0.577 | 0.236
partcpt_se_nm | 0.577 | 1.000 | 0.000 | 0.239 | 0.223 | 0.973 | 0.353 | 0.973 | 0.979
sexdstn_se | 0.000 | 0.000 | 1.000 | 0.000 | 0.115 | 0.000 | 0.000 | 0.000 | 0.000
area_nm | 0.338 | 0.239 | 0.000 | 1.000 | 0.000 | 0.239 | 0.000 | 0.239 | 0.101
engl_convrs_ablty_value | 0.000 | 0.223 | 0.115 | 0.000 | 1.000 | 0.223 | 0.106 | 0.223 | 0.000
partcpt_hope_nation_nm | 0.577 | 0.973 | 0.000 | 0.239 | 0.223 | 1.000 | 0.353 | 0.973 | 0.979
visit_nation_lang_ablty_value | 0.000 | 0.353 | 0.000 | 0.000 | 0.106 | 0.353 | 1.000 | 0.353 | 0.281
event_info_sn | 0.577 | 0.973 | 0.000 | 0.239 | 0.223 | 0.973 | 0.353 | 1.000 | 0.979
adhrnc_ty_nm | 0.236 | 0.979 | 0.000 | 0.101 | 0.000 | 0.979 | 0.281 | 0.979 | 1.000
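
The 0.973 between event_info_sn, partcpt_se_nm and partcpt_hope_nation_nm looks surprising at first, since the sample rows suggest these columns determine each other exactly. One reading, offered here as an assumption rather than something the report states, is that this matrix holds a bias-corrected Cramér's V computed with Yates' continuity correction. For a perfectly associated 2x2 table with a 75/25 split that statistic comes out just below 1. The contingency table below is likewise assumed from the matching 75/25 counts.

```python
import math


def cramers_v_corrected(table, yates=True):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    r, k = len(table), len(table[0])

    # Yates' continuity correction applies to 2x2 tables only.
    use_yates = yates and r == 2 and k == 2
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            diff = abs(table[i][j] - expected)
            if use_yates:
                diff = max(0.0, diff - 0.5)
            chi2 += diff * diff / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return math.sqrt(phi2_corr / denom) if denom > 0 else 0.0


# Assumed cross-tabulation of event_info_sn (1506 / 3020) against
# partcpt_se_nm: every row of one category pairs with one category of the other.
table = [[75, 0], [0, 25]]
v = cramers_v_corrected(table)
print(round(v, 3))
```

Without the continuity and bias corrections the same table would give exactly 1, so a value slightly under 1 here does not by itself indicate any mismatch between the columns.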

Variable | partcpt_reqst_no | event_info_sn | partcpt_se_nm | partcpt_hope_nation_nm | adhrnc_ty_nm | sexdstn_se | area_nm | adhrnc_se_nm | visit_nation_lang_ablty_value | engl_convrs_ablty_value
partcpt_reqst_no | 1.000 | 0.995 | 0.995 | 0.995 | 0.688 | 0.158 | 0.174 | 0.387 | 0.264 | 0.000
event_info_sn | 0.995 | 1.000 | 0.973 | 0.973 | 0.979 | 0.000 | 0.239 | 0.577 | 0.353 | 0.223
partcpt_se_nm | 0.995 | 0.973 | 1.000 | 0.973 | 0.979 | 0.000 | 0.239 | 0.577 | 0.353 | 0.223
partcpt_hope_nation_nm | 0.995 | 0.973 | 0.973 | 1.000 | 0.979 | 0.000 | 0.239 | 0.577 | 0.353 | 0.223
adhrnc_ty_nm | 0.688 | 0.979 | 0.979 | 0.979 | 1.000 | 0.000 | 0.101 | 0.236 | 0.281 | 0.000
sexdstn_se | 0.158 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.115
area_nm | 0.174 | 0.239 | 0.239 | 0.239 | 0.101 | 0.000 | 1.000 | 0.338 | 0.000 | 0.000
adhrnc_se_nm | 0.387 | 0.577 | 0.577 | 0.577 | 0.236 | 0.000 | 0.338 | 1.000 | 0.000 | 0.000
visit_nation_lang_ablty_value | 0.264 | 0.353 | 0.353 | 0.353 | 0.281 | 0.000 | 0.000 | 0.000 | 1.000 | 0.106
engl_convrs_ablty_value | 0.000 | 0.223 | 0.223 | 0.223 | 0.000 | 0.115 | 0.000 | 0.000 | 0.106 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
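
The report counts zero missing cells, yet two variables list `<NA>` as a regular category (33 and 30 rows) with a string length of 4. That pattern suggests the source data stores the literal text `<NA>` rather than a true null, although the report does not say so. Under that assumption, a minimal sketch of mapping such placeholder tokens to `None` before counting missing values could look like this. The placeholder set and the stand-in label `"other"` are illustrative.

```python
# Tokens assumed to mean "no answer"; adjust to the actual data.
PLACEHOLDERS = {"<NA>", ""}

# 33 placeholder rows as reported for visit_nation_lang_ablty_value;
# "other" stands in for the 67 rows holding real categories.
values = ["<NA>"] * 33 + ["other"] * 67

cleaned = [None if v.strip() in PLACEHOLDERS else v for v in values]

n_missing = sum(v is None for v in cleaned)
missing_pct = 100 * n_missing / len(cleaned)
print(n_missing, f"{missing_pct:.1f}%")
```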

Sample

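First rows

Index | partcpt_reqst_no | event_info_sn | partcpt_se_nm | partcpt_hope_nation_nm | adhrnc_ty_nm | sexdstn_se | area_nm | adhrnc_se_nm | visit_nation_lang_ablty_value / engl_convrs_ablty_value
0 | 73983 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 여자 | 경기 | 대학생 | (not shown)
1 | 73994 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 남자 | 서울 | 대학생 | 중하 / 중상
2 | 73982 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 남자 | 전남 | 대학생 | <NA>
3 | 73985 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 여자 | 경기 | 대학생 | (not shown)
4 | 73991 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 여자 | 서울 | 대학생 | <NA> / <NA>
5 | 73979 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 여자 | 부산 | 대학생 | 중상
6 | 73998 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 남자 | 서울 | 대학생 | 원어민
7 | 73995 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 여자 | 광주 | 고등학생 | <NA> / <NA>
8 | 73987 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 여자 | 대구 | 대학생 | (not shown)
9 | 73984 | 3020 | 국가간 청소년교류 | 인도네시아 | 청소년(전문파견) | 여자 | 서울 | 대학생 | 중하

Last rows

Index | partcpt_reqst_no | event_info_sn | partcpt_se_nm | partcpt_hope_nation_nm | adhrnc_ty_nm | sexdstn_se | area_nm | adhrnc_se_nm | visit_nation_lang_ablty_value / engl_convrs_ablty_value
90 | 61831 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 남자 | 경북 | 고등학생 | <NA> / <NA>
91 | 60787 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 여자 | 서울 | 고등학생 | 중상
92 | 60855 | 1506 | 한중 청소년교류 | 중국 | 청소년관련학과 청소년 | 여자 | 강원 | 대학생 | (not shown)
93 | 61466 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 여자 | 부산 | 고등학생 | 중하 / 중하
94 | 61040 | 1506 | 한중 청소년교류 | 중국 | 청소년관련학과 청소년 | 남자 | 대구 | 대학생 | 중하
95 | 61276 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 여자 | 대구 | 고등학생 | 중하
96 | 61186 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 여자 | 경기 | 고등학생 | (not shown)
97 | 61941 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 남자 | 경기 | 대학생 | (not shown)
98 | 62192 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 여자 | 서울 | 고등학생 | (not shown)
99 | 61733 | 1506 | 한중 청소년교류 | 중국 | 청소년 | 남자 | 전북 | 고등학생 | <NA> / <NA>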