Overview

Dataset statistics

Number of variables: 13
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.5 KiB
Average record size in memory: 107.3 B

Variable types

Numeric: 1
Categorical: 6
Boolean: 6

Alerts

examin_ym has constant value "202204" [Constant]
book_disc_dvd_prchs_at is highly imbalanced (63.4%) [Imbalance]
pblprfr_dspy_exprn_prchs_at is highly imbalanced (63.4%) [Imbalance]
game_prchs_at is highly imbalanced (67.3%) [Imbalance]
music_strmng_dwld_vch_prchs_at is highly imbalanced (80.6%) [Imbalance]
mvp_strmng_dwld_vch_prchs_at is highly imbalanced (85.9%) [Imbalance]
cltur_dgtl_cntnts_etc_prchs_at is highly imbalanced (85.9%) [Imbalance]
respond_id has unique values [Unique]
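
The imbalance percentages in these alerts are consistent with one minus the normalized Shannon entropy of the class counts. The sketch below is an assumption about how the score is derived, not a copy of the profiler's code; `imbalance_score` is a hypothetical helper, and the counts come from the Boolean variable summaries later in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# [False count, True count] as listed in the Boolean variable sections.
flag_counts = {
    "book_disc_dvd_prchs_at": [93, 7],
    "game_prchs_at": [94, 6],
    "music_strmng_dwld_vch_prchs_at": [97, 3],
    "mvp_strmng_dwld_vch_prchs_at": [98, 2],
}

scores = {name: round(100 * imbalance_score(c), 1) for name, c in flag_counts.items()}
for name, score in scores.items():
    print(f"{name}: {score}%")
```

Under this assumption a perfectly balanced flag scores 0% and a constant flag scores 100%, so the 93/7 split lands at 63.4% and the 98/2 split at 85.9%.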

Reproduction

Analysis started: 2023-12-10 10:11:24.915874
Analysis finished: 2023-12-10 10:11:27.089837
Duration: 2.17 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

respond_id
Real number (ℝ)

UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 103660.11
Minimum: 199
Maximum: 3242878
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 199
5-th percentile: 725.1
Q1: 3399.75
Median: 5873.5
Q3: 10576.25
95-th percentile: 14898.95
Maximum: 3242878
Range: 3242679
Interquartile range (IQR): 7176.5
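
The range and IQR follow directly from the quantiles listed above. As a small worked check, the sketch below recomputes both and adds Tukey's upper fence (Q3 + 1.5 × IQR). The fence is a common rule of thumb introduced here for illustration; the report itself does not use it.

```python
# Quantile statistics for respond_id, copied from the report.
minimum, q1, median, q3, maximum = 199, 3399.75, 5873.5, 10576.25, 3242878

value_range = maximum - minimum   # distance between the two extremes
iqr = q3 - q1                     # spread of the middle 50% of values
upper_fence = q3 + 1.5 * iqr      # Tukey's rule of thumb (an assumption, not in the report)

print("Range:", value_range)
print("IQR:", iqr)
print("Upper fence:", upper_fence)
print("Maximum beyond fence:", maximum > upper_fence)
```

The maximum lies far beyond the fence while the 95th percentile does not, which suggests that only a handful of very large identifiers stretch the range.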

Descriptive statistics

Standard deviation: 554781.24
Coefficient of variation (CV): 5.351926
Kurtosis: 29.89392
Mean: 103660.11
Median Absolute Deviation (MAD): 3239.5
Skewness: 5.5941265
Sum: 10366011
Variance: 3.0778222 × 10^11
Monotonicity: Not monotonic
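
Several of these figures are derived from one another. A minimal check, using only the rounded values printed in the report (so small rounding differences are expected):

```python
import math

# Values copied from the descriptive statistics above.
mean = 103660.11
std = 554781.24
total_sum = 10366011
n_observations = 100

cv = std / mean                         # coefficient of variation
variance = std ** 2                     # variance is the squared standard deviation
mean_from_sum = total_sum / n_observations

print(f"CV: {cv:.6f}")
print(f"Variance: {variance:.7e}")
print(f"Mean from sum: {mean_from_sum:.2f}")
```

A CV above 5 means the standard deviation is more than five times the mean, which fits the strong right skew reported for this column.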
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
199 | 1 | 1.0%
7094 | 1 | 1.0%
9592 | 1 | 1.0%
9495 | 1 | 1.0%
8817 | 1 | 1.0%
8717 | 1 | 1.0%
8671 | 1 | 1.0%
8573 | 1 | 1.0%
8370 | 1 | 1.0%
7351 | 1 | 1.0%
Other values (90) | 90 | 90.0%

Minimum 10 values

Value | Count | Frequency (%)
199 | 1 | 1.0%
354 | 1 | 1.0%
497 | 1 | 1.0%
547 | 1 | 1.0%
689 | 1 | 1.0%
727 | 1 | 1.0%
982 | 1 | 1.0%
988 | 1 | 1.0%
1267 | 1 | 1.0%
1467 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
3242878 | 1 | 1.0%
3242247 | 1 | 1.0%
3242000 | 1 | 1.0%
15661 | 1 | 1.0%
15392 | 1 | 1.0%
14873 | 1 | 1.0%
14840 | 1 | 1.0%
14550 | 1 | 1.0%
14227 | 1 | 1.0%
14152 | 1 | 1.0%
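
The three largest respond_id values (about 3.24 million each) sit far above the fourth largest (15661). As an illustrative sketch, not part of the profiler output, the arithmetic below shows how much of the reported mean those three values account for, using only the sum and the extreme values listed in this report.

```python
# Figures copied from the report: sum of respond_id, row count, three largest values.
total_sum = 10366011
n_observations = 100
largest_three = [3242878, 3242247, 3242000]

full_mean = total_sum / n_observations
rest_sum = total_sum - sum(largest_three)
rest_mean = rest_sum / (n_observations - len(largest_three))

print(f"Mean of all 100 values: {full_mean:.2f}")
print(f"Mean without the three largest: {rest_mean:.2f}")
```

Without those three values the mean falls to roughly 6586, much closer to the reported median of 5873.5.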

examin_ym
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

202204: 100

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 202204
2nd row: 202204
3rd row: 202204
4th row: 202204
5th row: 202204

Common Values

Value | Count | Frequency (%)
202204 | 100 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
202204 | 100 | 100.0%

sexdstn_flag_cd
Categorical

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

M: 60
F: 40

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: F
3rd row: M
4th row: M
5th row: M

Common Values

Value | Count | Frequency (%)
M | 60 | 60.0%
F | 40 | 40.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
m | 60 | 60.0%
f | 40 | 40.0%

agrde_flag_nm
Categorical

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

40대 (40s): 31
50대 (50s): 29
60대 (60s): 26
30대 (30s): 14

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 60대
2nd row: 40대
3rd row: 60대
4th row: 60대
5th row: 60대

Common Values

Value | Count | Frequency (%)
40대 | 31 | 31.0%
50대 | 29 | 29.0%
60대 | 26 | 26.0%
30대 | 14 | 14.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
40대 | 31 | 31.0%
50대 | 29 | 29.0%
60대 | 26 | 26.0%
30대 | 14 | 14.0%

answrr_oc_area_nm
Categorical

Distinct: 15
Distinct (%): 15.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

서울특별시: 32
경기도: 23
인천광역시: 12
부산광역시: 6
광주광역시: 5
Other values (10): 22

Length

Max length: 12
Median length: 5
Mean length: 4.56
Min length: 3

Unique

Unique: 3
Unique (%): 3.0%

Sample

1st row: 서울특별시
2nd row: 부산광역시
3rd row: 서울특별시
4th row: 전라남도
5th row: 충청북도

Common Values

Value | Count | Frequency (%)
서울특별시 | 32 | 32.0%
경기도 | 23 | 23.0%
인천광역시 | 12 | 12.0%
부산광역시 | 6 | 6.0%
광주광역시 | 5 | 5.0%
충청북도 | 4 | 4.0%
전라남도 | 3 | 3.0%
대구광역시 | 3 | 3.0%
대전광역시 | 3 | 3.0%
충청남도(세종시 포함) | 2 | 2.0%
Other values (5) | 7 | 7.0%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
서울특별시 | 32 | 31.4%
경기도 | 23 | 22.5%
인천광역시 | 12 | 11.8%
부산광역시 | 6 | 5.9%
광주광역시 | 5 | 4.9%
충청북도 | 4 | 3.9%
전라남도 | 3 | 2.9%
대구광역시 | 3 | 2.9%
대전광역시 | 3 | 2.9%
충청남도(세종시 | 2 | 2.0%
Other values (6) | 9 | 8.8%

hshld_income_dgree_nm
Categorical

Distinct: 5
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

300만원 미만 (under 3 million KRW): 27
300이상500만원 미만 (3 million to under 5 million KRW): 24
700만원 이상 (7 million KRW or more): 22
500이상700만원 미만 (5 million to under 7 million KRW): 20
무응답 (no response): 7

Length

Max length: 13
Median length: 8
Mean length: 9.85
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 300이상500만원 미만
2nd row: 700만원 이상
3rd row: 500이상700만원 미만
4th row: 500이상700만원 미만
5th row: 300이상500만원 미만

Common Values

Value | Count | Frequency (%)
300만원 미만 | 27 | 27.0%
300이상500만원 미만 | 24 | 24.0%
700만원 이상 | 22 | 22.0%
500이상700만원 미만 | 20 | 20.0%
무응답 | 7 | 7.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
미만 | 71 | 36.8%
300만원 | 27 | 14.0%
300이상500만원 | 24 | 12.4%
700만원 | 22 | 11.4%
이상 | 22 | 11.4%
500이상700만원 | 20 | 10.4%
무응답 | 7 | 3.6%
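
The last frequency table for hshld_income_dgree_nm (미만 71, 300만원 27, and so on) counts word tokens rather than whole category labels, which is why its percentages do not match the Common Values table. The sketch below assumes simple whitespace tokenization, which reproduces the reported counts; the profiler's actual tokenizer may differ.

```python
from collections import Counter

# Category labels and counts copied from the Common Values table.
label_counts = {
    "300만원 미만": 27,
    "300이상500만원 미만": 24,
    "700만원 이상": 22,
    "500이상700만원 미만": 20,
    "무응답": 7,
}

# Split each label on whitespace and credit every token with the label's count.
token_counts = Counter()
for label, count in label_counts.items():
    for token in label.split():
        token_counts[token] += count

total_tokens = sum(token_counts.values())
for token, count in token_counts.most_common():
    print(f"{token} | {count} | {100 * count / total_tokens:.1f}%")
```

Because three labels end in 미만, that token is counted 27 + 24 + 20 = 71 times, and the percentages are taken over 193 tokens rather than 100 rows.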

prchs_mth_nm
Categorical

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

오프라인 (offline): 53
온라인 (online): 47

Length

Max length: 4
Median length: 4
Mean length: 3.53
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 온라인
2nd row: 오프라인
3rd row: 온라인
4th row: 온라인
5th row: 온라인

Common Values

Value | Count | Frequency (%)
오프라인 | 53 | 53.0%
온라인 | 47 | 47.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
오프라인 | 53 | 53.0%
온라인 | 47 | 47.0%

book_disc_dvd_prchs_at
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 232.0 B
False: 93
True: 7

Value | Count | Frequency (%)
False | 93 | 93.0%
True | 7 | 7.0%

pblprfr_dspy_exprn_prchs_at
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 232.0 B
False: 93
True: 7

Value | Count | Frequency (%)
False | 93 | 93.0%
True | 7 | 7.0%

game_prchs_at
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 232.0 B
False: 94
True: 6

Value | Count | Frequency (%)
False | 94 | 94.0%
True | 6 | 6.0%

music_strmng_dwld_vch_prchs_at
Boolean

IMBALANCE

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 232.0 B
False: 97
True: 3

Value | Count | Frequency (%)
False | 97 | 97.0%
True | 3 | 3.0%

mvp_strmng_dwld_vch_prchs_at
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 232.0 B
False: 98
True: 2

Value | Count | Frequency (%)
False | 98 | 98.0%
True | 2 | 2.0%

cltur_dgtl_cntnts_etc_prchs_at
Boolean

IMBALANCE

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 232.0 B
False: 98
True: 2

Value | Count | Frequency (%)
False | 98 | 98.0%
True | 2 | 2.0%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

(variable) | respond_id | sexdstn_flag_cd | agrde_flag_nm | answrr_oc_area_nm | hshld_income_dgree_nm | prchs_mth_nm | book_disc_dvd_prchs_at | pblprfr_dspy_exprn_prchs_at | game_prchs_at | music_strmng_dwld_vch_prchs_at | mvp_strmng_dwld_vch_prchs_at | cltur_dgtl_cntnts_etc_prchs_at
respond_id | 1.000 | 0.000 | 0.131 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
sexdstn_flag_cd | 0.000 | 1.000 | 0.000 | 0.270 | 0.165 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
agrde_flag_nm | 0.131 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.184 | 0.000 | 0.092 | 0.077
answrr_oc_area_nm | 0.000 | 0.270 | 0.000 | 1.000 | 0.000 | 0.197 | 0.000 | 0.000 | 0.000 | 0.000 | 0.189 | 0.000
hshld_income_dgree_nm | 0.000 | 0.165 | 0.000 | 0.000 | 1.000 | 0.059 | 0.000 | 0.000 | 0.000 | 0.060 | 0.000 | 0.000
prchs_mth_nm | 0.000 | 0.000 | 0.000 | 0.197 | 0.059 | 1.000 | 0.222 | 0.222 | 0.000 | 0.126 | 0.000 | 0.000
book_disc_dvd_prchs_at | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.222 | 1.000 | 0.000 | 0.000 | 0.426 | 0.020 | 0.000
pblprfr_dspy_exprn_prchs_at | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.222 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.020
game_prchs_at | 0.000 | 0.000 | 0.184 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.087 | 0.000
music_strmng_dwld_vch_prchs_at | 0.000 | 0.000 | 0.000 | 0.000 | 0.060 | 0.126 | 0.426 | 0.000 | 0.000 | 1.000 | 0.242 | 0.242
mvp_strmng_dwld_vch_prchs_at | 0.000 | 0.000 | 0.092 | 0.189 | 0.000 | 0.000 | 0.020 | 0.000 | 0.087 | 0.242 | 1.000 | 0.000
cltur_dgtl_cntnts_etc_prchs_at | 0.000 | 0.000 | 0.077 | 0.000 | 0.000 | 0.000 | 0.000 | 0.020 | 0.000 | 0.242 | 0.000 | 1.000

[Figure: correlation heatmap]

(variable) | hshld_income_dgree_nm | game_prchs_at | music_strmng_dwld_vch_prchs_at | pblprfr_dspy_exprn_prchs_at | book_disc_dvd_prchs_at | cltur_dgtl_cntnts_etc_prchs_at | sexdstn_flag_cd | prchs_mth_nm | agrde_flag_nm | mvp_strmng_dwld_vch_prchs_at | answrr_oc_area_nm
hshld_income_dgree_nm | 1.000 | 0.000 | 0.070 | 0.000 | 0.000 | 0.000 | 0.197 | 0.068 | 0.000 | 0.000 | 0.000
game_prchs_at | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.119 | 0.055 | 0.000
music_strmng_dwld_vch_prchs_at | 0.070 | 0.000 | 1.000 | 0.000 | 0.280 | 0.155 | 0.000 | 0.080 | 0.000 | 0.155 | 0.000
pblprfr_dspy_exprn_prchs_at | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.008 | 0.000 | 0.142 | 0.000 | 0.000 | 0.000
book_disc_dvd_prchs_at | 0.000 | 0.000 | 0.280 | 0.000 | 1.000 | 0.000 | 0.000 | 0.142 | 0.000 | 0.008 | 0.000
cltur_dgtl_cntnts_etc_prchs_at | 0.000 | 0.000 | 0.155 | 0.008 | 0.000 | 1.000 | 0.000 | 0.000 | 0.047 | 0.000 | 0.000
sexdstn_flag_cd | 0.197 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.226
prchs_mth_nm | 0.068 | 0.000 | 0.080 | 0.142 | 0.142 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.163
agrde_flag_nm | 0.000 | 0.119 | 0.000 | 0.000 | 0.000 | 0.047 | 0.000 | 0.000 | 1.000 | 0.058 | 0.000
mvp_strmng_dwld_vch_prchs_at | 0.000 | 0.055 | 0.155 | 0.000 | 0.008 | 0.000 | 0.000 | 0.000 | 0.058 | 1.000 | 0.156
answrr_oc_area_nm | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.226 | 0.163 | 0.000 | 0.156 | 1.000

[Figure: correlation heatmap]

(variable) | respond_id | sexdstn_flag_cd | agrde_flag_nm | answrr_oc_area_nm | hshld_income_dgree_nm | prchs_mth_nm | book_disc_dvd_prchs_at | pblprfr_dspy_exprn_prchs_at | game_prchs_at | music_strmng_dwld_vch_prchs_at | mvp_strmng_dwld_vch_prchs_at | cltur_dgtl_cntnts_etc_prchs_at
respond_id | 1.000 | 0.000 | 0.088 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
sexdstn_flag_cd | 0.000 | 1.000 | 0.000 | 0.226 | 0.197 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
agrde_flag_nm | 0.088 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.119 | 0.000 | 0.058 | 0.047
answrr_oc_area_nm | 0.000 | 0.226 | 0.000 | 1.000 | 0.000 | 0.163 | 0.000 | 0.000 | 0.000 | 0.000 | 0.156 | 0.000
hshld_income_dgree_nm | 0.000 | 0.197 | 0.000 | 0.000 | 1.000 | 0.068 | 0.000 | 0.000 | 0.000 | 0.070 | 0.000 | 0.000
prchs_mth_nm | 0.000 | 0.000 | 0.000 | 0.163 | 0.068 | 1.000 | 0.142 | 0.142 | 0.000 | 0.080 | 0.000 | 0.000
book_disc_dvd_prchs_at | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.142 | 1.000 | 0.000 | 0.000 | 0.280 | 0.008 | 0.000
pblprfr_dspy_exprn_prchs_at | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.142 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.008
game_prchs_at | 0.000 | 0.000 | 0.119 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.055 | 0.000
music_strmng_dwld_vch_prchs_at | 0.000 | 0.000 | 0.000 | 0.000 | 0.070 | 0.080 | 0.280 | 0.000 | 0.000 | 1.000 | 0.155 | 0.155
mvp_strmng_dwld_vch_prchs_at | 0.000 | 0.000 | 0.058 | 0.156 | 0.000 | 0.000 | 0.008 | 0.000 | 0.055 | 0.155 | 1.000 | 0.000
cltur_dgtl_cntnts_etc_prchs_at | 0.000 | 0.000 | 0.047 | 0.000 | 0.000 | 0.000 | 0.000 | 0.008 | 0.000 | 0.155 | 0.000 | 1.000
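
Most entries in these matrices are zero, so the few non-zero pairs are easy to miss. The sketch below lists the non-zero off-diagonal values of the first correlation matrix and sorts them by strength. The report does not say which correlation method produced each matrix, so the values are treated here as unlabelled association scores; `pair_scores` is a name introduced only for this example.

```python
# Non-zero off-diagonal entries of the first correlation matrix (upper triangle only).
pair_scores = {
    ("respond_id", "agrde_flag_nm"): 0.131,
    ("sexdstn_flag_cd", "answrr_oc_area_nm"): 0.270,
    ("sexdstn_flag_cd", "hshld_income_dgree_nm"): 0.165,
    ("agrde_flag_nm", "game_prchs_at"): 0.184,
    ("agrde_flag_nm", "mvp_strmng_dwld_vch_prchs_at"): 0.092,
    ("agrde_flag_nm", "cltur_dgtl_cntnts_etc_prchs_at"): 0.077,
    ("answrr_oc_area_nm", "prchs_mth_nm"): 0.197,
    ("answrr_oc_area_nm", "mvp_strmng_dwld_vch_prchs_at"): 0.189,
    ("hshld_income_dgree_nm", "prchs_mth_nm"): 0.059,
    ("hshld_income_dgree_nm", "music_strmng_dwld_vch_prchs_at"): 0.060,
    ("prchs_mth_nm", "book_disc_dvd_prchs_at"): 0.222,
    ("prchs_mth_nm", "pblprfr_dspy_exprn_prchs_at"): 0.222,
    ("prchs_mth_nm", "music_strmng_dwld_vch_prchs_at"): 0.126,
    ("book_disc_dvd_prchs_at", "music_strmng_dwld_vch_prchs_at"): 0.426,
    ("book_disc_dvd_prchs_at", "mvp_strmng_dwld_vch_prchs_at"): 0.020,
    ("pblprfr_dspy_exprn_prchs_at", "cltur_dgtl_cntnts_etc_prchs_at"): 0.020,
    ("game_prchs_at", "mvp_strmng_dwld_vch_prchs_at"): 0.087,
    ("music_strmng_dwld_vch_prchs_at", "mvp_strmng_dwld_vch_prchs_at"): 0.242,
    ("music_strmng_dwld_vch_prchs_at", "cltur_dgtl_cntnts_etc_prchs_at"): 0.242,
}

# Sort pairs from strongest to weakest association.
ranked = sorted(pair_scores.items(), key=lambda item: item[1], reverse=True)
for (left, right), score in ranked[:5]:
    print(f"{score:.3f}  {left} ~ {right}")
```

With only 100 rows and several flags that are True in just 2 to 7 cases, even the strongest pair (0.426) rests on very few observations and should be read cautiously.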

Missing values

[Figure: nullity count chart] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

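First rows

(index) | respond_id | examin_ym | sexdstn_flag_cd | agrde_flag_nm | answrr_oc_area_nm | hshld_income_dgree_nm | prchs_mth_nm | book_disc_dvd_prchs_at | pblprfr_dspy_exprn_prchs_at | game_prchs_at | music_strmng_dwld_vch_prchs_at | mvp_strmng_dwld_vch_prchs_at | cltur_dgtl_cntnts_etc_prchs_at
0 | 199 | 202204 | F | 60대 | 서울특별시 | 300이상500만원 미만 | 온라인 | N | N | N | N | N | N
1 | 3242000 | 202204 | F | 40대 | 부산광역시 | 700만원 이상 | 오프라인 | N | N | N | N | N | N
2 | 354 | 202204 | M | 60대 | 서울특별시 | 500이상700만원 미만 | 온라인 | N | N | N | N | N | N
3 | 497 | 202204 | M | 60대 | 전라남도 | 500이상700만원 미만 | 온라인 | N | N | N | N | N | N
4 | 547 | 202204 | M | 60대 | 충청북도 | 300이상500만원 미만 | 온라인 | N | Y | N | N | N | N
5 | 689 | 202204 | M | 40대 | 경기도 | 500이상700만원 미만 | 온라인 | N | N | N | N | N | N
6 | 727 | 202204 | M | 50대 | 경기도 | 300이상500만원 미만 | 오프라인 | N | N | N | N | N | N
7 | 3242247 | 202204 | M | 30대 | 경기도 | 300만원 미만 | 온라인 | N | N | N | N | N | N
8 | 982 | 202204 | M | 50대 | 서울특별시 | 300만원 미만 | 온라인 | N | N | N | N | N | N
9 | 988 | 202204 | F | 50대 | 부산광역시 | 300만원 미만 | 오프라인 | N | N | N | N | N | N

Last rows

(index) | respond_id | examin_ym | sexdstn_flag_cd | agrde_flag_nm | answrr_oc_area_nm | hshld_income_dgree_nm | prchs_mth_nm | book_disc_dvd_prchs_at | pblprfr_dspy_exprn_prchs_at | game_prchs_at | music_strmng_dwld_vch_prchs_at | mvp_strmng_dwld_vch_prchs_at | cltur_dgtl_cntnts_etc_prchs_at
90 | 13801 | 202204 | M | 50대 | 경기도 | 300만원 미만 | 오프라인 | N | N | N | N | N | N
91 | 13810 | 202204 | F | 30대 | 경기도 | 700만원 이상 | 오프라인 | N | N | N | N | N | N
92 | 14108 | 202204 | M | 40대 | 인천광역시 | 300만원 미만 | 오프라인 | N | N | N | N | N | N
93 | 14152 | 202204 | M | 40대 | 경기도 | 700만원 이상 | 온라인 | N | N | N | N | N | N
94 | 14227 | 202204 | M | 60대 | 경기도 | 300이상500만원 미만 | 온라인 | N | N | N | N | N | N
95 | 14550 | 202204 | F | 60대 | 서울특별시 | 300만원 미만 | 오프라인 | N | N | N | N | N | N
96 | 14840 | 202204 | M | 50대 | 인천광역시 | 700만원 이상 | 오프라인 | N | N | N | N | N | N
97 | 14873 | 202204 | M | 40대 | 경기도 | 500이상700만원 미만 | 오프라인 | N | N | N | N | N | N
98 | 15392 | 202204 | F | 40대 | 광주광역시 | 300만원 미만 | 오프라인 | N | N | N | N | N | N
99 | 15661 | 202204 | M | 60대 | 대전광역시 | 300이상500만원 미만 | 온라인 | N | N | N | N | N | N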