Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory67.3 B

Variable types

Numeric1
Categorical7

Alerts

examin_ym has constant value ""Constant
dmstc_tour_ty_value is highly imbalanced (56.5%)Imbalance
ovsea_tour_ty_value is highly imbalanced (66.8%)Imbalance
respond_id has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:14:11.739055
Analysis finished2023-12-10 10:14:13.099477
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

respond_id
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean991335.72
Minimum1253
Maximum3238582
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:14:13.247647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1253
5-th percentile37587.55
Q1223451
median389829
Q32071792.5
95-th percentile3213792.4
Maximum3238582
Range3237329
Interquartile range (IQR)1848341.5

Descriptive statistics

Standard deviation1215367.5
Coefficient of variation (CV)1.2259898
Kurtosis-0.67705083
Mean991335.72
Median Absolute Deviation (MAD)169422
Skewness1.1160699
Sum99133572
Variance1.4771181 × 1012
MonotonicityNot monotonic
2023-12-10T19:14:13.603298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2784473 1
 
1.0%
414776 1
 
1.0%
411955 1
 
1.0%
7351 1
 
1.0%
3147880 1
 
1.0%
3042031 1
 
1.0%
394309 1
 
1.0%
37651 1
 
1.0%
3074011 1
 
1.0%
171294 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1253 1
1.0%
7351 1
1.0%
19786 1
1.0%
20232 1
1.0%
36382 1
1.0%
37651 1
1.0%
40348 1
1.0%
41300 1
1.0%
59654 1
1.0%
77675 1
1.0%
ValueCountFrequency (%)
3238582 1
1.0%
3237462 1
1.0%
3236126 1
1.0%
3232050 1
1.0%
3220488 1
1.0%
3213440 1
1.0%
3201603 1
1.0%
3167314 1
1.0%
3150958 1
1.0%
3147880 1
1.0%

examin_ym
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
202101
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202101
2nd row202101
3rd row202101
4th row202101
5th row202101

Common Values

ValueCountFrequency (%)
202101 100
100.0%

Length

2023-12-10T19:14:13.891653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:14.121754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202101 100
100.0%

sexdstn_flag_cd
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
M
55 
F
45 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowM
3rd rowF
4th rowF
5th rowF

Common Values

ValueCountFrequency (%)
M 55
55.0%
F 45
45.0%

Length

2023-12-10T19:14:14.372957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:14.568735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 55
55.0%
f 45
45.0%

agrde_flag_nm
Categorical

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
40대
30 
20대
25 
50대
20 
60대
14 
30대
11 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row50대
2nd row20대
3rd row40대
4th row30대
5th row60대

Common Values

ValueCountFrequency (%)
40대 30
30.0%
20대 25
25.0%
50대 20
20.0%
60대 14
14.0%
30대 11
 
11.0%

Length

2023-12-10T19:14:14.767079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:14.957046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
40대 30
30.0%
20대 25
25.0%
50대 20
20.0%
60대 14
14.0%
30대 11
 
11.0%
Distinct16
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
21 
경기도
15 
부산광역시
11 
대전광역시
10 
대구광역시
Other values (11)
35 

Length

Max length12
Median length5
Mean length4.65
Min length3

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row충청북도
2nd row충청남도(세종시 포함)
3rd row전라북도
4th row대구광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
서울특별시 21
21.0%
경기도 15
15.0%
부산광역시 11
11.0%
대전광역시 10
10.0%
대구광역시 8
 
8.0%
광주광역시 5
 
5.0%
제주도 5
 
5.0%
충청북도 4
 
4.0%
경상북도 4
 
4.0%
인천광역시 4
 
4.0%
Other values (6) 13
13.0%

Length

2023-12-10T19:14:15.165093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 21
20.4%
경기도 15
14.6%
부산광역시 11
10.7%
대전광역시 10
9.7%
대구광역시 8
 
7.8%
광주광역시 5
 
4.9%
제주도 5
 
4.9%
인천광역시 4
 
3.9%
경상북도 4
 
3.9%
충청북도 4
 
3.9%
Other values (7) 16
15.5%
Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
700만원 이상
34 
500이상700만원 미만
25 
300이상500만원 미만
20 
300만원 미만
13 
모름

Length

Max length13
Median length8
Mean length9.77
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row500이상700만원 미만
2nd row모름
3rd row500이상700만원 미만
4th row500이상700만원 미만
5th row300이상500만원 미만

Common Values

ValueCountFrequency (%)
700만원 이상 34
34.0%
500이상700만원 미만 25
25.0%
300이상500만원 미만 20
20.0%
300만원 미만 13
 
13.0%
모름 8
 
8.0%

Length

2023-12-10T19:14:15.368360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:15.620056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미만 58
30.2%
700만원 34
17.7%
이상 34
17.7%
500이상700만원 25
13.0%
300이상500만원 20
 
10.4%
300만원 13
 
6.8%
모름 8
 
4.2%

dmstc_tour_ty_value
Categorical

IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
개별 여행
86 
에어텔 또는 에어카텔 패키지 여행
11 
단체 패키지 여행
 
3

Length

Max length18
Median length5
Mean length6.55
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row에어텔 또는 에어카텔 패키지 여행
2nd row개별 여행
3rd row개별 여행
4th row개별 여행
5th row개별 여행

Common Values

ValueCountFrequency (%)
개별 여행 86
86.0%
에어텔 또는 에어카텔 패키지 여행 11
 
11.0%
단체 패키지 여행 3
 
3.0%

Length

2023-12-10T19:14:15.825809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:16.030538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여행 100
42.4%
개별 86
36.4%
패키지 14
 
5.9%
에어텔 11
 
4.7%
또는 11
 
4.7%
에어카텔 11
 
4.7%
단체 3
 
1.3%

ovsea_tour_ty_value
Categorical

IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
여행경험없음
91 
개별 여행
 
5
단체 패키지 여행
 
4

Length

Max length9
Median length6
Mean length6.07
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여행경험없음
2nd row여행경험없음
3rd row여행경험없음
4th row여행경험없음
5th row여행경험없음

Common Values

ValueCountFrequency (%)
여행경험없음 91
91.0%
개별 여행 5
 
5.0%
단체 패키지 여행 4
 
4.0%

Length

2023-12-10T19:14:16.253098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:16.439400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여행경험없음 91
80.5%
여행 9
 
8.0%
개별 5
 
4.4%
단체 4
 
3.5%
패키지 4
 
3.5%

Interactions

2023-12-10T19:14:12.489310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:14:16.553389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
respond_idsexdstn_flag_cdagrde_flag_nmanswrr_oc_area_nmhshld_income_dgree_nmdmstc_tour_ty_valueovsea_tour_ty_value
respond_id1.0000.1740.2350.0000.2530.2780.000
sexdstn_flag_cd0.1741.0000.0000.0000.1240.0000.151
agrde_flag_nm0.2350.0001.0000.3970.4950.2010.000
answrr_oc_area_nm0.0000.0000.3971.0000.0000.0000.000
hshld_income_dgree_nm0.2530.1240.4950.0001.0000.1340.221
dmstc_tour_ty_value0.2780.0000.2010.0000.1341.0000.320
ovsea_tour_ty_value0.0000.1510.0000.0000.2210.3201.000
2023-12-10T19:14:16.732327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
sexdstn_flag_cdanswrr_oc_area_nmovsea_tour_ty_valueagrde_flag_nmdmstc_tour_ty_valuehshld_income_dgree_nm
sexdstn_flag_cd1.0000.0000.2480.0000.0000.148
answrr_oc_area_nm0.0001.0000.0000.1980.0000.000
ovsea_tour_ty_value0.2480.0001.0000.0000.1080.167
agrde_flag_nm0.0000.1980.0001.0000.1510.203
dmstc_tour_ty_value0.0000.0000.1080.1511.0000.098
hshld_income_dgree_nm0.1480.0000.1670.2030.0981.000
2023-12-10T19:14:16.931921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
respond_idsexdstn_flag_cdagrde_flag_nmanswrr_oc_area_nmhshld_income_dgree_nmdmstc_tour_ty_valueovsea_tour_ty_value
respond_id1.0000.1340.1650.0000.1720.1180.000
sexdstn_flag_cd0.1341.0000.0000.0000.1480.0000.248
agrde_flag_nm0.1650.0001.0000.1980.2030.1510.000
answrr_oc_area_nm0.0000.0000.1981.0000.0000.0000.000
hshld_income_dgree_nm0.1720.1480.2030.0001.0000.0980.167
dmstc_tour_ty_value0.1180.0000.1510.0000.0981.0000.108
ovsea_tour_ty_value0.0000.2480.0000.0000.1670.1081.000

Missing values

2023-12-10T19:14:12.752927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:14:12.994580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

respond_idexamin_ymsexdstn_flag_cdagrde_flag_nmanswrr_oc_area_nmhshld_income_dgree_nmdmstc_tour_ty_valueovsea_tour_ty_value
02784473202101M50대충청북도500이상700만원 미만에어텔 또는 에어카텔 패키지 여행여행경험없음
13220488202101M20대충청남도(세종시 포함)모름개별 여행여행경험없음
22968180202101F40대전라북도500이상700만원 미만개별 여행여행경험없음
3196273202101F30대대구광역시500이상700만원 미만개별 여행여행경험없음
41922037202101F60대부산광역시300이상500만원 미만개별 여행여행경험없음
589713202101F50대경상남도300이상500만원 미만에어텔 또는 에어카텔 패키지 여행여행경험없음
6165231202101F20대충청남도(세종시 포함)모름개별 여행여행경험없음
73232050202101F20대전라남도500이상700만원 미만개별 여행여행경험없음
82928987202101F40대부산광역시500이상700만원 미만개별 여행여행경험없음
92900735202101F20대서울특별시500이상700만원 미만개별 여행여행경험없음
respond_idexamin_ymsexdstn_flag_cdagrde_flag_nmanswrr_oc_area_nmhshld_income_dgree_nmdmstc_tour_ty_valueovsea_tour_ty_value
902953788202101F20대울산광역시300이상500만원 미만개별 여행여행경험없음
91151399202101M50대울산광역시700만원 이상개별 여행여행경험없음
92310220202101F50대서울특별시300이상500만원 미만개별 여행여행경험없음
93338915202101F40대서울특별시700만원 이상개별 여행여행경험없음
94302216202101F40대경상북도300만원 미만단체 패키지 여행여행경험없음
95365269202101F20대제주도300이상500만원 미만개별 여행여행경험없음
961253202101F50대서울특별시500이상700만원 미만개별 여행여행경험없음
9719786202101F60대충청남도(세종시 포함)300만원 미만개별 여행여행경험없음
98189449202101F60대경기도700만원 이상개별 여행여행경험없음
99202440202101M50대광주광역시700만원 이상개별 여행여행경험없음