Overview

Dataset statistics

Number of variables: 7
Number of observations: 26
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 66.1 B

Variable types

Categorical: 1
Text: 1
Numeric: 5

Dataset

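Description: 2014 Busan Metropolitan City Gangseo-gu Social Survey results (frequency of leisure activities with family) [2014년부산광역시강서구사회조사결과(가족들과의여가횟수)]
Author: 부산광역시 강서구 (Gangseo-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3045858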

Alerts

High correlation: 1~2회 정도 is highly overall correlated with 3~4회 정도 and 3 other fields
High correlation: 3~4회 정도 is highly overall correlated with 1~2회 정도 and 3 other fields
High correlation: 5회 이상 is highly overall correlated with 1~2회 정도 and 3 other fields
High correlation: 거의 안한다 (2~3달에 1회) is highly overall correlated with 1~2회 정도 and 3 other fields
High correlation: 전혀 안한다 is highly overall correlated with 1~2회 정도 and 3 other fields
Unique: 항목 has unique values
Unique: 전혀 안한다 has unique values

Reproduction

Analysis started: 2023-12-10 17:13:59.513019
Analysis finished: 2023-12-10 17:14:06.350674
Duration: 6.84 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
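
The reported duration follows directly from the two timestamps. A minimal sketch using only the Python standard library (the variable names are illustrative, not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of the report.
started = datetime.fromisoformat("2023-12-10 17:13:59.513019")
finished = datetime.fromisoformat("2023-12-10 17:14:06.350674")

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```

Rounded to two decimals, this agrees with the Duration line above.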

Variables

구분 [category]
Categorical

Distinct: 6
Distinct (%): 23.1%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 5
Median length: 2
Mean length: 3.1153846
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 성별
2nd row: 성별
3rd row: 연령
4th row: 연령
5th row: 연령

Common Values

Value | Count | Frequency (%)
월가구소득 [monthly household income] | 7 | 26.9%
연령 [age] | 6 | 23.1%
직업 [occupation] | 5 | 19.2%
교육수준 [education level] | 4 | 15.4%
성별 [sex] | 2 | 7.7%
구역 [zone] | 2 | 7.7%
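
The length statistics for 구분 can be cross-checked against the Common Values counts. The sketch below is a standard-library illustration, not the profiler's own code; it assumes length means the number of Unicode code points in each label:

```python
from statistics import mean, median

# Category counts copied from the Common Values table for 구분.
counts = {"월가구소득": 7, "연령": 6, "직업": 5, "교육수준": 4, "성별": 2, "구역": 2}

# One length per observation: repeat each label's length by its count.
lengths = [len(label) for label, n in counts.items() for _ in range(n)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```

Under that assumption the four values match the Length block reported for this variable.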

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 구분]

항목 [item]
Text

UNIQUE

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 14
Median length: 7
Mean length: 6.1923077
Min length: 1

Characters and Unicode

Total characters: 161
Distinct characters: 48
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
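
As an illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module on two 항목 values that appear in the report's sample rows. It is not the profiler's implementation, and it covers only these two strings rather than the full column:

```python
import unicodedata
from collections import Counter

# Two 항목 values taken from the report's sample rows.
values = ["15-19세", "100 ~ 200만원 미만"]

# Count the Unicode general category of every character.
# Lo = Other Letter, Nd = Decimal Number, Zs = Space Separator,
# Pd = Dash Punctuation, Sm = Math Symbol.
category_counts = Counter(
    unicodedata.category(ch) for value in values for ch in value
)
print(category_counts)
```

The five category codes that occur here correspond to the five categories listed under "Most occurring categories".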

Unique

Unique: 26
Unique (%): 100.0%

Sample

1st row:
2nd row:
3rd row: 15-19세
4th row: 20-29세
5th row: 30-39세

Words

Value | Count | Frequency (%)
미만 | 6 | 14.0%
 | 5 | 11.6%
 | 1 | 2.3%
300 | 1 | 2.3%
100 | 1 | 2.3%
200만원 | 1 | 2.3%
200 | 1 | 2.3%
300만원 | 1 | 2.3%
400만원 | 1 | 2.3%
 | 1 | 2.3%
Other values (24) | 24 | 55.8%

Most occurring characters

Value | Count | Frequency (%)
0 | 29 | 18.0%
(space) | 17 | 10.6%
 | 13 | 8.1%
 | 7 | 4.3%
 | 6 | 3.7%
 | 6 | 3.7%
5 | 5 | 3.1%
9 | 5 | 3.1%
- | 5 | 3.1%
~ | 5 | 3.1%
Other values (38) | 63 | 39.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 76 | 47.2%
Decimal Number | 58 | 36.0%
Space Separator | 17 | 10.6%
Dash Punctuation | 5 | 3.1%
Math Symbol | 5 | 3.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 13 | 17.1%
 | 7 | 9.2%
 | 6 | 7.9%
 | 6 | 7.9%
 | 4 | 5.3%
 | 4 | 5.3%
 | 3 | 3.9%
 | 2 | 2.6%
 | 2 | 2.6%
 | 2 | 2.6%
Other values (27) | 27 | 35.5%

Decimal Number
Value | Count | Frequency (%)
0 | 29 | 50.0%
5 | 5 | 8.6%
9 | 5 | 8.6%
2 | 4 | 6.9%
1 | 4 | 6.9%
3 | 4 | 6.9%
4 | 4 | 6.9%
6 | 3 | 5.2%

Space Separator
Value | Count | Frequency (%)
(space) | 17 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 5 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 85 | 52.8%
Hangul | 76 | 47.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 13 | 17.1%
 | 7 | 9.2%
 | 6 | 7.9%
 | 6 | 7.9%
 | 4 | 5.3%
 | 4 | 5.3%
 | 3 | 3.9%
 | 2 | 2.6%
 | 2 | 2.6%
 | 2 | 2.6%
Other values (27) | 27 | 35.5%

Common
Value | Count | Frequency (%)
0 | 29 | 34.1%
(space) | 17 | 20.0%
5 | 5 | 5.9%
9 | 5 | 5.9%
- | 5 | 5.9%
~ | 5 | 5.9%
2 | 4 | 4.7%
1 | 4 | 4.7%
3 | 4 | 4.7%
4 | 4 | 4.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 85 | 52.8%
Hangul | 76 | 47.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 29 | 34.1%
(space) | 17 | 20.0%
5 | 5 | 5.9%
9 | 5 | 5.9%
- | 5 | 5.9%
~ | 5 | 5.9%
2 | 4 | 4.7%
1 | 4 | 4.7%
3 | 4 | 4.7%
4 | 4 | 4.7%

Hangul
Value | Count | Frequency (%)
 | 13 | 17.1%
 | 7 | 9.2%
 | 6 | 7.9%
 | 6 | 7.9%
 | 4 | 5.3%
 | 4 | 5.3%
 | 3 | 3.9%
 | 2 | 2.6%
 | 2 | 2.6%
 | 2 | 2.6%
Other values (27) | 27 | 35.5%

1~2회 정도 [about 1-2 times]
Real number (ℝ)

HIGH CORRELATION

Distinct: 24
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 44.592308
Minimum: 15.1
Maximum: 58.6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 15.1
5-th percentile: 18.975
Q1: 40.45
Median: 47.2
Q3: 53.4
95-th percentile: 56.275
Maximum: 58.6
Range: 43.5
Interquartile range (IQR): 12.95

Descriptive statistics

Standard deviation: 11.67157
Coefficient of variation (CV): 0.26173953
Kurtosis: 1.1397236
Mean: 44.592308
Median Absolute Deviation (MAD): 6.5
Skewness: -1.3072428
Sum: 1159.4
Variance: 136.22554
Monotonicity: Not monotonic
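
Several of these statistics are simple functions of the others, which gives a quick consistency check. The sketch below uses the standard definitions (CV as standard deviation divided by mean, variance as squared standard deviation, IQR as Q3 minus Q1); that the profiler uses exactly these definitions is an assumption, and the variable names are illustrative:

```python
# Values copied from the statistics reported for 1~2회 정도.
mean = 44.592308
std = 11.67157
minimum, q1, q3, maximum = 15.1, 40.45, 53.4, 58.6

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range

print(round(cv, 4), round(variance, 2), round(value_range, 1), round(iqr, 2))
```

The derived values agree with the reported CV, variance, range and IQR to the precision shown in the report.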
[Figure: Histogram with fixed size bins (bins=24)]

Common values

Value | Count | Frequency (%)
54.6 | 2 | 7.7%
53.7 | 2 | 7.7%
46.9 | 1 | 3.8%
49.3 | 1 | 3.8%
39.2 | 1 | 3.8%
46.1 | 1 | 3.8%
58.6 | 1 | 3.8%
56.5 | 1 | 3.8%
48.4 | 1 | 3.8%
30.2 | 1 | 3.8%
Other values (14) | 14 | 53.8%

Minimum 10 values

Value | Count | Frequency (%)
15.1 | 1 | 3.8%
16.8 | 1 | 3.8%
25.5 | 1 | 3.8%
30.2 | 1 | 3.8%
35.5 | 1 | 3.8%
36.9 | 1 | 3.8%
39.2 | 1 | 3.8%
44.2 | 1 | 3.8%
44.8 | 1 | 3.8%
45.1 | 1 | 3.8%

Maximum 10 values

Value | Count | Frequency (%)
58.6 | 1 | 3.8%
56.5 | 1 | 3.8%
55.6 | 1 | 3.8%
54.6 | 2 | 7.7%
53.7 | 2 | 7.7%
52.5 | 1 | 3.8%
51.3 | 1 | 3.8%
50.1 | 1 | 3.8%
49.3 | 1 | 3.8%
48.4 | 1 | 3.8%

3~4회 정도 [about 3-4 times]
Real number (ℝ)

HIGH CORRELATION

Distinct: 25
Distinct (%): 96.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.011538
Minimum: 2
Maximum: 27.3
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 2
5-th percentile: 3.075
Q1: 6.525
Median: 12.4
Q3: 17.65
95-th percentile: 24.875
Maximum: 27.3
Range: 25.3
Interquartile range (IQR): 11.125

Descriptive statistics

Standard deviation: 7.536701
Coefficient of variation (CV): 0.57923212
Kurtosis: -1.0414884
Mean: 13.011538
Median Absolute Deviation (MAD): 5.8
Skewness: 0.26322148
Sum: 338.3
Variance: 56.801862
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=25)]

Common values

Value | Count | Frequency (%)
4.0 | 2 | 7.7%
12.2 | 1 | 3.8%
16.4 | 1 | 3.8%
17.7 | 1 | 3.8%
6.1 | 1 | 3.8%
27.3 | 1 | 3.8%
20.7 | 1 | 3.8%
12.0 | 1 | 3.8%
16.6 | 1 | 3.8%
14.0 | 1 | 3.8%
Other values (15) | 15 | 57.7%

Minimum 10 values

Value | Count | Frequency (%)
2.0 | 1 | 3.8%
2.8 | 1 | 3.8%
3.9 | 1 | 3.8%
4.0 | 2 | 7.7%
5.8 | 1 | 3.8%
6.1 | 1 | 3.8%
7.8 | 1 | 3.8%
8.0 | 1 | 3.8%
8.6 | 1 | 3.8%
10.2 | 1 | 3.8%

Maximum 10 values

Value | Count | Frequency (%)
27.3 | 1 | 3.8%
25.1 | 1 | 3.8%
24.2 | 1 | 3.8%
22.7 | 1 | 3.8%
21.3 | 1 | 3.8%
20.7 | 1 | 3.8%
17.7 | 1 | 3.8%
17.5 | 1 | 3.8%
16.6 | 1 | 3.8%
16.4 | 1 | 3.8%

5회 이상 [5 or more times]
Real number (ℝ)

HIGH CORRELATION

Distinct: 22
Distinct (%): 84.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.5615385
Minimum: 0.3
Maximum: 7.7
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 0.3
5-th percentile: 0.55
Q1: 2.1
Median: 3.6
Q3: 4.825
95-th percentile: 6.65
Maximum: 7.7
Range: 7.4
Interquartile range (IQR): 2.725

Descriptive statistics

Standard deviation: 1.9450608
Coefficient of variation (CV): 0.54612938
Kurtosis: -0.53431881
Mean: 3.5615385
Median Absolute Deviation (MAD): 1.45
Skewness: 0.19890248
Sum: 92.6
Variance: 3.7832615
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=22)]

Common values

Value | Count | Frequency (%)
4.6 | 2 | 7.7%
2.9 | 2 | 7.7%
2.1 | 2 | 7.7%
1.8 | 2 | 7.7%
2.8 | 1 | 3.8%
2.7 | 1 | 3.8%
5.6 | 1 | 3.8%
1.0 | 1 | 3.8%
6.8 | 1 | 3.8%
7.7 | 1 | 3.8%
Other values (12) | 12 | 46.2%

Minimum 10 values

Value | Count | Frequency (%)
0.3 | 1 | 3.8%
0.4 | 1 | 3.8%
1.0 | 1 | 3.8%
1.3 | 1 | 3.8%
1.8 | 2 | 7.7%
2.1 | 2 | 7.7%
2.7 | 1 | 3.8%
2.8 | 1 | 3.8%
2.9 | 2 | 7.7%
3.5 | 1 | 3.8%

Maximum 10 values

Value | Count | Frequency (%)
7.7 | 1 | 3.8%
6.8 | 1 | 3.8%
6.2 | 1 | 3.8%
5.6 | 1 | 3.8%
5.5 | 1 | 3.8%
5.0 | 1 | 3.8%
4.9 | 1 | 3.8%
4.6 | 2 | 7.7%
4.3 | 1 | 3.8%
4.2 | 1 | 3.8%

거의 안한다 (2~3달에 1회) [rarely (once every 2-3 months)]
Real number (ℝ)

HIGH CORRELATION

Distinct: 24
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.003846
Minimum: 10.6
Maximum: 35.9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 10.6
5-th percentile: 12.4
Q1: 16.825
Median: 20.5
Q3: 24.275
95-th percentile: 32.35
Maximum: 35.9
Range: 25.3
Interquartile range (IQR): 7.45

Descriptive statistics

Standard deviation: 6.2106026
Coefficient of variation (CV): 0.29568883
Kurtosis: 0.25858709
Mean: 21.003846
Median Absolute Deviation (MAD): 3.8
Skewness: 0.58225188
Sum: 546.1
Variance: 38.571585
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=24)]

Common values

Value | Count | Frequency (%)
12.4 | 2 | 7.7%
15.5 | 2 | 7.7%
21.6 | 1 | 3.8%
35.9 | 1 | 3.8%
27.2 | 1 | 3.8%
10.6 | 1 | 3.8%
18.2 | 1 | 3.8%
21.2 | 1 | 3.8%
14.9 | 1 | 3.8%
20.1 | 1 | 3.8%
Other values (14) | 14 | 53.8%

Minimum 10 values

Value | Count | Frequency (%)
10.6 | 1 | 3.8%
12.4 | 2 | 7.7%
14.9 | 1 | 3.8%
15.5 | 2 | 7.7%
16.7 | 1 | 3.8%
17.2 | 1 | 3.8%
18.2 | 1 | 3.8%
19.0 | 1 | 3.8%
19.1 | 1 | 3.8%
20.0 | 1 | 3.8%

Maximum 10 values

Value | Count | Frequency (%)
35.9 | 1 | 3.8%
33.3 | 1 | 3.8%
29.5 | 1 | 3.8%
27.2 | 1 | 3.8%
26.2 | 1 | 3.8%
26.0 | 1 | 3.8%
24.3 | 1 | 3.8%
24.2 | 1 | 3.8%
22.5 | 1 | 3.8%
21.7 | 1 | 3.8%

전혀 안한다 [never]
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.846154
Minimum: 0.6
Maximum: 61.6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 0.6
5-th percentile: 2.825
Q1: 7.025
Median: 13.5
Q3: 22.3
95-th percentile: 51.775
Maximum: 61.6
Range: 61
Interquartile range (IQR): 15.275

Descriptive statistics

Standard deviation: 15.663875
Coefficient of variation (CV): 0.87771714
Kurtosis: 1.9202638
Mean: 17.846154
Median Absolute Deviation (MAD): 7.3
Skewness: 1.4821601
Sum: 464
Variance: 245.35698
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=26)]

Common values

Value | Count | Frequency (%)
18.3 | 1 | 3.8%
13.3 | 1 | 3.8%
12.0 | 1 | 3.8%
26.5 | 1 | 3.8%
0.6 | 1 | 3.8%
7.3 | 1 | 3.8%
4.5 | 1 | 3.8%
7.1 | 1 | 3.8%
14.6 | 1 | 3.8%
30.9 | 1 | 3.8%
Other values (16) | 16 | 61.5%

Minimum 10 values

Value | Count | Frequency (%)
0.6 | 1 | 3.8%
2.4 | 1 | 3.8%
4.1 | 1 | 3.8%
4.5 | 1 | 3.8%
4.6 | 1 | 3.8%
6.2 | 1 | 3.8%
7.0 | 1 | 3.8%
7.1 | 1 | 3.8%
7.3 | 1 | 3.8%
7.9 | 1 | 3.8%

Maximum 10 values

Value | Count | Frequency (%)
61.6 | 1 | 3.8%
54.9 | 1 | 3.8%
42.4 | 1 | 3.8%
30.9 | 1 | 3.8%
30.7 | 1 | 3.8%
26.5 | 1 | 3.8%
22.8 | 1 | 3.8%
20.8 | 1 | 3.8%
20.3 | 1 | 3.8%
18.9 | 1 | 3.8%

Interactions

[Figures: 25 interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 구분 | 항목 | 1~2회 정도 | 3~4회 정도 | 5회 이상 | 거의 안한다 (2~3달에 1회) | 전혀 안한다
구분 | 1.000 | 1.000 | 0.550 | 0.369 | 0.000 | 0.000 | 0.000
항목 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
1~2회 정도 | 0.550 | 1.000 | 1.000 | 0.535 | 0.419 | 0.602 | 0.798
3~4회 정도 | 0.369 | 1.000 | 0.535 | 1.000 | 0.851 | 0.677 | 0.565
5회 이상 | 0.000 | 1.000 | 0.419 | 0.851 | 1.000 | 0.731 | 0.580
거의 안한다 (2~3달에 1회) | 0.000 | 1.000 | 0.602 | 0.677 | 0.731 | 1.000 | 0.605
전혀 안한다 | 0.000 | 1.000 | 0.798 | 0.565 | 0.580 | 0.605 | 1.000

[Figure: correlation heatmap]

Variable | 1~2회 정도 | 3~4회 정도 | 5회 이상 | 거의 안한다 (2~3달에 1회) | 전혀 안한다 | 구분
1~2회 정도 | 1.000 | 0.780 | 0.694 | -0.719 | -0.901 | 0.261
3~4회 정도 | 0.780 | 1.000 | 0.878 | -0.883 | -0.913 | 0.129
5회 이상 | 0.694 | 0.878 | 1.000 | -0.778 | -0.829 | 0.000
거의 안한다 (2~3달에 1회) | -0.719 | -0.883 | -0.778 | 1.000 | 0.766 | 0.000
전혀 안한다 | -0.901 | -0.913 | -0.829 | 0.766 | 1.000 | 0.078
구분 | 0.261 | 0.129 | 0.000 | 0.000 | 0.078 | 1.000
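
To show how a signed coefficient like those in the second matrix can arise, here is a minimal standard-library sketch of a Spearman rank correlation between 1~2회 정도 and 전혀 안한다. Two assumptions apply: the report text does not name the coefficient it used, so Spearman is chosen purely for illustration, and only the 20 rows visible in the Sample section are used, so the result is not expected to reproduce the reported value exactly. The function names are hypothetical.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = shared_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Rows 0-9 and 16-25 from the Sample section (the other six rows are not shown).
once_or_twice = [45.1, 44.2, 46.7, 55.6, 52.5, 53.7, 44.8, 25.5, 16.8, 35.5,
                 47.5, 15.1, 30.2, 48.4, 56.5, 58.6, 46.1, 54.6, 39.2, 49.3]
never = [18.3, 18.9, 10.6, 7.9, 6.2, 7.0, 20.8, 42.4, 54.9, 30.7,
         20.3, 61.6, 30.9, 14.6, 7.1, 4.5, 7.3, 0.6, 26.5, 12.0]

rho = spearman(once_or_twice, never)
print(f"Spearman rho on the 20 visible rows: {rho:.3f}")
```

On this subset the coefficient is strongly negative, the same direction as the entry for this pair in the second matrix.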

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

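First rows

Index | 구분 | 항목 | 1~2회 정도 | 3~4회 정도 | 5회 이상 | 거의 안한다 (2~3달에 1회) | 전혀 안한다
0 | 성별 |  | 45.1 | 12.2 | 2.8 | 21.6 | 18.3
1 | 성별 |  | 44.2 | 12.6 | 4.2 | 20.0 | 18.9
2 | 연령 | 15-19세 | 46.7 | 8.6 | 4.6 | 29.5 | 10.6
3 | 연령 | 20-29세 | 55.6 | 14.8 | 4.6 | 17.2 | 7.9
4 | 연령 | 30-39세 | 52.5 | 22.7 | 6.2 | 12.4 | 6.2
5 | 연령 | 40-49세 | 53.7 | 17.5 | 2.9 | 19.0 | 7.0
6 | 연령 | 50-59세 | 44.8 | 8.0 | 2.1 | 24.3 | 20.8
7 | 연령 | 60세 이상 | 25.5 | 4.0 | 2.1 | 26.0 | 42.4
8 | 교육수준 | 초졸이하 | 16.8 | 2.8 | 1.3 | 24.2 | 54.9
9 | 교육수준 | 중졸 | 35.5 | 5.8 | 1.8 | 26.2 | 30.7

Last rows

Index | 구분 | 항목 | 1~2회 정도 | 3~4회 정도 | 5회 이상 | 거의 안한다 (2~3달에 1회) | 전혀 안한다
16 | 직업 | 기능노무 | 47.5 | 7.8 | 2.7 | 21.7 | 20.3
17 | 월가구소득 | 100만원 미만 | 15.1 | 2.0 | 0.3 | 20.9 | 61.6
18 | 월가구소득 | 100 ~ 200만원 미만 | 30.2 | 3.9 | 1.8 | 33.3 | 30.9
19 | 월가구소득 | 200 ~ 300만원 미만 | 48.4 | 14.0 | 2.9 | 20.1 | 14.6
20 | 월가구소득 | 300 ~ 400만원 미만 | 56.5 | 16.6 | 4.9 | 14.9 | 7.1
21 | 월가구소득 | 400 ~ 500만원 미만 | 58.6 | 12.0 | 3.7 | 21.2 | 4.5
22 | 월가구소득 | 500 ~ 600만원 미만 | 46.1 | 20.7 | 7.7 | 18.2 | 7.3
23 | 월가구소득 | 600만원이상 | 54.6 | 27.3 | 6.8 | 10.6 | 0.6
24 | 구역 | 일반구역 | 39.2 | 6.1 | 1.0 | 27.2 | 26.5
25 | 구역 | 개발구역 | 49.3 | 17.7 | 5.6 | 15.5 | 12.0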
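
The five numeric columns read like percentage shares of respondents per row. That is an interpretation, not something the report states, but it can be checked by summing a few of the sample rows. A small sketch with illustrative names:

```python
# Five response shares (%) per row, copied from the sample rows above.
# Column order: 1~2회 정도, 3~4회 정도, 5회 이상, 거의 안한다 (2~3달에 1회), 전혀 안한다.
sample_rows = {
    "15-19세": [46.7, 8.6, 4.6, 29.5, 10.6],
    "60세 이상": [25.5, 4.0, 2.1, 26.0, 42.4],
    "100만원 미만": [15.1, 2.0, 0.3, 20.9, 61.6],
    "600만원이상": [54.6, 27.3, 6.8, 10.6, 0.6],
    "개발구역": [49.3, 17.7, 5.6, 15.5, 12.0],
}

# Sum each row; rounding to one decimal matches the precision of the data.
row_totals = {label: round(sum(shares), 1) for label, shares in sample_rows.items()}
for label, total in row_totals.items():
    print(label, total)
```

Each total lands within rounding distance of 100. If the columns are indeed shares of a whole, a rise in one must be offset by falls in others, which would be consistent with the strong negative coefficients in the Correlations section.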