Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Categorical3
Numeric4

Dataset

DescriptionSample
Author주식회사 여기어때컴퍼니
URLhttps://www.bigdata-finance.kr/dataset/datasetView.do?datastId=SET0400006

Alerts

기준년월 has constant value ""Constant
서비스이용일자 has constant value ""Constant
대상기준년월 has constant value ""Constant
서비스이용시간값 has 578 (5.8%) zerosZeros

Reproduction

Analysis started2023-12-10 13:06:28.985402
Analysis finished2023-12-10 13:06:33.150721
Duration4.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년월
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
202108
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202108
2nd row202108
3rd row202108
4th row202108
5th row202108

Common Values

ValueCountFrequency (%)
202108 10000
100.0%

Length

2023-12-10T22:06:33.235805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:06:33.336991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202108 10000
100.0%

서비스이용일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
20210801
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20210801
2nd row20210801
3rd row20210801
4th row20210801
5th row20210801

Common Values

ValueCountFrequency (%)
20210801 10000
100.0%

Length

2023-12-10T22:06:33.440272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:06:33.545548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20210801 10000
100.0%

서비스이용시간값
Real number (ℝ)

ZEROS 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.4938
Minimum0
Maximum18
Zeros578
Zeros (%)5.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-10T22:06:33.649451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median10
Q314
95-th percentile17
Maximum18
Range18
Interquartile range (IQR)9

Descriptive statistics

Standard deviation5.3355607
Coefficient of variation (CV)0.56200475
Kurtosis-1.1031508
Mean9.4938
Median Absolute Deviation (MAD)4
Skewness-0.29719264
Sum94938
Variance28.468208
MonotonicityNot monotonic
2023-12-10T22:06:33.812901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
17 725
 
7.2%
12 706
 
7.1%
13 704
 
7.0%
11 658
 
6.6%
14 654
 
6.5%
16 650
 
6.5%
15 624
 
6.2%
10 613
 
6.1%
0 578
 
5.8%
9 554
 
5.5%
Other values (9) 3534
35.3%
ValueCountFrequency (%)
0 578
5.8%
1 519
5.2%
2 507
5.1%
3 357
3.6%
4 341
3.4%
5 331
3.3%
6 379
3.8%
7 448
4.5%
8 505
5.1%
9 554
5.5%
ValueCountFrequency (%)
18 147
 
1.5%
17 725
7.2%
16 650
6.5%
15 624
6.2%
14 654
6.5%
13 704
7.0%
12 706
7.1%
11 658
6.6%
10 613
6.1%
9 554
5.5%

이용자위치경도값
Real number (ℝ)

Distinct354
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.56918
Minimum126.04
Maximum130.91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-10T22:06:33.989040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.04
5-th percentile126.36
Q1126.85
median127.37
Q3128.37
95-th percentile129.27
Maximum130.91
Range4.87
Interquartile range (IQR)1.52

Descriptive statistics

Standard deviation0.91391409
Coefficient of variation (CV)0.0071640664
Kurtosis-0.80480634
Mean127.56918
Median Absolute Deviation (MAD)0.65
Skewness0.52259909
Sum1275691.8
Variance0.83523896
MonotonicityNot monotonic
2023-12-10T22:06:34.192667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.1 83
 
0.8%
127.05 83
 
0.8%
127.12 83
 
0.8%
127.07 83
 
0.8%
127.11 76
 
0.8%
126.94 71
 
0.7%
127.03 71
 
0.7%
127.08 70
 
0.7%
127.48 69
 
0.7%
126.86 69
 
0.7%
Other values (344) 9242
92.4%
ValueCountFrequency (%)
126.04 1
 
< 0.1%
126.1 2
 
< 0.1%
126.11 4
 
< 0.1%
126.12 5
0.1%
126.13 9
0.1%
126.14 4
 
< 0.1%
126.15 4
 
< 0.1%
126.16 7
0.1%
126.17 12
0.1%
126.18 11
0.1%
ValueCountFrequency (%)
130.91 1
 
< 0.1%
130.9 2
 
< 0.1%
130.88 2
 
< 0.1%
130.79 3
 
< 0.1%
129.58 1
 
< 0.1%
129.57 9
0.1%
129.56 4
< 0.1%
129.55 7
0.1%
129.54 8
0.1%
129.53 3
 
< 0.1%

이용자위치위도값
Real number (ℝ)

Distinct446
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.432501
Minimum33.2
Maximum38.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-10T22:06:34.352321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.2
5-th percentile33.45
Q135.38
median36.88
Q337.55
95-th percentile37.93
Maximum38.5
Range5.3
Interquartile range (IQR)2.17

Descriptive statistics

Standard deviation1.3494998
Coefficient of variation (CV)0.037041096
Kurtosis-0.39027947
Mean36.432501
Median Absolute Deviation (MAD)0.84
Skewness-0.78784189
Sum364325.01
Variance1.8211496
MonotonicityNot monotonic
2023-12-10T22:06:34.513232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.49 106
 
1.1%
37.5 104
 
1.0%
37.55 101
 
1.0%
37.54 96
 
1.0%
37.48 89
 
0.9%
37.51 86
 
0.9%
37.6 82
 
0.8%
37.53 81
 
0.8%
37.61 80
 
0.8%
37.63 79
 
0.8%
Other values (436) 9096
91.0%
ValueCountFrequency (%)
33.2 3
 
< 0.1%
33.21 5
 
0.1%
33.22 8
 
0.1%
33.23 24
0.2%
33.24 51
0.5%
33.25 46
0.5%
33.26 36
0.4%
33.27 30
0.3%
33.28 14
 
0.1%
33.29 16
 
0.2%
ValueCountFrequency (%)
38.5 1
 
< 0.1%
38.49 1
 
< 0.1%
38.47 2
< 0.1%
38.44 4
< 0.1%
38.41 1
 
< 0.1%
38.38 3
< 0.1%
38.37 1
 
< 0.1%
38.36 1
 
< 0.1%
38.35 3
< 0.1%
38.33 2
< 0.1%

서비스이용횟수
Real number (ℝ)

Distinct447
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.2061
Minimum1
Maximum2188
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-10T22:06:34.687677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median9
Q333
95-th percentile175
Maximum2188
Range2187
Interquartile range (IQR)30

Descriptive statistics

Standard deviation97.498378
Coefficient of variation (CV)2.4868165
Kurtosis93.966504
Mean39.2061
Median Absolute Deviation (MAD)8
Skewness7.6615559
Sum392061
Variance9505.9336
MonotonicityNot monotonic
2023-12-10T22:06:35.057380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1217
 
12.2%
2 900
 
9.0%
4 788
 
7.9%
3 675
 
6.8%
5 413
 
4.1%
6 337
 
3.4%
7 268
 
2.7%
8 257
 
2.6%
10 221
 
2.2%
9 213
 
2.1%
Other values (437) 4711
47.1%
ValueCountFrequency (%)
1 1217
12.2%
2 900
9.0%
3 675
6.8%
4 788
7.9%
5 413
 
4.1%
6 337
 
3.4%
7 268
 
2.7%
8 257
 
2.6%
9 213
 
2.1%
10 221
 
2.2%
ValueCountFrequency (%)
2188 1
< 0.1%
1842 1
< 0.1%
1768 1
< 0.1%
1611 1
< 0.1%
1515 1
< 0.1%
1409 1
< 0.1%
1404 1
< 0.1%
1306 1
< 0.1%
1284 1
< 0.1%
1206 1
< 0.1%

대상기준년월
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
202108
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202108
2nd row202108
3rd row202108
4th row202108
5th row202108

Common Values

ValueCountFrequency (%)
202108 10000
100.0%

Length

2023-12-10T22:06:35.215184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:06:35.316374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202108 10000
100.0%

Interactions

2023-12-10T22:06:32.411534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:30.854088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.473631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.948164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:32.524762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.030633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.577539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:32.076131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:32.641698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.281195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.689124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:32.195314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:32.744311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.382414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:31.795213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:06:32.301683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:06:35.384060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서비스이용시간값이용자위치경도값이용자위치위도값서비스이용횟수
서비스이용시간값1.0000.0840.0890.115
이용자위치경도값0.0841.0000.5440.078
이용자위치위도값0.0890.5441.0000.126
서비스이용횟수0.1150.0780.1261.000
2023-12-10T22:06:35.514475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서비스이용시간값이용자위치경도값이용자위치위도값서비스이용횟수
서비스이용시간값1.000-0.0590.0130.184
이용자위치경도값-0.0591.0000.005-0.041
이용자위치위도값0.0130.0051.0000.087
서비스이용횟수0.184-0.0410.0871.000

Missing values

2023-12-10T22:06:32.896621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:06:33.075431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년월서비스이용일자서비스이용시간값이용자위치경도값이용자위치위도값서비스이용횟수대상기준년월
33298202108202108018126.6334.412202108
812462021082021080116126.4434.838202108
6824202108202108011126.933.396202108
863912021082021080116128.7635.184202108
27258202108202108016128.1435.171202108
21775202108202108014129.4835.747202108
41631202108202108019128.637.1111202108
566882021082021080112126.9337.47236202108
11194202108202108012126.5936.3418202108
17210202108202108013127.6937.671202108
기준년월서비스이용일자서비스이용시간값이용자위치경도값이용자위치위도값서비스이용횟수대상기준년월
648202021082021080113127.3937.7716202108
879442021082021080117126.5436.2665202108
20289202108202108014127.537.783202108
922202021082021080117128.3936.761202108
449702021082021080110127.137.4715202108
627982021082021080113126.8137.463202108
19388202108202108014126.8333.322202108
30021202108202108017127.1237.5226202108
37154202108202108018129.436.66202108
844832021082021080116127.5137.820202108