Overview

Dataset statistics

Number of variables11
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory986.3 KiB
Average record size in memory101.0 B

Variable types

DateTime1
Categorical5
Numeric5

Dataset

Description당사에 등록된 고객이 2015년1월1일 이후 3개 영업점(강남,강북,부산점)에 방문한 내역을 일별로 국적별, 연령대로 분류한 통계정보 입니다. 상세 국적이나 연령이 필요한 경우 요청해주시면 추가 개방하도록 하겠습니다.
URLhttps://www.data.go.kr/data/15116743/fileData.do

Alerts

영업점명 is highly overall correlated with 영업점코드High correlation
여권발급국가 is highly overall correlated with 여권발급국가명High correlation
영업점코드 is highly overall correlated with 영업점명High correlation
여권발급국가명 is highly overall correlated with 여권발급국가High correlation
60대이상 is highly overall correlated with 50대 and 2 other fieldsHigh correlation
50대 is highly overall correlated with 60대이상 and 3 other fieldsHigh correlation
40대 is highly overall correlated with 60대이상 and 3 other fieldsHigh correlation
30대 is highly overall correlated with 60대이상 and 3 other fieldsHigh correlation
20대 is highly overall correlated with 50대 and 2 other fieldsHigh correlation
60대이상 has 1311 (13.1%) zerosZeros
50대 has 1236 (12.4%) zerosZeros
40대 has 1824 (18.2%) zerosZeros
30대 has 2388 (23.9%) zerosZeros
20대 has 3698 (37.0%) zerosZeros

Reproduction

Analysis started2023-12-12 21:38:07.989954
Analysis finished2023-12-12 21:38:12.522310
Duration4.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2882
Distinct (%)28.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2015-01-01 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T06:38:12.601117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:12.788907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업점코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
CX
3468 
HT
3396 
LT
3136 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowHT
2nd rowCX
3rd rowHT
4th rowHT
5th rowHT

Common Values

ValueCountFrequency (%)
CX 3468
34.7%
HT 3396
34.0%
LT 3136
31.4%

Length

2023-12-13T06:38:12.969666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:38:13.101406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
cx 3468
34.7%
ht 3396
34.0%
lt 3136
31.4%

영업점명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
코엑스
3468 
드래곤
3396 
롯데
3136 

Length

Max length3
Median length3
Mean length2.6864
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row드래곤
2nd row코엑스
3rd row드래곤
4th row드래곤
5th row드래곤

Common Values

ValueCountFrequency (%)
코엑스 3468
34.7%
드래곤 3396
34.0%
롯데 3136
31.4%

Length

2023-12-13T06:38:13.240782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:38:13.356874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
코엑스 3468
34.7%
드래곤 3396
34.0%
롯데 3136
31.4%

성별
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
M
5032 
F
4636 
X
 
332

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowM
3rd rowF
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 5032
50.3%
F 4636
46.4%
X 332
 
3.3%

Length

2023-12-13T06:38:13.480594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:38:13.589980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 5032
50.3%
f 4636
46.4%
x 332
 
3.3%

여권발급국가
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
USA
2122 
CHN
2121 
ETC
2070 
KOR
1940 
JPN
1747 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowJPN
2nd rowCHN
3rd rowETC
4th rowJPN
5th rowUSA

Common Values

ValueCountFrequency (%)
USA 2122
21.2%
CHN 2121
21.2%
ETC 2070
20.7%
KOR 1940
19.4%
JPN 1747
17.5%

Length

2023-12-13T06:38:13.743376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:38:13.889003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
usa 2122
21.2%
chn 2121
21.2%
etc 2070
20.7%
kor 1940
19.4%
jpn 1747
17.5%

여권발급국가명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미국
2122 
중국
2121 
기타
2070 
대한민국(영주권자)
1940 
일본
1747 

Length

Max length10
Median length2
Mean length3.552
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일본
2nd row중국
3rd row기타
4th row일본
5th row미국

Common Values

ValueCountFrequency (%)
미국 2122
21.2%
중국 2121
21.2%
기타 2070
20.7%
대한민국(영주권자) 1940
19.4%
일본 1747
17.5%

Length

2023-12-13T06:38:14.062516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:38:14.209392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미국 2122
21.2%
중국 2121
21.2%
기타 2070
20.7%
대한민국(영주권자 1940
19.4%
일본 1747
17.5%

60대이상
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct163
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.9803
Minimum0
Maximum230
Zeros1311
Zeros (%)13.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:38:14.363177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median5
Q313
95-th percentile50
Maximum230
Range230
Interquartile range (IQR)11

Descriptive statistics

Standard deviation20.187349
Coefficient of variation (CV)1.6850454
Kurtosis19.367633
Mean11.9803
Median Absolute Deviation (MAD)4
Skewness3.8560608
Sum119803
Variance407.52906
MonotonicityNot monotonic
2023-12-13T06:38:14.506659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1311
 
13.1%
1 1052
 
10.5%
2 844
 
8.4%
3 746
 
7.5%
4 621
 
6.2%
5 544
 
5.4%
6 477
 
4.8%
7 405
 
4.0%
9 341
 
3.4%
8 338
 
3.4%
Other values (153) 3321
33.2%
ValueCountFrequency (%)
0 1311
13.1%
1 1052
10.5%
2 844
8.4%
3 746
7.5%
4 621
6.2%
5 544
5.4%
6 477
 
4.8%
7 405
 
4.0%
8 338
 
3.4%
9 341
 
3.4%
ValueCountFrequency (%)
230 1
< 0.1%
219 1
< 0.1%
193 1
< 0.1%
190 1
< 0.1%
189 1
< 0.1%
187 1
< 0.1%
177 1
< 0.1%
171 1
< 0.1%
170 1
< 0.1%
169 1
< 0.1%

50대
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct363
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.0963
Minimum0
Maximum771
Zeros1236
Zeros (%)12.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:38:14.640940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median6
Q320
95-th percentile113
Maximum771
Range771
Interquartile range (IQR)18

Descriptive statistics

Standard deviation58.566574
Coefficient of variation (CV)2.4305215
Kurtosis29.814418
Mean24.0963
Median Absolute Deviation (MAD)5
Skewness4.9625511
Sum240963
Variance3430.0436
MonotonicityNot monotonic
2023-12-13T06:38:15.116082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1236
 
12.4%
1 1019
 
10.2%
2 851
 
8.5%
3 671
 
6.7%
4 537
 
5.4%
5 427
 
4.3%
6 330
 
3.3%
7 281
 
2.8%
8 267
 
2.7%
9 229
 
2.3%
Other values (353) 4152
41.5%
ValueCountFrequency (%)
0 1236
12.4%
1 1019
10.2%
2 851
8.5%
3 671
6.7%
4 537
5.4%
5 427
 
4.3%
6 330
 
3.3%
7 281
 
2.8%
8 267
 
2.7%
9 229
 
2.3%
ValueCountFrequency (%)
771 1
< 0.1%
675 1
< 0.1%
650 1
< 0.1%
621 1
< 0.1%
601 1
< 0.1%
587 1
< 0.1%
577 1
< 0.1%
568 1
< 0.1%
535 1
< 0.1%
515 1
< 0.1%

40대
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct381
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.5271
Minimum0
Maximum677
Zeros1824
Zeros (%)18.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:38:15.290265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median7
Q322
95-th percentile140
Maximum677
Range677
Interquartile range (IQR)21

Descriptive statistics

Standard deviation62.746013
Coefficient of variation (CV)2.2794269
Kurtosis22.206574
Mean27.5271
Median Absolute Deviation (MAD)7
Skewness4.3485015
Sum275271
Variance3937.0622
MonotonicityNot monotonic
2023-12-13T06:38:15.434079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1824
18.2%
1 948
 
9.5%
2 651
 
6.5%
3 473
 
4.7%
4 366
 
3.7%
5 344
 
3.4%
8 276
 
2.8%
6 258
 
2.6%
7 249
 
2.5%
10 233
 
2.3%
Other values (371) 4378
43.8%
ValueCountFrequency (%)
0 1824
18.2%
1 948
9.5%
2 651
 
6.5%
3 473
 
4.7%
4 366
 
3.7%
5 344
 
3.4%
6 258
 
2.6%
7 249
 
2.5%
8 276
 
2.8%
9 213
 
2.1%
ValueCountFrequency (%)
677 1
< 0.1%
651 1
< 0.1%
601 1
< 0.1%
590 1
< 0.1%
587 1
< 0.1%
540 1
< 0.1%
535 1
< 0.1%
534 1
< 0.1%
532 1
< 0.1%
526 1
< 0.1%

30대
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct303
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.8475
Minimum0
Maximum527
Zeros2388
Zeros (%)23.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:38:15.598824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median6
Q322
95-th percentile125
Maximum527
Range527
Interquartile range (IQR)21

Descriptive statistics

Standard deviation47.771427
Coefficient of variation (CV)2.0032048
Kurtosis15.58354
Mean23.8475
Median Absolute Deviation (MAD)6
Skewness3.5557791
Sum238475
Variance2282.1093
MonotonicityNot monotonic
2023-12-13T06:38:15.745679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2388
23.9%
1 932
 
9.3%
2 619
 
6.2%
3 412
 
4.1%
4 343
 
3.4%
5 287
 
2.9%
6 269
 
2.7%
7 238
 
2.4%
8 221
 
2.2%
9 211
 
2.1%
Other values (293) 4080
40.8%
ValueCountFrequency (%)
0 2388
23.9%
1 932
 
9.3%
2 619
 
6.2%
3 412
 
4.1%
4 343
 
3.4%
5 287
 
2.9%
6 269
 
2.7%
7 238
 
2.4%
8 221
 
2.2%
9 211
 
2.1%
ValueCountFrequency (%)
527 1
< 0.1%
459 1
< 0.1%
458 1
< 0.1%
447 1
< 0.1%
399 1
< 0.1%
390 1
< 0.1%
388 1
< 0.1%
374 1
< 0.1%
373 1
< 0.1%
370 1
< 0.1%

20대
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct136
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.031
Minimum0
Maximum224
Zeros3698
Zeros (%)37.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:38:15.888551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q311
95-th percentile49
Maximum224
Range224
Interquartile range (IQR)11

Descriptive statistics

Standard deviation18.354458
Coefficient of variation (CV)1.8297735
Kurtosis14.020327
Mean10.031
Median Absolute Deviation (MAD)2
Skewness3.1936965
Sum100310
Variance336.88613
MonotonicityNot monotonic
2023-12-13T06:38:16.011072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3698
37.0%
1 909
 
9.1%
2 593
 
5.9%
3 445
 
4.5%
5 320
 
3.2%
4 314
 
3.1%
6 275
 
2.8%
7 255
 
2.5%
8 231
 
2.3%
9 188
 
1.9%
Other values (126) 2772
27.7%
ValueCountFrequency (%)
0 3698
37.0%
1 909
 
9.1%
2 593
 
5.9%
3 445
 
4.5%
4 314
 
3.1%
5 320
 
3.2%
6 275
 
2.8%
7 255
 
2.5%
8 231
 
2.3%
9 188
 
1.9%
ValueCountFrequency (%)
224 1
 
< 0.1%
185 1
 
< 0.1%
176 1
 
< 0.1%
162 1
 
< 0.1%
158 1
 
< 0.1%
150 3
< 0.1%
147 1
 
< 0.1%
146 2
< 0.1%
143 1
 
< 0.1%
140 1
 
< 0.1%

Interactions

2023-12-13T06:38:11.645210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:09.280480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:09.845502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.467415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.021857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.747763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:09.391190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:09.989189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.577100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.131939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.867657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:09.510175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.125441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.704291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.297097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.967963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:09.616338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.229908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.808834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.415263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:12.077860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:09.719233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.342484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:10.923761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:38:11.537766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:38:16.108725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업점코드영업점명성별여권발급국가여권발급국가명60대이상50대40대30대20대
영업점코드1.0001.0000.1450.0250.0250.2610.3160.2900.2860.241
영업점명1.0001.0000.1450.0250.0250.2610.3160.2900.2860.241
성별0.1450.1451.0000.1130.1130.3850.2970.3310.3930.352
여권발급국가0.0250.0250.1131.0001.0000.4380.5030.5030.4840.415
여권발급국가명0.0250.0250.1131.0001.0000.4380.5030.5030.4840.415
60대이상0.2610.2610.3850.4380.4381.0000.9040.8440.8090.458
50대0.3160.3160.2970.5030.5030.9041.0000.9260.8880.501
40대0.2900.2900.3310.5030.5030.8440.9261.0000.9250.589
30대0.2860.2860.3930.4840.4840.8090.8880.9251.0000.757
20대0.2410.2410.3520.4150.4150.4580.5010.5890.7571.000
2023-12-13T06:38:16.238970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업점명성별여권발급국가영업점코드여권발급국가명
영업점명1.0000.0440.0181.0000.018
성별0.0441.0000.0850.0440.085
여권발급국가0.0180.0851.0000.0181.000
영업점코드1.0000.0440.0181.0000.018
여권발급국가명0.0180.0851.0000.0181.000
2023-12-13T06:38:16.340077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
60대이상50대40대30대20대영업점코드영업점명성별여권발급국가여권발급국가명
60대이상1.0000.7430.6920.6030.4790.1560.1560.2460.1970.197
50대0.7431.0000.8440.7560.6390.2000.2000.1860.2320.232
40대0.6920.8441.0000.8690.7750.1810.1810.2100.2320.232
30대0.6030.7560.8691.0000.8470.1780.1780.2570.2210.221
20대0.4790.6390.7750.8471.0000.1480.1480.2260.1850.185
영업점코드0.1560.2000.1810.1780.1481.0001.0000.0440.0180.018
영업점명0.1560.2000.1810.1780.1481.0001.0000.0440.0180.018
성별0.2460.1860.2100.2570.2260.0440.0441.0000.0850.085
여권발급국가0.1970.2320.2320.2210.1850.0180.0180.0851.0001.000
여권발급국가명0.1970.2320.2320.2210.1850.0180.0180.0851.0001.000

Missing values

2023-12-13T06:38:12.235710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:38:12.430548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

방문일자영업점코드영업점명성별여권발급국가여권발급국가명60대이상50대40대30대20대
501862019-05-23HT드래곤FJPN일본458412
674372021-07-02CX코엑스MCHN중국42119126783
40382015-05-04HT드래곤FETC기타076196
764522022-06-23HT드래곤MJPN일본41020
306392017-08-22HT드래곤MUSA미국1314512
669702021-06-13CX코엑스MUSA미국1631575
318662017-10-01CX코엑스MUSA미국132619154
233142016-12-31CX코엑스FJPN일본2318413318
117562016-01-01CX코엑스MKOR대한민국(영주권자)17231773
664532021-05-24CX코엑스METC기타308321210
방문일자영업점코드영업점명성별여권발급국가여권발급국가명60대이상50대40대30대20대
306752017-08-23LT롯데FJPN일본57284
661592021-05-12LT롯데FCHN중국13700
658422021-04-30CX코엑스FCHN중국2221042
733082022-02-21HT드래곤FUSA미국43000
493202019-04-24HT드래곤MUSA미국1033100
126472016-01-28LT롯데FJPN일본95531
176082016-07-02HT드래곤FCHN중국94740147
762202022-06-14HT드래곤MUSA미국68700
76262015-08-25HT드래곤XUSA미국00010
400032018-06-21LT롯데FKOR대한민국(영주권자)15000