Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 7
Missing cells (%): < 0.1%
Duplicate rows: 4
Duplicate rows (%): < 0.1%
Total size in memory: 488.3 KiB
Average record size in memory: 50.0 B
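
The percentages and the average record size follow from the raw counts. The sketch below recomputes them, assuming the missing-cell share is taken over all cells (variables × observations) and the duplicate share over all rows; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 10_000
missing_cells = 7
duplicate_rows = 4
total_size_kib = 488.3

# Assumed definitions: cells = variables x observations, 1 KiB = 1024 bytes.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing: {missing_pct:.3f}%")
print(f"duplicates: {duplicate_pct:.2f}%")
print(f"average record: {avg_record_bytes:.1f} B")
```

Both shares fall below the 0.1% threshold, which is why the report prints "< 0.1%" rather than a figure.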

Variable types

DateTime: 1
Text: 2
Numeric: 1
Categorical: 1

Dataset

Description: Materials from a public data provision request (공공데이터 제공신청 자료)
Author: Statistics Korea (통계청)
URL: https://www.data.go.kr/data/15062103/fileData.do

Alerts

Dataset has 4 (< 0.1%) duplicate rows [Duplicates]
판매가격 (sale price) is highly skewed (γ1 = 37.67029558) [Skewed]
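
The skewness alert is based on γ1, the third standardized moment. The sketch below shows the plain population form of that coefficient; the report's own estimator may apply a small-sample correction, so treat this as an illustration of the quantity rather than the exact calculation. The function name `skewness` is hypothetical.

```python
def skewness(values):
    """Population skewness: third central moment over variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    if m2 == 0:
        raise ValueError("skewness is undefined for constant data")
    return m3 / m2 ** 1.5


# A symmetric sample has no skew; one large value pulls the tail to the right.
print(skewness([1, 2, 3]))
print(skewness([1, 1, 1, 10]))
```

A handful of very large prices among mostly small ones is enough to push γ1 far above zero, which matches the shape reported for this column.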

Reproduction

Analysis started: 2023-12-12 19:34:52.522750
Analysis finished: 2023-12-12 19:34:54.069754
Duration: 1.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
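
A report of this kind can be regenerated with ydata-profiling. The following is a minimal sketch, not the configuration used for this report: the file paths and the title are placeholders, and the source CSV may need an explicit encoding that is not stated here.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write the result as an HTML report."""
    # Imported inside the function so the sketch can be read and loaded
    # even where pandas and ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Dataset profile")  # placeholder title
    report.to_file(html_path)


# Example call with hypothetical file names:
# build_report("masks.csv", "masks_profile.html")
```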

Variables

수집일자 (collection date)
DateTime

Distinct: 251
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2020-02-07 00:00:00
Maximum: 2020-10-14 00:00:00
Histogram with fixed size bins (bins=50)

상품명 (product name)
Text

Distinct: 3602
Distinct (%): 36.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 143
Median length: 95
Mean length: 36.3483
Min length: 14

Characters and Unicode

Total characters: 363483
Distinct characters: 778
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
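
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the second sample row of this variable as input. The two-letter codes map to the names used in the report: `Lo` is Other Letter, `Zs` Space Separator, `Lu` Uppercase Letter, `Nd` Decimal Number, `Ps` Open Punctuation and `Pe` Close Punctuation.

```python
import unicodedata
from collections import Counter

text = "[쿠팡] [] 황사 마스크 KF80"

# Tally the general category of every code point in the string.
categories = Counter(unicodedata.category(ch) for ch in text)

for code, count in categories.most_common():
    print(code, count)
```

Script and block membership are not exposed by `unicodedata`, so those breakdowns would need a separate lookup table.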

Unique

Unique: 1615
Unique (%): 16.2%
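
"Distinct" and "Unique" measure different things, which is why the two counts differ. The sketch below assumes the usual reading: distinct is the number of different values, unique is the number of values that occur exactly once. The input list is a toy example, not data from the report.

```python
from collections import Counter

# Toy values for illustration only.
values = ["쿠팡", "G마켓", "쿠팡", "티몬", "G마켓", "옥션"]

counts = Counter(values)
distinct = len(counts)                                # different values
unique = sum(1 for n in counts.values() if n == 1)    # values seen once

print(distinct, unique)
```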

Sample

1st row: [인터파크] 씨엘블루 황사방역마스크 KF94 비말차단 마스크 1개
2nd row: [쿠팡] [] 황사 마스크 KF80
3rd row: 3Q 미세 먼지 KF94 대형 1회용 일회용 마스크 1개
4th row: [티몬] [퍼스트위크] K94 황사 마스크 화이트 대형 1개 개별포장 20매 KF94 마스크의 명품 소중한숨
5th row: [G마켓] 국산 올가드 KF94 황사마스크 대형(화이트) x 1개
Value | Count | Frequency (%)
마스크 | 7248 | 9.1%
(blank) | 6704 | 8.4%
kf94 | 6026 | 7.6%
kf80 | 3539 | 4.4%
황사 | 3437 | 4.3%
대형 | 3317 | 4.2%
미세먼지 | 2005 | 2.5%
1개 | 1516 | 1.9%
방역 | 1419 | 1.8%
황사마스크 | 1078 | 1.4%
Other values (3741) | 43291 | 54.4%

Most occurring characters

Value | Count | Frequency (%)
(space) | 69580 | 19.1%
(blank) | 15484 | 4.3%
(blank) | 13439 | 3.7%
(blank) | 13280 | 3.7%
[ | 12277 | 3.4%
] | 12276 | 3.4%
K | 10409 | 2.9%
F | 10236 | 2.8%
9 | 6996 | 1.9%
4 | 6938 | 1.9%
Other values (768) | 192568 | 53.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 194030 | 53.4%
Space Separator | 69580 | 19.1%
Decimal Number | 34558 | 9.5%
Uppercase Letter | 26621 | 7.3%
Close Punctuation | 14105 | 3.9%
Open Punctuation | 14097 | 3.9%
Lowercase Letter | 5699 | 1.6%
Dash Punctuation | 2704 | 0.7%
Other Punctuation | 1556 | 0.4%
Math Symbol | 510 | 0.1%
Other values (2) | 23 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(blank) | 15484 | 8.0%
(blank) | 13439 | 6.9%
(blank) | 13280 | 6.8%
(blank) | 6223 | 3.2%
(blank) | 5720 | 2.9%
(blank) | 5711 | 2.9%
(blank) | 4649 | 2.4%
(blank) | 4227 | 2.2%
(blank) | 4046 | 2.1%
(blank) | 3629 | 1.9%
Other values (684) | 117622 | 60.6%

Uppercase Letter
Value | Count | Frequency (%)
K | 10409 | 39.1%
F | 10236 | 38.5%
G | 936 | 3.5%
N | 458 | 1.7%
S | 444 | 1.7%
A | 432 | 1.6%
D | 423 | 1.6%
O | 382 | 1.4%
M | 328 | 1.2%
E | 294 | 1.1%
Other values (16) | 2279 | 8.6%

Lowercase Letter
Value | Count | Frequency (%)
k | 646 | 11.3%
f | 565 | 9.9%
e | 455 | 8.0%
l | 432 | 7.6%
a | 399 | 7.0%
o | 363 | 6.4%
m | 330 | 5.8%
r | 299 | 5.2%
s | 280 | 4.9%
i | 268 | 4.7%
Other values (16) | 1662 | 29.2%

Decimal Number
Value | Count | Frequency (%)
9 | 6996 | 20.2%
4 | 6938 | 20.1%
0 | 6848 | 19.8%
1 | 5842 | 16.9%
8 | 4081 | 11.8%
5 | 1779 | 5.1%
3 | 1066 | 3.1%
2 | 581 | 1.7%
6 | 278 | 0.8%
7 | 149 | 0.4%

Other Punctuation
Value | Count | Frequency (%)
/ | 1112 | 71.5%
, | 313 | 20.1%
. | 62 | 4.0%
% | 34 | 2.2%
! | 17 | 1.1%
* | 6 | 0.4%
: | 4 | 0.3%
· | 4 | 0.3%
(blank) | 2 | 0.1%
& | 2 | 0.1%

Math Symbol
Value | Count | Frequency (%)
+ | 486 | 95.3%
~ | 20 | 3.9%
vertical bar (|) | 4 | 0.8%

Open Punctuation
Value | Count | Frequency (%)
[ | 12277 | 87.1%
( | 1820 | 12.9%

Close Punctuation
Value | Count | Frequency (%)
] | 12276 | 87.0%
) | 1829 | 13.0%

Other Symbol
Value | Count | Frequency (%)
(blank) | 9 | 90.0%
(blank) | 1 | 10.0%

Space Separator
Value | Count | Frequency (%)
(space) | 69580 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2704 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 13 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 194030 | 53.4%
Common | 137133 | 37.7%
Latin | 32320 | 8.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(blank) | 15484 | 8.0%
(blank) | 13439 | 6.9%
(blank) | 13280 | 6.8%
(blank) | 6223 | 3.2%
(blank) | 5720 | 2.9%
(blank) | 5711 | 2.9%
(blank) | 4649 | 2.4%
(blank) | 4227 | 2.2%
(blank) | 4046 | 2.1%
(blank) | 3629 | 1.9%
Other values (684) | 117622 | 60.6%

Latin
Value | Count | Frequency (%)
K | 10409 | 32.2%
F | 10236 | 31.7%
G | 936 | 2.9%
k | 646 | 2.0%
f | 565 | 1.7%
N | 458 | 1.4%
e | 455 | 1.4%
S | 444 | 1.4%
l | 432 | 1.3%
A | 432 | 1.3%
Other values (42) | 7307 | 22.6%

Common
Value | Count | Frequency (%)
(space) | 69580 | 50.7%
[ | 12277 | 9.0%
] | 12276 | 9.0%
9 | 6996 | 5.1%
4 | 6938 | 5.1%
0 | 6848 | 5.0%
1 | 5842 | 4.3%
8 | 4081 | 3.0%
- | 2704 | 2.0%
) | 1829 | 1.3%
Other values (22) | 7762 | 5.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 194028 | 53.4%
ASCII | 169437 | 46.6%
Misc Symbols | 10 | < 0.1%
None | 4 | < 0.1%
Punctuation | 2 | < 0.1%
Compat Jamo | 2 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 69580 | 41.1%
[ | 12277 | 7.2%
] | 12276 | 7.2%
K | 10409 | 6.1%
F | 10236 | 6.0%
9 | 6996 | 4.1%
4 | 6938 | 4.1%
0 | 6848 | 4.0%
1 | 5842 | 3.4%
8 | 4081 | 2.4%
Other values (70) | 23954 | 14.1%

Hangul
Value | Count | Frequency (%)
(blank) | 15484 | 8.0%
(blank) | 13439 | 6.9%
(blank) | 13280 | 6.8%
(blank) | 6223 | 3.2%
(blank) | 5720 | 2.9%
(blank) | 5711 | 2.9%
(blank) | 4649 | 2.4%
(blank) | 4227 | 2.2%
(blank) | 4046 | 2.1%
(blank) | 3629 | 1.9%
Other values (683) | 117620 | 60.6%

Misc Symbols
Value | Count | Frequency (%)
(blank) | 9 | 90.0%
(blank) | 1 | 10.0%

None
Value | Count | Frequency (%)
· | 4 | 100.0%

Punctuation
Value | Count | Frequency (%)
(blank) | 2 | 100.0%

Compat Jamo
Value | Count | Frequency (%)
(blank) | 2 | 100.0%

판매처 (seller)
Text

Distinct: 1245
Distinct (%): 12.5%
Missing: 7
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 14
Median length: 12
Mean length: 4.5789052
Min length: 1

Characters and Unicode

Total characters: 45757
Distinct characters: 621
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 364
Unique (%): 3.6%

Sample

1st row: 인터파크
2nd row: 쿠팡
3rd row: 롯데쇼핑
4th row: 티몬
5th row: G마켓
Value | Count | Frequency (%)
g마켓 | 662 | 5.9%
11번가 | 570 | 5.1%
옥션 | 535 | 4.8%
쿠팡 | 529 | 4.7%
위메프 | 418 | 3.8%
인터파크 | 319 | 2.9%
티몬 | 208 | 1.9%
롯데on | 153 | 1.4%
스토어 | 101 | 0.9%
주식회사 | 93 | 0.8%
Other values (1371) | 7558 | 67.8%

Most occurring characters

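Value | Count | Frequency (%)
(blank) | 1571 | 3.4%
1 | 1219 | 2.7%
(blank) | 1209 | 2.6%
(space) | 1153 | 2.5%
(blank) | 1046 | 2.3%
G | 862 | 1.9%
(blank) | 843 | 1.8%
(blank) | 733 | 1.6%
(blank) | 717 | 1.6%
(blank) | 664 | 1.5%
Other values (611) | 35740 | 78.1%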

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 35247 | 77.0%
Uppercase Letter | 3747 | 8.2%
Lowercase Letter | 3593 | 7.9%
Decimal Number | 1901 | 4.2%
Space Separator | 1153 | 2.5%
Dash Punctuation | 75 | 0.2%
Other Punctuation | 41 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(blank) | 1571 | 4.5%
(blank) | 1209 | 3.4%
(blank) | 1046 | 3.0%
(blank) | 843 | 2.4%
(blank) | 733 | 2.1%
(blank) | 717 | 2.0%
(blank) | 664 | 1.9%
(blank) | 647 | 1.8%
(blank) | 622 | 1.8%
(blank) | 578 | 1.6%
Other values (547) | 26617 | 75.5%

Uppercase Letter
Value | Count | Frequency (%)
G | 862 | 23.0%
N | 380 | 10.1%
O | 303 | 8.1%
S | 273 | 7.3%
E | 206 | 5.5%
A | 169 | 4.5%
I | 146 | 3.9%
B | 139 | 3.7%
H | 133 | 3.5%
M | 131 | 3.5%
Other values (15) | 1005 | 26.8%

Lowercase Letter
Value | Count | Frequency (%)
e | 408 | 11.4%
a | 368 | 10.2%
o | 323 | 9.0%
r | 293 | 8.2%
l | 260 | 7.2%
i | 231 | 6.4%
s | 227 | 6.3%
t | 180 | 5.0%
n | 175 | 4.9%
c | 166 | 4.6%
Other values (15) | 962 | 26.8%

Decimal Number
Value | Count | Frequency (%)
1 | 1219 | 64.1%
9 | 137 | 7.2%
7 | 123 | 6.5%
2 | 97 | 5.1%
0 | 65 | 3.4%
5 | 63 | 3.3%
6 | 62 | 3.3%
3 | 54 | 2.8%
4 | 50 | 2.6%
8 | 31 | 1.6%

Other Punctuation
Value | Count | Frequency (%)
. | 38 | 92.7%
: | 3 | 7.3%

Space Separator
Value | Count | Frequency (%)
(space) | 1153 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 75 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 35247 | 77.0%
Latin | 7340 | 16.0%
Common | 3170 | 6.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(blank) | 1571 | 4.5%
(blank) | 1209 | 3.4%
(blank) | 1046 | 3.0%
(blank) | 843 | 2.4%
(blank) | 733 | 2.1%
(blank) | 717 | 2.0%
(blank) | 664 | 1.9%
(blank) | 647 | 1.8%
(blank) | 622 | 1.8%
(blank) | 578 | 1.6%
Other values (547) | 26617 | 75.5%

Latin
Value | Count | Frequency (%)
G | 862 | 11.7%
e | 408 | 5.6%
N | 380 | 5.2%
a | 368 | 5.0%
o | 323 | 4.4%
O | 303 | 4.1%
r | 293 | 4.0%
S | 273 | 3.7%
l | 260 | 3.5%
i | 231 | 3.1%
Other values (40) | 3639 | 49.6%

Common
Value | Count | Frequency (%)
1 | 1219 | 38.5%
(space) | 1153 | 36.4%
9 | 137 | 4.3%
7 | 123 | 3.9%
2 | 97 | 3.1%
- | 75 | 2.4%
0 | 65 | 2.1%
5 | 63 | 2.0%
6 | 62 | 2.0%
3 | 54 | 1.7%
Other values (4) | 122 | 3.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 35247 | 77.0%
ASCII | 10510 | 23.0%

Most frequent character per block

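Hangul
Value | Count | Frequency (%)
(blank) | 1571 | 4.5%
(blank) | 1209 | 3.4%
(blank) | 1046 | 3.0%
(blank) | 843 | 2.4%
(blank) | 733 | 2.1%
(blank) | 717 | 2.0%
(blank) | 664 | 1.9%
(blank) | 647 | 1.8%
(blank) | 622 | 1.8%
(blank) | 578 | 1.6%
Other values (547) | 26617 | 75.5%

ASCII
Value | Count | Frequency (%)
1 | 1219 | 11.6%
(space) | 1153 | 11.0%
G | 862 | 8.2%
e | 408 | 3.9%
N | 380 | 3.6%
a | 368 | 3.5%
o | 323 | 3.1%
O | 303 | 2.9%
r | 293 | 2.8%
S | 273 | 2.6%
Other values (54) | 4928 | 46.9%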

판매가격 (sale price)
Real number (ℝ)

SKEWED 

Distinct: 1304
Distinct (%): 13.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15131.774
Minimum: 10
Maximum: 3680000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 10
5th percentile: 690
Q1: 1100
Median: 2940
Q3: 17500
95th percentile: 55722.5
Maximum: 3680000
Range: 3679990
Interquartile range (IQR): 16400

Descriptive statistics

Standard deviation: 56283.686
Coefficient of variation (CV): 3.7195694
Kurtosis: 2090.4389
Mean: 15131.774
Median Absolute Deviation (MAD): 2200
Skewness: 37.670296
Sum: 1.5131774 × 10^8
Variance: 3.1678533 × 10^9
Monotonicity: Not monotonic
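
Several of the figures above are derived from others, so they can be cross-checked. The sketch below recomputes them from the reported mean, standard deviation, quartiles and extremes, assuming the standard definitions (CV = standard deviation / mean, variance = standard deviation squared, sum = mean × number of observations); the observation count of 10000 comes from the dataset overview.

```python
import math

# Inputs copied from the report.
n = 10_000
mean = 15131.774
std = 56283.686
q1, q3 = 1100, 17500
minimum, maximum = 10, 3_680_000

cv = std / mean              # coefficient of variation
variance = std ** 2
total = mean * n
iqr = q3 - q1
value_range = maximum - minimum

# Compare with the reported values, allowing for rounding in the report.
print(math.isclose(cv, 3.7195694, rel_tol=1e-6))
print(math.isclose(variance, 3.1678533e9, rel_tol=1e-6))
print(math.isclose(total, 1.5131774e8, rel_tol=1e-6))
print(iqr, value_range)
```

A CV well above 1 together with a median (2940) far below the mean is consistent with the heavy right tail flagged by the skewness alert.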
Histogram with fixed size bins (bins=50)
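
Fixed-size binning splits the range between the smallest and largest value into equally wide intervals and counts the values in each. The sketch below is a plain-Python illustration of that idea, not the plotting code used by the report; the function name is hypothetical, and the input is ten sale prices taken from the report's sample rows with 5 bins instead of 50 to keep the output short.

```python
def fixed_width_histogram(values, bins):
    """Count values in `bins` equally wide intervals spanning min..max."""
    lo, hi = min(values), max(values)
    counts = [0] * bins
    if hi == lo:
        # All values identical: everything falls into the first bin.
        counts[0] = len(values)
        return counts
    width = (hi - lo) / bins
    for v in values:
        # The maximum sits on the right edge, so clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


prices = [1500, 1200, 3910, 26900, 1210, 55960, 13500, 57000, 630, 7000]
counts = fixed_width_histogram(prices, bins=5)
print(counts)
```

With strongly skewed data most values land in the first bin, which is why a histogram of this column on a linear axis shows little beyond one tall bar.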
Value | Count | Frequency (%)
990 | 187 | 1.9%
1500 | 155 | 1.6%
690 | 107 | 1.1%
1200 | 105 | 1.1%
9900 | 95 | 0.9%
750 | 90 | 0.9%
790 | 88 | 0.9%
1100 | 88 | 0.9%
1300 | 87 | 0.9%
980 | 86 | 0.9%
Other values (1294) | 8912 | 89.1%

Value | Count | Frequency (%)
10 | 1 | < 0.1%
40 | 2 | < 0.1%
100 | 1 | < 0.1%
150 | 2 | < 0.1%
180 | 1 | < 0.1%
190 | 9 | 0.1%
330 | 2 | < 0.1%
340 | 8 | 0.1%
360 | 1 | < 0.1%
390 | 6 | 0.1%

Value | Count | Frequency (%)
3680000 | 1 | < 0.1%
2050000 | 1 | < 0.1%
1750000 | 1 | < 0.1%
1159630 | 1 | < 0.1%
1000450 | 1 | < 0.1%
693000 | 1 | < 0.1%
490000 | 2 | < 0.1%
488950 | 1 | < 0.1%
483120 | 1 | < 0.1%
480000 | 1 | < 0.1%

작업ID (job ID)
Categorical

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
490008: 4504
490013: 2312
490015: 1910
490009: 1274

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 490008
2nd row: 490009
3rd row: 490008
4th row: 490008
5th row: 490008

Common Values

Value | Count | Frequency (%)
490008 | 4504 | 45.0%
490013 | 2312 | 23.1%
490015 | 1910 | 19.1%
490009 | 1274 | 12.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
490008 | 4504 | 45.0%
490013 | 2312 | 23.1%
490015 | 1910 | 19.1%
490009 | 1274 | 12.7%

Interactions


Correlations

(variable) | 판매가격 | 작업ID
판매가격 | 1.000 | 0.037
작업ID | 0.037 | 1.000

(variable) | 판매가격 | 작업ID
판매가격 | 1.000 | 0.025
작업ID | 0.025 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
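
Both views rest on the same per-cell test of whether a value is present. The sketch below shows that idea in plain Python: a per-column count of missing values and a row-by-column matrix of presence flags. The rows are toy records using two of the dataset's column names, with `None` standing in for a missing cell; they are not taken from the data.

```python
# Toy records for illustration only; None marks a missing cell.
rows = [
    {"상품명": "mask A", "판매처": "seller 1"},
    {"상품명": "mask B", "판매처": None},
    {"상품명": "mask C", "판매처": "seller 2"},
]
columns = ["상품명", "판매처"]

# Nullity by column: how many cells are missing in each column.
missing = {col: sum(1 for r in rows if r.get(col) is None) for col in columns}

# Nullity matrix: one row per record, True where a value is present.
nullity_matrix = [[r.get(col) is not None for col in columns] for r in rows]

print(missing)
for flags in nullity_matrix:
    print(flags)
```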

Sample

(index) | 수집일자 | 상품명 | 판매처 | 판매가격 | 작업ID
36144 | 2020-07-09 | [인터파크] 씨엘블루 황사방역마스크 KF94 비말차단 마스크 1개 | 인터파크 | 1500 | 490008
61771 | 2020-08-26 | [쿠팡] [] 황사 마스크 KF80 | 쿠팡 | 1200 | 490009
490 | 2020-02-10 | 3Q 미세 먼지 KF94 대형 1회용 일회용 마스크 1개 | 롯데쇼핑 | 3910 | 490008
36554 | 2020-07-10 | [티몬] [퍼스트위크] K94 황사 마스크 화이트 대형 1개 개별포장 20매 KF94 마스크의 명품 소중한숨 | 티몬 | 26900 | 490008
35288 | 2020-07-07 | [G마켓] 국산 올가드 KF94 황사마스크 대형(화이트) x 1개 | G마켓 | 1210 | 490008
25737 | 2020-06-10 | [롯데ON] 메디 미세먼지 황사마스크 KF94 화이트 대형 3P 10팩 (30매) | 롯데ON | 55960 | 490015
80369 | 2020-09-24 | [한국생활건강] [] 숨프리 황사 방역 마스크 KF94 | 한국생활건강 | 13500 | 490008
22352 | 2020-05-29 | 황사 메디마스크 KF80 대형 30매 (3매입x10개) - 롯데ON | 롯데ON | 57000 | 490013
90118 | 2020-10-07 | [오로드 스토어] [] 황사 마스크 KF80 | 오로드 스토어 | 630 | 490009
36078 | 2020-07-09 | [이서윤] KF94 3단 황사 방역 마스크 | 이서윤 | 7000 | 490008

(index) | 수집일자 | 상품명 | 판매처 | 판매가격 | 작업ID
93786 | 2020-10-12 | [강릉팩] [] 황사 마스크 플러스 KF80 | 강릉팩 | 500 | 490009
25687 | 2020-06-10 | [실크트리] 당일출고 KF80 KF94 유아용 소형 대형 마스크 1매 개별포장 | 실크트리 | 1600 | 490015
81922 | 2020-09-26 | [탱이전자] [] 성광 퓨어 마스크 KF94 | 탱이전자 | 600 | 490008
64551 | 2020-08-31 | [옥션] [] 합리적인 마스크 KF94 | 옥션 | 780 | 490008
62578 | 2020-08-27 | [미래를여는사람들] [] 데일리 방역마스크 화이트 대형 KF94 | 미래를여는사람들 | 1480 | 490015
66332 | 2020-09-03 | [눈이 큰 개구리] [] KF94 황사 미세먼지 방역 마스크 | 눈이 큰 개구리 | 640 | 490008
66632 | 2020-09-03 | [위메프] [] 황사 미세먼지 마스크 KF80 | 위메프 | 1260 | 490009
53554 | 2020-08-12 | [썸타는언니] 제로베이 미세황사 마스크 KF94 대형 화이트 10매 개별포장 + 인액트 브이클리어런스 수딩 핸드젤 (에탄올62%) 200ml 1개 손청결제 핸드클린 손세정제 | 썸타는언니 | 23000 | 490008
3413 | 2020-03-08 | 우한 폐렴 코로나19 KF94, n95, n99 마스크 대형 50개1매당 3500원 - 주식회사디온테크 | 주식회사디온테크 | 3500 | 490015
30229 | 2020-06-25 | [마음쇼핑] 미세 황사 마스크 KF94 | 마음쇼핑 | 1980 | 490008

Duplicate rows

Most frequently occurring

(index) | 수집일자 | 상품명 | 판매처 | 판매가격 | 작업ID | # duplicates
0 | 2020-02-13 | 캐드원 피키마스크 KF94 초미세먼지 황사 마스크 - 피키다이어트 | 피키다이어트 | 17200 | 490008 | 2
1 | 2020-02-19 | 테라브레스 미세먼지 마스크 KF94 뽀로로 소형1개 - 옥션 | 옥션 | 2370 | 490008 | 2
2 | 2020-02-21 | 엘지생활건강 테라브레스 미세먼지 마스크 KF94 소형1개 - 11번가 | 11번가 | 2940 | 490008 | 2
3 | 2020-06-22 | [메디캣] 메디인 황사 방역용 마스크 KF80 의약외품 대형 1개 | 메디캣 | 1500 | 490009 | 2
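
Duplicate rows are records whose values match in every column. The sketch below shows one way to find them with a counter over whole rows; it is an illustration of the idea rather than the report's own procedure, and the three input rows are a small hand-built sample (one duplicated row from the table above plus one other row).

```python
from collections import Counter

# Each row is a tuple of (수집일자, 상품명, 판매처, 판매가격, 작업ID).
rows = [
    ("2020-06-22", "[메디캣] 메디인 황사 방역용 마스크 KF80 의약외품 대형 1개", "메디캣", 1500, "490009"),
    ("2020-06-22", "[메디캣] 메디인 황사 방역용 마스크 KF80 의약외품 대형 1개", "메디캣", 1500, "490009"),
    ("2020-08-26", "[쿠팡] [] 황사 마스크 KF80", "쿠팡", 1200, "490009"),
]

# Count identical rows and keep those that occur more than once.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(n, row[0], row[2])
```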