Overview

Dataset statistics

Number of variables7
Number of observations1497
Missing cells808
Missing cells (%)7.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory83.5 KiB
Average record size in memory57.1 B

Variable types

Numeric1
Text3
DateTime2
Categorical1

Dataset

Description성남시 종량제봉투판매소현황에 대한 데이터로 종량제봉투판매소명, 전화번호, 지정일자, 주소 등의 항목을 제공합니다.
Author경기도 성남시
URLhttps://www.data.go.kr/data/3073807/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 808 (54.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:31:38.163702
Analysis finished2023-12-12 10:31:38.774670
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1497
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean749
Minimum1
Maximum1497
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T19:31:38.834407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile75.8
Q1375
median749
Q31123
95-th percentile1422.2
Maximum1497
Range1496
Interquartile range (IQR)748

Descriptive statistics

Standard deviation432.29099
Coefficient of variation (CV)0.57715753
Kurtosis-1.2
Mean749
Median Absolute Deviation (MAD)374
Skewness0
Sum1121253
Variance186875.5
MonotonicityStrictly increasing
2023-12-12T19:31:38.953421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
996 1
 
0.1%
1005 1
 
0.1%
1004 1
 
0.1%
1003 1
 
0.1%
1002 1
 
0.1%
1001 1
 
0.1%
1000 1
 
0.1%
999 1
 
0.1%
998 1
 
0.1%
Other values (1487) 1487
99.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1497 1
0.1%
1496 1
0.1%
1495 1
0.1%
1494 1
0.1%
1493 1
0.1%
1492 1
0.1%
1491 1
0.1%
1490 1
0.1%
1489 1
0.1%
1488 1
0.1%
Distinct1438
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
2023-12-12T19:31:39.155138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length22
Mean length10.269873
Min length2

Characters and Unicode

Total characters15374
Distinct characters454
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1389 ?
Unique (%)92.8%

Sample

1st row한아름슈퍼(금화슈퍼)
2nd row알뜰공판장
3rd row은창마트
4th row세븐일레븐 성남제일점
5th row가락공판장
ValueCountFrequency (%)
gs25 229
 
8.9%
씨유(cu 164
 
6.4%
세븐일레븐 114
 
4.4%
이마트24 74
 
2.9%
주)코리아세븐 73
 
2.8%
씨유 70
 
2.7%
지에스25 41
 
1.6%
지에스(gs)25 15
 
0.6%
코리아세븐 14
 
0.5%
익스프레스 13
 
0.5%
Other values (1457) 1770
68.7%
2023-12-12T19:31:39.484709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1083
 
7.0%
987
 
6.4%
2 445
 
2.9%
) 401
 
2.6%
399
 
2.6%
( 399
 
2.6%
349
 
2.3%
345
 
2.2%
339
 
2.2%
5 338
 
2.2%
Other values (444) 10289
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11510
74.9%
Space Separator 1083
 
7.0%
Uppercase Letter 1038
 
6.8%
Decimal Number 921
 
6.0%
Close Punctuation 401
 
2.6%
Open Punctuation 399
 
2.6%
Lowercase Letter 21
 
0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
987
 
8.6%
399
 
3.5%
349
 
3.0%
345
 
3.0%
339
 
2.9%
308
 
2.7%
306
 
2.7%
269
 
2.3%
267
 
2.3%
264
 
2.3%
Other values (393) 7677
66.7%
Uppercase Letter
ValueCountFrequency (%)
S 293
28.2%
G 279
26.9%
C 178
17.1%
U 174
16.8%
R 19
 
1.8%
H 13
 
1.3%
T 11
 
1.1%
K 10
 
1.0%
M 10
 
1.0%
A 10
 
1.0%
Other values (14) 41
 
3.9%
Lowercase Letter
ValueCountFrequency (%)
s 3
14.3%
e 3
14.3%
g 2
9.5%
d 2
9.5%
l 2
9.5%
a 2
9.5%
b 1
 
4.8%
u 1
 
4.8%
c 1
 
4.8%
t 1
 
4.8%
Other values (3) 3
14.3%
Decimal Number
ValueCountFrequency (%)
2 445
48.3%
5 338
36.7%
4 91
 
9.9%
1 19
 
2.1%
3 11
 
1.2%
8 5
 
0.5%
9 4
 
0.4%
0 3
 
0.3%
6 3
 
0.3%
7 2
 
0.2%
Space Separator
ValueCountFrequency (%)
1083
100.0%
Close Punctuation
ValueCountFrequency (%)
) 401
100.0%
Open Punctuation
ValueCountFrequency (%)
( 399
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11511
74.9%
Common 2804
 
18.2%
Latin 1059
 
6.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
987
 
8.6%
399
 
3.5%
349
 
3.0%
345
 
3.0%
339
 
2.9%
308
 
2.7%
306
 
2.7%
269
 
2.3%
267
 
2.3%
264
 
2.3%
Other values (394) 7678
66.7%
Latin
ValueCountFrequency (%)
S 293
27.7%
G 279
26.3%
C 178
16.8%
U 174
16.4%
R 19
 
1.8%
H 13
 
1.2%
T 11
 
1.0%
K 10
 
0.9%
M 10
 
0.9%
A 10
 
0.9%
Other values (27) 62
 
5.9%
Common
ValueCountFrequency (%)
1083
38.6%
2 445
15.9%
) 401
 
14.3%
( 399
 
14.2%
5 338
 
12.1%
4 91
 
3.2%
1 19
 
0.7%
3 11
 
0.4%
8 5
 
0.2%
9 4
 
0.1%
Other values (3) 8
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11510
74.9%
ASCII 3863
 
25.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1083
28.0%
2 445
11.5%
) 401
 
10.4%
( 399
 
10.3%
5 338
 
8.7%
S 293
 
7.6%
G 279
 
7.2%
C 178
 
4.6%
U 174
 
4.5%
4 91
 
2.4%
Other values (40) 182
 
4.7%
Hangul
ValueCountFrequency (%)
987
 
8.6%
399
 
3.5%
349
 
3.0%
345
 
3.0%
339
 
2.9%
308
 
2.7%
306
 
2.7%
269
 
2.3%
267
 
2.3%
264
 
2.3%
Other values (393) 7677
66.7%
None
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct636
Distinct (%)92.3%
Missing808
Missing (%)54.0%
Memory size11.8 KiB
2023-12-12T19:31:39.699708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length8
Mean length9.413643
Min length8

Characters and Unicode

Total characters6486
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique619 ?
Unique (%)89.8%

Sample

1st row756-6344
2nd row721-0793
3rd row722-5873
4th row731-8843
5th row757-2955
ValueCountFrequency (%)
1577-0711 28
 
4.1%
02-2630-8800 5
 
0.7%
02-1577-0711 4
 
0.6%
080-855-5525 4
 
0.6%
1644-5425 3
 
0.4%
052-916-5006 3
 
0.4%
02-3284-8112 3
 
0.4%
031-711-5428 2
 
0.3%
031-711-7852 2
 
0.3%
02-3284-8509 2
 
0.3%
Other values (626) 633
91.9%
2023-12-12T19:31:40.045919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7 1041
16.0%
- 918
14.2%
1 774
11.9%
0 725
11.2%
5 588
9.1%
3 583
9.0%
2 462
7.1%
4 398
 
6.1%
8 394
 
6.1%
6 316
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5568
85.8%
Dash Punctuation 918
 
14.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 1041
18.7%
1 774
13.9%
0 725
13.0%
5 588
10.6%
3 583
10.5%
2 462
8.3%
4 398
 
7.1%
8 394
 
7.1%
6 316
 
5.7%
9 287
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 918
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6486
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
7 1041
16.0%
- 918
14.2%
1 774
11.9%
0 725
11.2%
5 588
9.1%
3 583
9.0%
2 462
7.1%
4 398
 
6.1%
8 394
 
6.1%
6 316
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6486
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7 1041
16.0%
- 918
14.2%
1 774
11.9%
0 725
11.2%
5 588
9.1%
3 583
9.0%
2 462
7.1%
4 398
 
6.1%
8 394
 
6.1%
6 316
 
4.9%
Distinct986
Distinct (%)65.9%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
Minimum2006-01-01 00:00:00
Maximum2023-09-18 00:00:00
2023-12-12T19:31:40.177197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:31:40.597135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
분당구
630 
중원구
445 
수정구
422 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수정구
2nd row수정구
3rd row수정구
4th row수정구
5th row수정구

Common Values

ValueCountFrequency (%)
분당구 630
42.1%
중원구 445
29.7%
수정구 422
28.2%

Length

2023-12-12T19:31:40.709678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:31:40.847507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분당구 630
42.1%
중원구 445
29.7%
수정구 422
28.2%
Distinct1389
Distinct (%)92.8%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
2023-12-12T19:31:41.118051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length42
Mean length14.09686
Min length5

Characters and Unicode

Total characters21103
Distinct characters191
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1298 ?
Unique (%)86.7%

Sample

1st row수정구 탄리로30번길 10-2
2nd row수정구 수정로188번길 25
3rd row수정구 탄리로 38
4th row수정구 산성대로283번길 8
5th row수정구 탄리로42번길 22
ValueCountFrequency (%)
분당구 628
 
13.1%
중원구 440
 
9.2%
수정구 422
 
8.8%
성남시 74
 
1.5%
산성대로 57
 
1.2%
경기도 47
 
1.0%
성남대로 46
 
1.0%
8 40
 
0.8%
광명로 37
 
0.8%
6 36
 
0.8%
Other values (1125) 2955
61.8%
2023-12-12T19:31:41.567226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3287
 
15.6%
1522
 
7.2%
1492
 
7.1%
1 1282
 
6.1%
726
 
3.4%
2 720
 
3.4%
700
 
3.3%
647
 
3.1%
645
 
3.1%
645
 
3.1%
Other values (181) 9437
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12035
57.0%
Decimal Number 5419
25.7%
Space Separator 3287
 
15.6%
Dash Punctuation 202
 
1.0%
Other Punctuation 65
 
0.3%
Open Punctuation 45
 
0.2%
Close Punctuation 45
 
0.2%
Uppercase Letter 4
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1522
 
12.6%
1492
 
12.4%
726
 
6.0%
700
 
5.8%
647
 
5.4%
645
 
5.4%
645
 
5.4%
528
 
4.4%
494
 
4.1%
479
 
4.0%
Other values (161) 4157
34.5%
Decimal Number
ValueCountFrequency (%)
1 1282
23.7%
2 720
13.3%
3 579
10.7%
4 506
 
9.3%
5 472
 
8.7%
6 421
 
7.8%
0 388
 
7.2%
7 387
 
7.1%
8 347
 
6.4%
9 317
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
F 2
50.0%
A 1
25.0%
B 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 64
98.5%
/ 1
 
1.5%
Space Separator
ValueCountFrequency (%)
3287
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 202
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12035
57.0%
Common 9063
42.9%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1522
 
12.6%
1492
 
12.4%
726
 
6.0%
700
 
5.8%
647
 
5.4%
645
 
5.4%
645
 
5.4%
528
 
4.4%
494
 
4.1%
479
 
4.0%
Other values (161) 4157
34.5%
Common
ValueCountFrequency (%)
3287
36.3%
1 1282
 
14.1%
2 720
 
7.9%
3 579
 
6.4%
4 506
 
5.6%
5 472
 
5.2%
6 421
 
4.6%
0 388
 
4.3%
7 387
 
4.3%
8 347
 
3.8%
Other values (6) 674
 
7.4%
Latin
ValueCountFrequency (%)
F 2
40.0%
A 1
20.0%
e 1
20.0%
B 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12035
57.0%
ASCII 9068
43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3287
36.2%
1 1282
 
14.1%
2 720
 
7.9%
3 579
 
6.4%
4 506
 
5.6%
5 472
 
5.2%
6 421
 
4.6%
0 388
 
4.3%
7 387
 
4.3%
8 347
 
3.8%
Other values (10) 679
 
7.5%
Hangul
ValueCountFrequency (%)
1522
 
12.6%
1492
 
12.4%
726
 
6.0%
700
 
5.8%
647
 
5.4%
645
 
5.4%
645
 
5.4%
528
 
4.4%
494
 
4.1%
479
 
4.0%
Other values (161) 4157
34.5%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
Minimum2023-09-25 00:00:00
Maximum2023-09-25 00:00:00
2023-12-12T19:31:41.687444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:31:41.783826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T19:31:38.537899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:31:41.863730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장주소 구별
연번1.0000.556
사업장주소 구별0.5561.000
2023-12-12T19:31:41.955425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장주소 구별
연번1.0000.398
사업장주소 구별0.3981.000

Missing values

2023-12-12T19:31:38.648594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:31:38.737843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번판매소명전화번호지정일자사업장주소 구별사업장주소데이터기준일자
01한아름슈퍼(금화슈퍼)756-63442006-01-01수정구수정구 탄리로30번길 10-22023-09-25
12알뜰공판장721-07932011-03-11수정구수정구 수정로188번길 252023-09-25
23은창마트722-58732014-04-22수정구수정구 탄리로 382023-09-25
34세븐일레븐 성남제일점731-88432008-05-28수정구수정구 산성대로283번길 82023-09-25
45가락공판장757-29552006-10-23수정구수정구 탄리로42번길 222023-09-25
56동네슈퍼757-08212006-01-01수정구수정구 시민로163번길 32023-09-25
67세븐일레븐 성남구시청점<NA>2023-03-13수정구수정구 수정로 1722023-09-25
78대성슈퍼751-36892006-01-01수정구수정구 성남대로1258번길 5-42023-09-25
89대원공판장757-41432006-01-01수정구수정구 시민로133번길 162023-09-25
910부흥공판장757-33272006-01-01수정구수정구 산성대로215번길 162023-09-25
연번판매소명전화번호지정일자사업장주소 구별사업장주소데이터기준일자
14871488낙생농협미금지점 락마트031-710-41772017-02-21분당구분당구 돌마로 86, 1층 (구미동, 엘레강스프라자)2023-09-25
14881489씨유(CU) 미금먹자골목점<NA>2018-10-02분당구분당구 미금일로86번길 9 (구미동)2023-09-25
14891490GS25 분당신영점<NA>2018-05-21분당구분당구 탄천상로 164, 1층 F-7호, F-8호 (구미동, 시그마2오피스텔)2023-09-25
14901491씨스페이스 미금점031-711-54282018-10-18분당구분당구 미금일로90번길 22, 1층 (구미동)2023-09-25
14911492위드미분당구미점<NA>2017-02-21분당구분당구 미금일로74번길 20, 1층 (구미동)2023-09-25
14921493(주)가온메디 분당지점<NA>2020-09-28분당구분당구 미금일로 58, 1층 101호 (구미동, 까치마을롯데, 선경아파트)2023-09-25
14931494진로할인마트<NA>2015-02-04분당구분당구 미금로 184, 까치마을 제상가동 지하층 (구미동, 까치마을1단지대우아파트)2023-09-25
14941495씨유(CU) 분당까치마을점<NA>2016-12-29분당구분당구 미금일로 22, 제분산상가동 103호 (구미1동, 까치마을 2단지)2023-09-25
14951496씨유(CU) 미금헤리츠점<NA>2015-10-15분당구분당구 성남대로 151, 110호/분당엠코헤리츠 (구미동)2023-09-25
14961497(주)코리아세븐 분당구미타운점<NA>2015-12-02분당구분당구 미금일로90번길 28 (구미동)2023-09-25