Overview

Dataset statistics

Number of variables: 4
Number of observations: 353
Missing cells: 20
Missing cells (%): 1.4%
Duplicate rows: 1
Duplicate rows (%): 0.3%
Total size in memory: 11.5 KiB
Average record size in memory: 33.4 B
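
The percentages and the average record size follow from the raw counts. The sketch below redoes that arithmetic using only the figures reported above; the variable names are illustrative, and because the total size is already rounded in the report, the derived record size is approximate.

```python
# Figures copied from the dataset statistics above.
n_variables = 4
n_observations = 353
missing_cells = 20
duplicate_rows = 1
total_size_kib = 11.5  # already rounded in the report

total_cells = n_variables * n_observations             # every cell in the table
missing_pct = 100 * missing_cells / total_cells         # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations   # share of duplicate rows
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```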

Variable types

Numeric: 1
Text: 2
DateTime: 1

Dataset

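Description: Data on the status of businesses selling emergency household medicines (안전상비의약품) in Gimpo-si, Gyeonggi-do. It provides fields such as store name, location address, and data reference date.
URL: https://www.data.go.kr/data/15034900/fileData.do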

Alerts

데이터기준일자 has constant value "2023-06-15" (Constant)
Dataset has 1 (0.3%) duplicate rows (Duplicates)
순번 has 5 (1.4%) missing values (Missing)
판매점포명 has 5 (1.4%) missing values (Missing)
소재지(도로명) has 5 (1.4%) missing values (Missing)
데이터기준일자 has 5 (1.4%) missing values (Missing)
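
The four Missing alerts and the Duplicates alert appear to share one cause: 20 missing cells is exactly 5 rows times 4 columns, and the last rows of the sample are entirely `<NA>`. The sketch below shows one way such rows could be dropped before profiling. It is a toy example: the `drop_empty_rows` helper is hypothetical, rows are plain dictionaries, and only two populated rows taken from the report's sample are used.

```python
COLUMNS = ["순번", "판매점포명", "소재지(도로명)", "데이터기준일자"]

def drop_empty_rows(rows):
    """Keep only rows that have at least one non-missing field."""
    return [row for row in rows
            if any(row.get(col) is not None for col in COLUMNS)]

# Toy stand-in for the real file: two populated rows from the report's sample,
# followed by five fully empty rows like those at the end of the dataset.
rows = [
    {"순번": 4, "판매점포명": "GS25 풍무예지점",
     "소재지(도로명)": "경기도 김포시 풍무로146번길 52-18 (풍무동)",
     "데이터기준일자": "2023-06-15"},
    {"순번": 5, "판매점포명": "씨유 사우풍년점",
     "소재지(도로명)": "경기도 김포시 풍년로 28, 1층 101호 (사우동)",
     "데이터기준일자": "2023-06-15"},
] + [dict.fromkeys(COLUMNS) for _ in range(5)]

cleaned = drop_empty_rows(rows)
removed_cells = (len(rows) - len(cleaned)) * len(COLUMNS)

print(f"Rows kept: {len(cleaned)}, empty cells removed: {removed_cells}")
```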

Reproduction

Analysis started: 2023-12-12 02:12:16.517524
Analysis finished: 2023-12-12 02:12:17.316952
Duration: 0.8 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (sequence number)
Real number (ℝ)

MISSING 

Distinct: 348
Distinct (%): 100.0%
Missing: 5
Missing (%): 1.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 174.5
Minimum: 1
Maximum: 348
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 18.35
Q1: 87.75
Median: 174.5
Q3: 261.25
95-th percentile: 330.65
Maximum: 348
Range: 347
Interquartile range (IQR): 173.5

Descriptive statistics

Standard deviation: 100.60318
Coefficient of variation (CV): 0.57652253
Kurtosis: -1.2
Mean: 174.5
Median Absolute Deviation (MAD): 87
Skewness: 0
Sum: 60726
Variance: 10121
Monotonicity: Strictly increasing
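
These figures are what one would expect from a plain running index. As a cross-check, the sketch below recomputes them with Python's standard library under the assumption that 순번 holds exactly the integers 1 through 348 with no gaps; the report does not state this directly, although the distinct count, minimum, maximum, and sum are all consistent with it. The quartiles use the inclusive (linear interpolation) method.

```python
import statistics

# Assumption: 순번 is exactly 1..348 (the 5 missing values are excluded).
values = list(range(1, 349))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                      # coefficient of variation
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
mad = statistics.median(abs(v - median) for v in values)

print(f"Sum: {total}, mean: {mean}, variance: {variance}")
print(f"Standard deviation: {std_dev:.5f}, CV: {cv:.8f}")
print(f"Q1: {q1}, median: {median}, Q3: {q3}, IQR: {q3 - q1}")
print(f"MAD: {mad}")
```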
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
231 | 1 | 0.3%
239 | 1 | 0.3%
238 | 1 | 0.3%
237 | 1 | 0.3%
236 | 1 | 0.3%
235 | 1 | 0.3%
234 | 1 | 0.3%
233 | 1 | 0.3%
232 | 1 | 0.3%
230 | 1 | 0.3%
Other values (338) | 338 | 95.8%
(Missing) | 5 | 1.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
348 | 1 | 0.3%
347 | 1 | 0.3%
346 | 1 | 0.3%
345 | 1 | 0.3%
344 | 1 | 0.3%
343 | 1 | 0.3%
342 | 1 | 0.3%
341 | 1 | 0.3%
340 | 1 | 0.3%
339 | 1 | 0.3%

판매점포명 (store name)
Text

MISSING 

Distinct: 348
Distinct (%): 100.0%
Missing: 5
Missing (%): 1.4%
Memory size: 2.9 KiB

Length

Max length: 18
Median length: 14
Mean length: 10.741379
Min length: 5

Characters and Unicode

Total characters: 3738
Distinct characters: 233
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
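
As an illustration of the category property, the sketch below counts the Unicode general categories in one store name taken from the sample rows, using Python's `unicodedata` module. The `category_counts` helper is hypothetical. The module returns two-letter codes: `Lo` is Other Letter, `Lu` Uppercase Letter, `Ll` Lowercase Letter, `Nd` Decimal Number, `Zs` Space Separator, `Ps` Open Punctuation, and `Pe` Close Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

name = "씨유(CU) 김포학운빌리지점"  # 2nd sample row of 판매점포명
counts = category_counts(name)

for code, count in counts.most_common():
    print(code, count)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category table for this column.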

Unique

Unique: 348
Unique (%): 100.0%

Sample

1st row: (주)코리아세븐 김포쌍용점
2nd row: 씨유(CU) 김포학운빌리지점
3rd row: 지에스25 다원김포자이점
4th row: GS25 풍무예지점
5th row: 씨유 사우풍년점

Words

Value | Count | Frequency (%)
씨유 | 71 | 12.5%
지에스25 | 44 | 7.7%
세븐일레븐 | 39 | 6.8%
지에스(gs)25 | 15 | 2.6%
주)코리아세븐 | 12 | 2.1%
씨유(cu | 7 | 1.2%
gs25 | 6 | 1.1%
미니스톱 | 4 | 0.7%
25 | 3 | 0.5%
cu | 3 | 0.5%
Other values (348) | 366 | 64.2%
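
The word counts show the same convenience-store chains written in several ways (for example 씨유, 씨유(cu, and cu, or 지에스25, 지에스(gs)25, and gs25). The sketch below is one possible way to fold such variants into a single label before counting. The `brand_of` helper, the pattern list, and the English labels are assumptions for illustration, including the mapping of 코리아세븐 to 7-Eleven; none of this is part of the dataset or the profiling tool.

```python
import re
from collections import Counter

# Hypothetical mapping from spelling variants to one label per chain.
BRAND_PATTERNS = [
    ("CU", re.compile(r"씨유|CU", re.IGNORECASE)),
    ("GS25", re.compile(r"지에스|GS", re.IGNORECASE)),
    ("7-Eleven", re.compile(r"세븐일레븐|코리아세븐")),
    ("Ministop", re.compile(r"미니스톱")),
]

def brand_of(store_name):
    """Return the first chain label whose pattern occurs in the store name."""
    for brand, pattern in BRAND_PATTERNS:
        if pattern.search(store_name):
            return brand
    return "other"

# The five sample rows of 판매점포명 shown in the report.
sample_names = [
    "(주)코리아세븐 김포쌍용점",
    "씨유(CU) 김포학운빌리지점",
    "지에스25 다원김포자이점",
    "GS25 풍무예지점",
    "씨유 사우풍년점",
]

brand_counts = Counter(brand_of(name) for name in sample_names)
print(brand_counts)
```

Substring matching is deliberately loose here; a real cleanup would need to check the patterns against the full list of names.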

Most occurring characters

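Value | Count | Frequency (%)
(not preserved) | 331 | 8.9%
(space) | 224 | 6.0%
(not preserved) | 189 | 5.1%
(not preserved) | 180 | 4.8%
2 | 135 | 3.6%
(not preserved) | 134 | 3.6%
(not preserved) | 128 | 3.4%
5 | 124 | 3.3%
(not preserved) | 118 | 3.2%
(not preserved) | 115 | 3.1%
Other values (223) | 2060 | 55.1%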

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2924 | 78.2%
Decimal Number | 273 | 7.3%
Space Separator | 224 | 6.0%
Uppercase Letter | 199 | 5.3%
Close Punctuation | 57 | 1.5%
Open Punctuation | 57 | 1.5%
Lowercase Letter | 4 | 0.1%

Most frequent character per category

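Other Letter
Value | Count | Frequency (%)
(not preserved) | 331 | 11.3%
(not preserved) | 189 | 6.5%
(not preserved) | 180 | 6.2%
(not preserved) | 134 | 4.6%
(not preserved) | 128 | 4.4%
(not preserved) | 118 | 4.0%
(not preserved) | 115 | 3.9%
(not preserved) | 91 | 3.1%
(not preserved) | 82 | 2.8%
(not preserved) | 81 | 2.8%
Other values (203) | 1475 | 50.4%

Uppercase Letter
Value | Count | Frequency (%)
S | 70 | 35.2%
G | 69 | 34.7%
C | 29 | 14.6%
U | 25 | 12.6%
K | 2 | 1.0%
R | 2 | 1.0%
L | 1 | 0.5%
H | 1 | 0.5%

Decimal Number
Value | Count | Frequency (%)
2 | 135 | 49.5%
5 | 124 | 45.4%
4 | 6 | 2.2%
1 | 5 | 1.8%
3 | 2 | 0.7%
6 | 1 | 0.4%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 50.0%
u | 1 | 25.0%
c | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 224 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 57 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 57 | 100.0%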

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2924 | 78.2%
Common | 611 | 16.3%
Latin | 203 | 5.4%

Most frequent character per script

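Hangul
Value | Count | Frequency (%)
(not preserved) | 331 | 11.3%
(not preserved) | 189 | 6.5%
(not preserved) | 180 | 6.2%
(not preserved) | 134 | 4.6%
(not preserved) | 128 | 4.4%
(not preserved) | 118 | 4.0%
(not preserved) | 115 | 3.9%
(not preserved) | 91 | 3.1%
(not preserved) | 82 | 2.8%
(not preserved) | 81 | 2.8%
Other values (203) | 1475 | 50.4%

Latin
Value | Count | Frequency (%)
S | 70 | 34.5%
G | 69 | 34.0%
C | 29 | 14.3%
U | 25 | 12.3%
K | 2 | 1.0%
R | 2 | 1.0%
e | 2 | 1.0%
L | 1 | 0.5%
H | 1 | 0.5%
u | 1 | 0.5%

Common
Value | Count | Frequency (%)
(space) | 224 | 36.7%
2 | 135 | 22.1%
5 | 124 | 20.3%
) | 57 | 9.3%
( | 57 | 9.3%
4 | 6 | 1.0%
1 | 5 | 0.8%
3 | 2 | 0.3%
6 | 1 | 0.2%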

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2924 | 78.2%
ASCII | 814 | 21.8%

Most frequent character per block

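Hangul
Value | Count | Frequency (%)
(not preserved) | 331 | 11.3%
(not preserved) | 189 | 6.5%
(not preserved) | 180 | 6.2%
(not preserved) | 134 | 4.6%
(not preserved) | 128 | 4.4%
(not preserved) | 118 | 4.0%
(not preserved) | 115 | 3.9%
(not preserved) | 91 | 3.1%
(not preserved) | 82 | 2.8%
(not preserved) | 81 | 2.8%
Other values (203) | 1475 | 50.4%

ASCII
Value | Count | Frequency (%)
(space) | 224 | 27.5%
2 | 135 | 16.6%
5 | 124 | 15.2%
S | 70 | 8.6%
G | 69 | 8.5%
) | 57 | 7.0%
( | 57 | 7.0%
C | 29 | 3.6%
U | 25 | 3.1%
4 | 6 | 0.7%
Other values (10) | 18 | 2.2%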

소재지(도로명) [location, road-name address]
Text

MISSING 

Distinct: 348
Distinct (%): 100.0%
Missing: 5
Missing (%): 1.4%
Memory size: 2.9 KiB

Length

Max length: 73
Median length: 50.5
Mean length: 35.034483
Min length: 17

Characters and Unicode

Total characters: 12192
Distinct characters: 249
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 348
Unique (%): 100.0%

Sample

1st row: 경기도 김포시 중봉로58번길 60, 쌍용아파트 1층 102호,103호 (감정동)
2nd row: 경기도 김포시 양촌읍 삼도로174번길 18-4, 101호,102호
3rd row: 경기도 김포시 걸포2로 60, 상가동 103호,104호 (걸포동, 한강메트로자이3단지)
4th row: 경기도 김포시 풍무로146번길 52-18 (풍무동)
5th row: 경기도 김포시 풍년로 28, 1층 101호 (사우동)

Words

Value | Count | Frequency (%)
경기도 | 348 | 14.6%
김포시 | 348 | 14.6%
1층 | 80 | 3.4%
구래동 | 48 | 2.0%
장기동 | 40 | 1.7%
양촌읍 | 38 | 1.6%
상가동 | 35 | 1.5%
풍무동 | 34 | 1.4%
통진읍 | 29 | 1.2%
고촌읍 | 27 | 1.1%
Other values (691) | 1349 | 56.8%
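
District names such as 구래동, 장기동, and 양촌읍 rank high in the word counts, which suggests the address can be split into a district field. The sketch below is one hedged way to do that, based only on the five sample addresses: an eup or myeon (읍/면) appears as the third token, while a dong (동) appears inside the trailing parentheses. The `district_of` helper and its regular expression are hypothetical and have not been checked against all 348 addresses.

```python
import re
from typing import Optional

# A dong name directly after "(" and followed by "," or ")".
_LEGAL_DONG = re.compile(r"\(([^\s,()]+동)[,)]")

def district_of(address: str) -> Optional[str]:
    """Guess the eup/myeon/dong of a Gimpo road-name address."""
    tokens = address.split()
    # Addresses start with "경기도 김포시"; an eup or myeon follows as token 3.
    if len(tokens) > 2 and tokens[2].endswith(("읍", "면")):
        return tokens[2]
    match = _LEGAL_DONG.search(address)
    return match.group(1) if match else None

sample_addresses = [
    "경기도 김포시 중봉로58번길 60, 쌍용아파트 1층 102호,103호 (감정동)",
    "경기도 김포시 양촌읍 삼도로174번길 18-4, 101호,102호",
    "경기도 김포시 걸포2로 60, 상가동 103호,104호 (걸포동, 한강메트로자이3단지)",
    "경기도 김포시 풍무로146번길 52-18 (풍무동)",
    "경기도 김포시 풍년로 28, 1층 101호 (사우동)",
]

districts = [district_of(address) for address in sample_addresses]
print(districts)
```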

Most occurring characters

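Value | Count | Frequency (%)
(space) | 2028 | 16.6%
1 | 817 | 6.7%
(not preserved) | 526 | 4.3%
(not preserved) | 504 | 4.1%
, | 430 | 3.5%
(not preserved) | 397 | 3.3%
(not preserved) | 385 | 3.2%
(not preserved) | 378 | 3.1%
(not preserved) | 356 | 2.9%
0 | 356 | 2.9%
Other values (239) | 6015 | 49.3%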

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6638 | 54.4%
Decimal Number | 2507 | 20.6%
Space Separator | 2028 | 16.6%
Other Punctuation | 430 | 3.5%
Close Punctuation | 240 | 2.0%
Open Punctuation | 240 | 2.0%
Dash Punctuation | 68 | 0.6%
Uppercase Letter | 25 | 0.2%
Math Symbol | 13 | 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 526 | 7.9%
(not preserved) | 504 | 7.6%
(not preserved) | 397 | 6.0%
(not preserved) | 385 | 5.8%
(not preserved) | 378 | 5.7%
(not preserved) | 356 | 5.4%
(not preserved) | 354 | 5.3%
(not preserved) | 313 | 4.7%
(not preserved) | 211 | 3.2%
(not preserved) | 179 | 2.7%
Other values (215) | 3035 | 45.7%

Decimal Number
Value | Count | Frequency (%)
1 | 817 | 32.6%
0 | 356 | 14.2%
2 | 284 | 11.3%
3 | 232 | 9.3%
4 | 160 | 6.4%
8 | 160 | 6.4%
5 | 139 | 5.5%
6 | 130 | 5.2%
7 | 129 | 5.1%
9 | 100 | 4.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 13 | 52.0%
A | 4 | 16.0%
S | 3 | 12.0%
D | 2 | 8.0%
R | 2 | 8.0%
M | 1 | 4.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2028 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 430 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 240 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 240 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 68 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 13 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6638 | 54.4%
Common | 5526 | 45.3%
Latin | 28 | 0.2%

Most frequent character per script

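Hangul
Value | Count | Frequency (%)
(not preserved) | 526 | 7.9%
(not preserved) | 504 | 7.6%
(not preserved) | 397 | 6.0%
(not preserved) | 385 | 5.8%
(not preserved) | 378 | 5.7%
(not preserved) | 356 | 5.4%
(not preserved) | 354 | 5.3%
(not preserved) | 313 | 4.7%
(not preserved) | 211 | 3.2%
(not preserved) | 179 | 2.7%
Other values (215) | 3035 | 45.7%

Common
Value | Count | Frequency (%)
(space) | 2028 | 36.7%
1 | 817 | 14.8%
, | 430 | 7.8%
0 | 356 | 6.4%
2 | 284 | 5.1%
) | 240 | 4.3%
( | 240 | 4.3%
3 | 232 | 4.2%
4 | 160 | 2.9%
8 | 160 | 2.9%
Other values (6) | 579 | 10.5%

Latin
Value | Count | Frequency (%)
B | 13 | 46.4%
A | 4 | 14.3%
S | 3 | 10.7%
D | 2 | 7.1%
R | 2 | 7.1%
e | 2 | 7.1%
(not preserved) | 1 | 3.6%
M | 1 | 3.6%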

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6638 | 54.4%
ASCII | 5553 | 45.5%
Number Forms | 1 | < 0.1%

Most frequent character per block

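ASCII
Value | Count | Frequency (%)
(space) | 2028 | 36.5%
1 | 817 | 14.7%
, | 430 | 7.7%
0 | 356 | 6.4%
2 | 284 | 5.1%
) | 240 | 4.3%
( | 240 | 4.3%
3 | 232 | 4.2%
4 | 160 | 2.9%
8 | 160 | 2.9%
Other values (13) | 606 | 10.9%

Hangul
Value | Count | Frequency (%)
(not preserved) | 526 | 7.9%
(not preserved) | 504 | 7.6%
(not preserved) | 397 | 6.0%
(not preserved) | 385 | 5.8%
(not preserved) | 378 | 5.7%
(not preserved) | 356 | 5.4%
(not preserved) | 354 | 5.3%
(not preserved) | 313 | 4.7%
(not preserved) | 211 | 3.2%
(not preserved) | 179 | 2.7%
Other values (215) | 3035 | 45.7%

Number Forms
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%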

데이터기준일자 (data reference date)
Date

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 0.3%
Missing: 5
Missing (%): 1.4%
Memory size: 2.9 KiB
Minimum: 2023-06-15 00:00:00
Maximum: 2023-06-15 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
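
To make the idea of nullity correlation concrete, the sketch below computes the Pearson correlation between two missingness indicator vectors (1 where a value is missing, 0 otherwise). It assumes that the 5 missing values of each column sit in the same 5 trailing rows, as the last rows of the sample suggest; the `pearson` helper is a plain standard-library implementation written for this example, not the routine the report itself uses.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Assumption: both columns are missing in the same 5 trailing rows of 353.
missing_a = [0] * 348 + [1] * 5
missing_b = [0] * 348 + [1] * 5

# Contrast case (hypothetical): a column missing in 5 different rows.
missing_c = [1] * 5 + [0] * 348

same_rows = pearson(missing_a, missing_b)
different_rows = pearson(missing_a, missing_c)
print(f"Same rows missing: {same_rows:.3f}")
print(f"Different rows missing: {different_rows:.3f}")
```

When columns are always missing together, the correlation is at its maximum of 1; when the gaps never coincide, it drops to slightly below zero.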

Sample

First rows

(index) | 순번 | 판매점포명 | 소재지(도로명) | 데이터기준일자
0 | 1 | (주)코리아세븐 김포쌍용점 | 경기도 김포시 중봉로58번길 60, 쌍용아파트 1층 102호,103호 (감정동) | 2023-06-15
1 | 2 | 씨유(CU) 김포학운빌리지점 | 경기도 김포시 양촌읍 삼도로174번길 18-4, 101호,102호 | 2023-06-15
2 | 3 | 지에스25 다원김포자이점 | 경기도 김포시 걸포2로 60, 상가동 103호,104호 (걸포동, 한강메트로자이3단지) | 2023-06-15
3 | 4 | GS25 풍무예지점 | 경기도 김포시 풍무로146번길 52-18 (풍무동) | 2023-06-15
4 | 5 | 씨유 사우풍년점 | 경기도 김포시 풍년로 28, 1층 101호 (사우동) | 2023-06-15
5 | 6 | 씨유 학운대성점 | 경기도 김포시 양촌읍 황금1로80번길 39, 대성디자인포장센타 105,106호 | 2023-06-15
6 | 7 | 지에스(GS)25 한강라베니체 | 경기도 김포시 김포한강4로 8, 2층 219호 (장기동) | 2023-06-15
7 | 8 | 지에스(GS)25 풍무신안점 | 경기도 김포시 풍무로 111, 신안아파트 상가동 1층 101호 (풍무동) | 2023-06-15
8 | 9 | 지에스25 김포우리병원점 | 경기도 김포시 감암로 1, 임상의학연구소 일부 1층 (걸포동) | 2023-06-15
9 | 10 | 지에스(GS)25 월곶타운 | 경기도 김포시 월곶면 군하로 256 | 2023-06-15

Last rows

(index) | 순번 | 판매점포명 | 소재지(도로명) | 데이터기준일자
343 | 344 | 미니스톱 김포애기봉점 | 경기도 김포시 월곶면 애기봉로 472 | 2023-06-15
344 | 345 | 세븐일레븐 김포양곡점 | 경기도 김포시 양촌읍 양곡3로1번길 72 | 2023-06-15
345 | 346 | GS25김포귀전 | 경기도 김포시 통진읍 월하로 521 | 2023-06-15
346 | 347 | 세븐일레븐 김포월곶점 | 경기도 김포시 월곶면 김포대학로 4 | 2023-06-15
347 | 348 | 씨유양곡신협점 | 경기도 김포시 양촌읍 양곡1로 33 | 2023-06-15
348 | <NA> | <NA> | <NA> | <NA>
349 | <NA> | <NA> | <NA> | <NA>
350 | <NA> | <NA> | <NA> | <NA>
351 | <NA> | <NA> | <NA> | <NA>
352 | <NA> | <NA> | <NA> | <NA>

Duplicate rows

Most frequently occurring

(index) | 순번 | 판매점포명 | 소재지(도로명) | 데이터기준일자 | # duplicates
0 | <NA> | <NA> | <NA> | <NA> | 5