Overview

Dataset statistics

Number of variables5
Number of observations89
Missing cells32
Missing cells (%)7.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory42.5 B

Variable types

Numeric1
Text4

Dataset

Description인천광역시 미추홀구의 페인트 판매업소에 대한 데이터입니다. 상호명, 도로명주소, 전화번호 등의 항목을 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15087078/fileData.do

Alerts

도로명주소 has 2 (2.2%) missing valuesMissing
전화번호 has 30 (33.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:56:12.424551
Analysis finished2023-12-12 06:56:13.464719
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45
Minimum1
Maximum89
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size933.0 B
2023-12-12T15:56:13.545578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.4
Q123
median45
Q367
95-th percentile84.6
Maximum89
Range88
Interquartile range (IQR)44

Descriptive statistics

Standard deviation25.836021
Coefficient of variation (CV)0.57413381
Kurtosis-1.2
Mean45
Median Absolute Deviation (MAD)22
Skewness0
Sum4005
Variance667.5
MonotonicityStrictly increasing
2023-12-12T15:56:13.687161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
68 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
60 1
 
1.1%
59 1
 
1.1%
Other values (79) 79
88.8%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%
81 1
1.1%
80 1
1.1%
Distinct87
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size844.0 B
2023-12-12T15:56:14.171090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length13
Mean length8.7752809
Min length4

Characters and Unicode

Total characters781
Distinct characters167
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)96.6%

Sample

1st rowKCC페인트대인페인트
2nd row노루페인트 천일피앤씨
3rd row삼화페인트 주안상사
4th row노루페인트 근원실업
5th row노루페인트 융신기업
ValueCountFrequency (%)
삼화페인트 15
 
11.3%
노루페인트 11
 
8.3%
제비스코 5
 
3.8%
페인트 3
 
2.3%
노루표페인트 2
 
1.5%
곰팡이방지페인트 1
 
0.8%
현광상사 1
 
0.8%
제아e&c 1
 
0.8%
에이치케이케미칼 1
 
0.8%
줄눈시공욕실줄눈시공페인트시공공사줄눈실리콘시공 1
 
0.8%
Other values (92) 92
69.2%
2023-12-12T15:56:14.484547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
10.1%
69
 
8.8%
69
 
8.8%
44
 
5.6%
31
 
4.0%
23
 
2.9%
22
 
2.8%
21
 
2.7%
15
 
1.9%
14
 
1.8%
Other values (157) 394
50.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 725
92.8%
Space Separator 44
 
5.6%
Uppercase Letter 11
 
1.4%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
10.9%
69
 
9.5%
69
 
9.5%
31
 
4.3%
23
 
3.2%
22
 
3.0%
21
 
2.9%
15
 
2.1%
14
 
1.9%
14
 
1.9%
Other values (152) 368
50.8%
Uppercase Letter
ValueCountFrequency (%)
C 7
63.6%
K 3
27.3%
E 1
 
9.1%
Space Separator
ValueCountFrequency (%)
44
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 725
92.8%
Common 45
 
5.8%
Latin 11
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
10.9%
69
 
9.5%
69
 
9.5%
31
 
4.3%
23
 
3.2%
22
 
3.0%
21
 
2.9%
15
 
2.1%
14
 
1.9%
14
 
1.9%
Other values (152) 368
50.8%
Latin
ValueCountFrequency (%)
C 7
63.6%
K 3
27.3%
E 1
 
9.1%
Common
ValueCountFrequency (%)
44
97.8%
& 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 725
92.8%
ASCII 56
 
7.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
79
 
10.9%
69
 
9.5%
69
 
9.5%
31
 
4.3%
23
 
3.2%
22
 
3.0%
21
 
2.9%
15
 
2.1%
14
 
1.9%
14
 
1.9%
Other values (152) 368
50.8%
ASCII
ValueCountFrequency (%)
44
78.6%
C 7
 
12.5%
K 3
 
5.4%
& 1
 
1.8%
E 1
 
1.8%
Distinct88
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size844.0 B
2023-12-12T15:56:14.838363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length29
Mean length21.280899
Min length14

Characters and Unicode

Total characters1894
Distinct characters57
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique87 ?
Unique (%)97.8%

Sample

1st row인천광역시 미추홀구 숭의동 170-34
2nd row인천광역시 미추홀구 주안동 761-1 1층 노루페인트 천일피앤씨
3rd row인천광역시 미추홀구 주안동 406-21
4th row인천광역시 미추홀구 용현동 39-25
5th row인천광역시 미추홀구 학익동 587-70
ValueCountFrequency (%)
인천광역시 89
24.0%
미추홀구 89
24.0%
주안동 35
 
9.4%
숭의동 15
 
4.0%
도화동 15
 
4.0%
용현동 11
 
3.0%
학익동 9
 
2.4%
1층 3
 
0.8%
노루페인트 2
 
0.5%
관교동 2
 
0.5%
Other values (99) 101
27.2%
2023-12-12T15:56:15.363159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
286
 
15.1%
93
 
4.9%
91
 
4.8%
90
 
4.8%
89
 
4.7%
89
 
4.7%
89
 
4.7%
89
 
4.7%
89
 
4.7%
89
 
4.7%
Other values (47) 800
42.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1113
58.8%
Decimal Number 412
 
21.8%
Space Separator 286
 
15.1%
Dash Punctuation 83
 
4.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
8.4%
91
8.2%
90
8.1%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
Other values (35) 216
19.4%
Decimal Number
ValueCountFrequency (%)
1 83
20.1%
4 53
12.9%
2 52
12.6%
6 43
10.4%
8 34
8.3%
3 33
 
8.0%
7 32
 
7.8%
5 30
 
7.3%
0 29
 
7.0%
9 23
 
5.6%
Space Separator
ValueCountFrequency (%)
286
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 83
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1113
58.8%
Common 781
41.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
8.4%
91
8.2%
90
8.1%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
Other values (35) 216
19.4%
Common
ValueCountFrequency (%)
286
36.6%
- 83
 
10.6%
1 83
 
10.6%
4 53
 
6.8%
2 52
 
6.7%
6 43
 
5.5%
8 34
 
4.4%
3 33
 
4.2%
7 32
 
4.1%
5 30
 
3.8%
Other values (2) 52
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1113
58.8%
ASCII 781
41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
286
36.6%
- 83
 
10.6%
1 83
 
10.6%
4 53
 
6.8%
2 52
 
6.7%
6 43
 
5.5%
8 34
 
4.4%
3 33
 
4.2%
7 32
 
4.1%
5 30
 
3.8%
Other values (2) 52
 
6.7%
Hangul
ValueCountFrequency (%)
93
8.4%
91
8.2%
90
8.1%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
89
8.0%
Other values (35) 216
19.4%

도로명주소
Text

MISSING 

Distinct87
Distinct (%)100.0%
Missing2
Missing (%)2.2%
Memory size844.0 B
2023-12-12T15:56:15.737243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length30
Mean length21.850575
Min length17

Characters and Unicode

Total characters1901
Distinct characters113
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique87 ?
Unique (%)100.0%

Sample

1st row인천광역시 미추홀구 독배로 470
2nd row인천광역시 미추홀구 인주대로 302-1 1층 노루페인트 천일피앤씨
3rd row인천광역시 미추홀구 경인로 412
4th row인천광역시 미추홀구 인하로 160
5th row인천광역시 미추홀구 아암대로253번길 37
ValueCountFrequency (%)
인천광역시 87
23.2%
미추홀구 87
23.2%
인주대로 6
 
1.6%
경인로 5
 
1.3%
한나루로 4
 
1.1%
12 3
 
0.8%
연송로 3
 
0.8%
51 3
 
0.8%
석정로 3
 
0.8%
경원대로 3
 
0.8%
Other values (151) 171
45.6%
2023-12-12T15:56:16.242604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
288
 
15.1%
115
 
6.0%
92
 
4.8%
91
 
4.8%
91
 
4.8%
90
 
4.7%
87
 
4.6%
87
 
4.6%
87
 
4.6%
87
 
4.6%
Other values (103) 786
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1233
64.9%
Decimal Number 352
 
18.5%
Space Separator 288
 
15.1%
Dash Punctuation 21
 
1.1%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%
Uppercase Letter 2
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
 
9.3%
92
 
7.5%
91
 
7.4%
91
 
7.4%
90
 
7.3%
87
 
7.1%
87
 
7.1%
87
 
7.1%
87
 
7.1%
85
 
6.9%
Other values (86) 321
26.0%
Decimal Number
ValueCountFrequency (%)
1 63
17.9%
3 52
14.8%
2 46
13.1%
5 34
9.7%
4 32
9.1%
6 31
8.8%
7 27
7.7%
0 24
 
6.8%
8 23
 
6.5%
9 20
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
F 1
50.0%
Space Separator
ValueCountFrequency (%)
288
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1233
64.9%
Common 666
35.0%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
 
9.3%
92
 
7.5%
91
 
7.4%
91
 
7.4%
90
 
7.3%
87
 
7.1%
87
 
7.1%
87
 
7.1%
87
 
7.1%
85
 
6.9%
Other values (86) 321
26.0%
Common
ValueCountFrequency (%)
288
43.2%
1 63
 
9.5%
3 52
 
7.8%
2 46
 
6.9%
5 34
 
5.1%
4 32
 
4.8%
6 31
 
4.7%
7 27
 
4.1%
0 24
 
3.6%
8 23
 
3.5%
Other values (5) 46
 
6.9%
Latin
ValueCountFrequency (%)
B 1
50.0%
F 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1233
64.9%
ASCII 668
35.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
288
43.1%
1 63
 
9.4%
3 52
 
7.8%
2 46
 
6.9%
5 34
 
5.1%
4 32
 
4.8%
6 31
 
4.6%
7 27
 
4.0%
0 24
 
3.6%
8 23
 
3.4%
Other values (7) 48
 
7.2%
Hangul
ValueCountFrequency (%)
115
 
9.3%
92
 
7.5%
91
 
7.4%
91
 
7.4%
90
 
7.3%
87
 
7.1%
87
 
7.1%
87
 
7.1%
87
 
7.1%
85
 
6.9%
Other values (86) 321
26.0%

전화번호
Text

MISSING 

Distinct59
Distinct (%)100.0%
Missing30
Missing (%)33.7%
Memory size844.0 B
2023-12-12T15:56:16.532884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.542373
Min length12

Characters and Unicode

Total characters740
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row032-882-2077
2nd row032-866-7007
3rd row032-424-2888
4th row032-863-4923
5th row032-833-4000
ValueCountFrequency (%)
032-882-2077 1
 
1.7%
0507-1362-7809 1
 
1.7%
070-4560-4874 1
 
1.7%
032-868-6446 1
 
1.7%
0507-1383-6236 1
 
1.7%
032-873-4642 1
 
1.7%
032-763-8296 1
 
1.7%
032-888-5800 1
 
1.7%
032-833-5169 1
 
1.7%
0507-1420-4688 1
 
1.7%
Other values (49) 49
83.1%
2023-12-12T15:56:16.939958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 118
15.9%
- 118
15.9%
3 92
12.4%
2 89
12.0%
8 68
9.2%
7 66
8.9%
4 56
7.6%
6 39
 
5.3%
5 37
 
5.0%
1 32
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 622
84.1%
Dash Punctuation 118
 
15.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 118
19.0%
3 92
14.8%
2 89
14.3%
8 68
10.9%
7 66
10.6%
4 56
9.0%
6 39
 
6.3%
5 37
 
5.9%
1 32
 
5.1%
9 25
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 118
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 740
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 118
15.9%
- 118
15.9%
3 92
12.4%
2 89
12.0%
8 68
9.2%
7 66
8.9%
4 56
7.6%
6 39
 
5.3%
5 37
 
5.0%
1 32
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 740
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 118
15.9%
- 118
15.9%
3 92
12.4%
2 89
12.0%
8 68
9.2%
7 66
8.9%
4 56
7.6%
6 39
 
5.3%
5 37
 
5.0%
1 32
 
4.3%

Interactions

2023-12-12T15:56:13.150687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:56:17.035152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호명지번주소도로명주소전화번호
연번1.0000.8730.9311.0001.000
상호명0.8731.0000.9961.0001.000
지번주소0.9310.9961.0001.0001.000
도로명주소1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000

Missing values

2023-12-12T15:56:13.256559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:56:13.342570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:56:13.424776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번상호명지번주소도로명주소전화번호
01KCC페인트대인페인트인천광역시 미추홀구 숭의동 170-34인천광역시 미추홀구 독배로 470032-882-2077
12노루페인트 천일피앤씨인천광역시 미추홀구 주안동 761-1 1층 노루페인트 천일피앤씨인천광역시 미추홀구 인주대로 302-1 1층 노루페인트 천일피앤씨032-866-7007
23삼화페인트 주안상사인천광역시 미추홀구 주안동 406-21인천광역시 미추홀구 경인로 412032-424-2888
34노루페인트 근원실업인천광역시 미추홀구 용현동 39-25인천광역시 미추홀구 인하로 160032-863-4923
45노루페인트 융신기업인천광역시 미추홀구 학익동 587-70인천광역시 미추홀구 아암대로253번길 37032-833-4000
56제비스코 우성상사인천광역시 미추홀구 숭의동 431-5인천광역시 미추홀구 석정로 41 (숭의동)032-882-6333
67노루페인트 세계상사인천광역시 미추홀구 관교동 463인천광역시 미추홀구 경원대로716번길 3032-438-1886
78삼화페인트인천광역시 미추홀구 관교동 467-7인천광역시 미추홀구 인하로396번길 27<NA>
89KCC페인트청용상사인천광역시 미추홀구 용현동 628-84인천광역시 미추홀구 낙섬서로 53032-884-7749
910노루페인트 마고도장인천광역시 미추홀구 도화동 747 노루페인트 마고도장인천광역시 미추홀구 송림로 323-1 노루페인트 마고도장032-883-3366
연번상호명지번주소도로명주소전화번호
7980청년시공팀 인천점인천광역시 미추홀구 학익동 665-5인천광역시 미추홀구 매소홀로 486 에덴에셀빌딩 B1F0507-1371-8485
8081로즈인테리어인천광역시 미추홀구 학익동 128-28인천광역시 미추홀구 소성로 2010507-1408-5921
8182풍경인테리어인천광역시 미추홀구 학익동 698-3인천광역시 미추홀구 한나루로357번길 5-32032-204-8204
8283바름시공 인천지사인천광역시 미추홀구 숭의동 8-217인천광역시 미추홀구 수봉로95번길 35<NA>
8384이음티앤씨인천광역시 미추홀구 숭의동 169-50 1104호인천광역시 미추홀구 미추로19번길 21 1104호<NA>
8485반딧불이 인천중구점인천광역시 미추홀구 도화동 108-1인천광역시 미추홀구 장고개로36번길 36<NA>
8586신영산업개발인천광역시 미추홀구 주안동 761-4인천광역시 미추홀구 한나루로502번길 470507-1310-1990
8687태경종합관리시스템인천광역시 미추홀구 숭의동 285-3 2층인천광역시 미추홀구 독정이로 51 2층032-884-9112
8788동서도장건업인천광역시 미추홀구 학익동 261-9인천광역시 미추홀구 소성로135번길 25-160507-1385-4939
8889주안인테리어인천광역시 미추홀구 주안동 11-40인천광역시 미추홀구 석정로351번길 51<NA>