Overview

Dataset statistics

Number of variables: 7
Number of observations: 110
Missing cells: 5
Missing cells (%): 0.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.3 KiB
Average record size in memory: 58.2 B
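
The missing-cell percentage can be checked from the raw counts. This is a minimal sketch, assuming the report divides the number of missing cells by the total number of cells (observations times variables); the variable names are illustrative.

```python
# Counts taken from the dataset statistics above.
n_variables = 7
n_observations = 110
missing_cells = 5

# Assumption: missing-cell percentage = missing cells / all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 770
print(f"{missing_pct:.1f}%")  # 0.6%
```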

Variable types

Numeric: 1
Categorical: 3
Text: 3

Dataset

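Description: Data on discount partner stores of the Namdong-gu Volunteer Center in Incheon Metropolitan City. It provides the serial number (연번), business type (업종), business name (상호), address (주소), phone number (전화번호), discount rate (할인율), and data reference date (데이터기준일자).
URL: https://www.data.go.kr/data/15087743/fileData.do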

Alerts

데이터기준일자 has constant value "2023-07-24" (Constant)
업종 is highly overall correlated with 할인율 (High correlation)
할인율 is highly overall correlated with 업종 (High correlation)
전화번호 has 5 (4.5%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 17:14:11.940434
Analysis finished: 2023-12-12 17:14:12.624454
Duration: 0.68 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
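
A report of this kind could be regenerated roughly as follows. This is a sketch only: the function name, the file paths, and the default encoding are hypothetical, since the input file name and its encoding are not given here. `ProfileReport` and `to_file` are the ydata-profiling calls assumed to match the version listed above.

```python
def build_report(csv_path, html_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write the result as an HTML report.

    csv_path, html_path and encoding are hypothetical placeholders;
    the report does not name the input file or its encoding.
    """
    # Imported lazily so that defining the function needs no third-party package.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return report
```

If the downloaded CSV fails to decode as UTF-8, passing a different `encoding` (for example `"cp949"`) may be worth trying; that is an assumption about the file, not something stated in the report.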

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 110
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 55.5
Minimum: 1
Maximum: 110
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.45
Q1: 28.25
Median: 55.5
Q3: 82.75
95-th percentile: 104.55
Maximum: 110
Range: 109
Interquartile range (IQR): 54.5
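
Because 연번 is simply 1 through 110, the quantiles above can be reproduced by hand. The sketch below assumes the linear-interpolation rule that NumPy and pandas use by default; the helper name `percentile` is illustrative.

```python
import math

def percentile(sorted_values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    position = (len(sorted_values) - 1) * q / 100
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 111))  # 연번 runs from 1 to 110 without gaps

q1 = percentile(values, 25)
q3 = percentile(values, 75)
print(round(percentile(values, 5), 2))  # 6.45
print(q1, q3, round(q3 - q1, 2))        # 28.25 82.75 54.5
```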

Descriptive statistics

Standard deviation: 31.898276
Coefficient of variation (CV): 0.57474371
Kurtosis: -1.2
Mean: 55.5
Median Absolute Deviation (MAD): 27.5
Skewness: 0
Sum: 6105
Variance: 1017.5
Monotonicity: Strictly increasing
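
Several of these figures can be verified with the Python standard library. This is a sketch assuming the sample (n - 1) variance and a MAD defined as the median of absolute deviations from the median.

```python
import statistics

values = list(range(1, 111))  # 연번: 1, 2, ..., 110

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation

median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(sum(values), variance)            # 6105 1017.5
print(round(std, 6), round(cv, 6), mad)
```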
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.9%
71 | 1 | 0.9%
82 | 1 | 0.9%
81 | 1 | 0.9%
80 | 1 | 0.9%
79 | 1 | 0.9%
78 | 1 | 0.9%
77 | 1 | 0.9%
76 | 1 | 0.9%
75 | 1 | 0.9%
Other values (100) | 100 | 90.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
2 | 1 | 0.9%
3 | 1 | 0.9%
4 | 1 | 0.9%
5 | 1 | 0.9%
6 | 1 | 0.9%
7 | 1 | 0.9%
8 | 1 | 0.9%
9 | 1 | 0.9%
10 | 1 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
110 | 1 | 0.9%
109 | 1 | 0.9%
108 | 1 | 0.9%
107 | 1 | 0.9%
106 | 1 | 0.9%
105 | 1 | 0.9%
104 | 1 | 0.9%
103 | 1 | 0.9%
102 | 1 | 0.9%
101 | 1 | 0.9%

업종
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 22.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1012.0 B

Length

Max length: 8
Median length: 7
Mean length: 4.2454545
Min length: 2

Unique

Unique: 4
Unique (%): 3.6%

Sample

1st row: 병원
2nd row: 병원
3rd row: 병원
4th row: 병원
5th row: 병원

Common Values

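Value | Count | Frequency (%)
이미용 (hair and beauty) | 30 | 27.3%
식당/베이커리 (restaurant/bakery) | 23 | 20.9%
세탁소 (laundry) | 6 | 5.5%
자동차 정비 (auto repair) | 6 | 5.5%
생활/인테리어 (home goods/interior) | 6 | 5.5%
병원 (hospital) | 5 | 4.5%
건자재 (building materials) | 2 | 1.8%
보청기 (hearing aids) | 2 | 1.8%
안경 (eyewear) | 2 | 1.8%
화원 (florist) | 2 | 1.8%
Other values (15) | 26 | 23.6%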

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
이미용 | 30 | 25.6%
식당/베이커리 | 23 | 19.7%
세탁소 | 6 | 5.1%
자동차 | 6 | 5.1%
정비 | 6 | 5.1%
생활/인테리어 | 6 | 5.1%
병원 | 5 | 4.3%
잡화 | 2 | 1.7%
서비스업 | 2 | 1.7%
기타/소독 | 2 | 1.7%
Other values (17) | 29 | 24.8%

상호
Text

Distinct: 92
Distinct (%): 83.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1012.0 B

Length

Max length: 17
Median length: 13
Mean length: 7.4272727
Min length: 3

Characters and Unicode

Total characters: 817
Distinct characters: 247
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 74
Unique (%): 67.3%

Sample

1st row: 푸른세상안과
2nd row: 구월백세플란트치과의원
3rd row: 예온치과
4th row: 서울바른척도병원
5th row: 이젠성형외과

Value | Count | Frequency (%)
오토오아시스 | 4 | 2.8%
명품노래연습장 | 2 | 1.4%
솜틀이불집 | 2 | 1.4%
에칭스토리 | 2 | 1.4%
논현점 | 2 | 1.4%
파리바게뜨 | 2 | 1.4%
관인)제일요리학원 | 2 | 1.4%
간석장례식장 | 2 | 1.4%
간석장례문화원(구 | 2 | 1.4%
한국자세교정수련원 | 2 | 1.4%
Other values (104) | 122 | 84.7%

Most occurring characters

ValueCountFrequency (%)
34
 
4.2%
32
 
3.9%
21
 
2.6%
18
 
2.2%
17
 
2.1%
16
 
2.0%
16
 
2.0%
14
 
1.7%
14
 
1.7%
12
 
1.5%
Other values (237) 623
76.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 738 | 90.3%
Space Separator | 34 | 4.2%
Lowercase Letter | 12 | 1.5%
Close Punctuation | 10 | 1.2%
Open Punctuation | 10 | 1.2%
Uppercase Letter | 10 | 1.2%
Other Punctuation | 2 | 0.2%
Other Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
4.3%
21
 
2.8%
18
 
2.4%
17
 
2.3%
16
 
2.2%
16
 
2.2%
14
 
1.9%
14
 
1.9%
12
 
1.6%
12
 
1.6%
Other values (223) 566
76.7%
Lowercase Letter
ValueCountFrequency (%)
o 4
33.3%
s 2
16.7%
r 2
16.7%
t 2
16.7%
m 2
16.7%
Uppercase Letter
ValueCountFrequency (%)
M 3
30.0%
S 3
30.0%
T 2
20.0%
K 2
20.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 739 | 90.5%
Common | 56 | 6.9%
Latin | 22 | 2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
4.3%
21
 
2.8%
18
 
2.4%
17
 
2.3%
16
 
2.2%
16
 
2.2%
14
 
1.9%
14
 
1.9%
12
 
1.6%
12
 
1.6%
Other values (224) 567
76.7%
Latin
ValueCountFrequency (%)
o 4
18.2%
M 3
13.6%
S 3
13.6%
T 2
9.1%
K 2
9.1%
s 2
9.1%
r 2
9.1%
t 2
9.1%
m 2
9.1%
Common
ValueCountFrequency (%)
34
60.7%
) 10
 
17.9%
( 10
 
17.9%
. 2
 
3.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 738 | 90.3%
ASCII | 78 | 9.5%
None | 1 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34
43.6%
) 10
 
12.8%
( 10
 
12.8%
o 4
 
5.1%
M 3
 
3.8%
S 3
 
3.8%
T 2
 
2.6%
K 2
 
2.6%
s 2
 
2.6%
r 2
 
2.6%
Other values (3) 6
 
7.7%
Hangul
ValueCountFrequency (%)
32
 
4.3%
21
 
2.8%
18
 
2.4%
17
 
2.3%
16
 
2.2%
16
 
2.2%
14
 
1.9%
14
 
1.9%
12
 
1.6%
12
 
1.6%
Other values (223) 566
76.7%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct: 93
Distinct (%): 84.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1012.0 B

Length

Max length: 40
Median length: 34
Mean length: 22.327273
Min length: 14

Characters and Unicode

Total characters: 2456
Distinct characters: 109
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 76
Unique (%): 69.1%

Sample

1st row: 인천광역시 남동구 인하로 497-5
2nd row: 인천광역시 남동구 인하로 507번길 74
3rd row: 인천광역시 남동구 구월로 274 3층
4th row: 인천광역시 남동구 구월로233 (구월동SJ세종프라자)
5th row: 인천광역시 남동구 남동대로 899 4~5층

Value | Count | Frequency (%)
남동구 | 109 | 21.8%
인천광역시 | 107 | 21.4%
구월2동 | 10 | 2.0%
1층 | 7 | 1.4%
백범로 | 6 | 1.2%
남동대로 | 6 | 1.2%
만수동 | 5 | 1.0%
만수로 | 5 | 1.0%
논현동 | 4 | 0.8%
구월4동 | 4 | 0.8%
Other values (169) | 236 | 47.3%

Most occurring characters

ValueCountFrequency (%)
389
15.8%
183
 
7.5%
143
 
5.8%
125
 
5.1%
115
 
4.7%
112
 
4.6%
110
 
4.5%
109
 
4.4%
108
 
4.4%
1 106
 
4.3%
Other values (99) 956
38.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1458 | 59.4%
Decimal Number | 529 | 21.5%
Space Separator | 389 | 15.8%
Dash Punctuation | 54 | 2.2%
Other Punctuation | 9 | 0.4%
Open Punctuation | 7 | 0.3%
Close Punctuation | 7 | 0.3%
Uppercase Letter | 2 | 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
183
12.6%
143
 
9.8%
125
 
8.6%
115
 
7.9%
112
 
7.7%
110
 
7.5%
109
 
7.5%
108
 
7.4%
53
 
3.6%
30
 
2.1%
Other values (81) 370
25.4%
Decimal Number
ValueCountFrequency (%)
1 106
20.0%
2 88
16.6%
4 65
12.3%
3 53
10.0%
5 44
8.3%
7 43
8.1%
0 37
 
7.0%
9 35
 
6.6%
8 33
 
6.2%
6 25
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
J 1
50.0%
Space Separator
ValueCountFrequency (%)
389
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%
Other Punctuation
ValueCountFrequency (%)
, 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1458 | 59.4%
Common | 996 | 40.6%
Latin | 2 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
183
12.6%
143
 
9.8%
125
 
8.6%
115
 
7.9%
112
 
7.7%
110
 
7.5%
109
 
7.5%
108
 
7.4%
53
 
3.6%
30
 
2.1%
Other values (81) 370
25.4%
Common
ValueCountFrequency (%)
389
39.1%
1 106
 
10.6%
2 88
 
8.8%
4 65
 
6.5%
- 54
 
5.4%
3 53
 
5.3%
5 44
 
4.4%
7 43
 
4.3%
0 37
 
3.7%
9 35
 
3.5%
Other values (6) 82
 
8.2%
Latin
ValueCountFrequency (%)
S 1
50.0%
J 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1458 | 59.4%
ASCII | 998 | 40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
389
39.0%
1 106
 
10.6%
2 88
 
8.8%
4 65
 
6.5%
- 54
 
5.4%
3 53
 
5.3%
5 44
 
4.4%
7 43
 
4.3%
0 37
 
3.7%
9 35
 
3.5%
Other values (8) 84
 
8.4%
Hangul
ValueCountFrequency (%)
183
12.6%
143
 
9.8%
125
 
8.6%
115
 
7.9%
112
 
7.7%
110
 
7.5%
109
 
7.5%
108
 
7.4%
53
 
3.6%
30
 
2.1%
Other values (81) 370
25.4%

전화번호
Text

MISSING 

Distinct: 88
Distinct (%): 83.8%
Missing: 5
Missing (%): 4.5%
Memory size: 1012.0 B

Length

Max length: 12
Median length: 12
Mean length: 11.971429
Min length: 9

Characters and Unicode

Total characters: 1257
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 71
Unique (%): 67.6%

Sample

1st row: 032-431-9999
2nd row: 032-441-2282
3rd row: 032-469-2870
4th row: 1522-9988
5th row: 032-863-3993

Value | Count | Frequency (%)
032-271-2180 | 2 | 1.9%
032-463-0204 | 2 | 1.9%
032-468-7141 | 2 | 1.9%
032-425-8922 | 2 | 1.9%
032-429-2216 | 2 | 1.9%
032-468-0950 | 2 | 1.9%
032-464-1198 | 2 | 1.9%
032-472-8685 | 2 | 1.9%
032-465-6902 | 2 | 1.9%
032-422-3287 | 2 | 1.9%
Other values (78) | 85 | 81.0%

Most occurring characters

ValueCountFrequency (%)
- 209
16.6%
2 188
15.0%
3 176
14.0%
0 167
13.3%
4 140
11.1%
6 100
8.0%
8 61
 
4.9%
1 59
 
4.7%
5 56
 
4.5%
9 53
 
4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1048 | 83.4%
Dash Punctuation | 209 | 16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 188
17.9%
3 176
16.8%
0 167
15.9%
4 140
13.4%
6 100
9.5%
8 61
 
5.8%
1 59
 
5.6%
5 56
 
5.3%
9 53
 
5.1%
7 48
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 209
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1257
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 209
16.6%
2 188
15.0%
3 176
14.0%
0 167
13.3%
4 140
11.1%
6 100
8.0%
8 61
 
4.9%
1 59
 
4.7%
5 56
 
4.5%
9 53
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1257
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 209
16.6%
2 188
15.0%
3 176
14.0%
0 167
13.3%
4 140
11.1%
6 100
8.0%
8 61
 
4.9%
1 59
 
4.7%
5 56
 
4.5%
9 53
 
4.2%

할인율
Categorical

HIGH CORRELATION 

Distinct: 40
Distinct (%): 36.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1012.0 B

Length

Max length: 32
Median length: 19
Mean length: 8.7727273
Min length: 2

Unique

Unique: 21
Unique (%): 19.1%

Sample

1st row: 10~20%(비급여 일부품목)
2nd row: 10%(비급여일부)
3rd row: 일부품목(임플란트 80만원, 네비게이션 임플란트 90만원)
4th row: 일부품목 10%(비급여)
5th row: 일부품목 20%(수술외모든시술5%)

Common Values

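Value | Count | Frequency (%)
전품목 20% (all items 20%) | 15 | 13.6%
전품목 10% (all items 10%) | 12 | 10.9%
10% | 9 | 8.2%
전품목 5% (all items 5%) | 8 | 7.3%
5% | 7 | 6.4%
일부품목 10%(제품은 제외) (selected items 10%, products excluded) | 6 | 5.5%
일부품목30% (selected items 30%) | 6 | 5.5%
일부품목 10%(펌,염색 시술만) (selected items 10%, perm and dye services only) | 3 | 2.7%
전체품목 10% (all items 10%) | 3 | 2.7%
전체품목 5% (현금결제시) (all items 5%, cash payment) | 2 | 1.8%
Other values (30) | 39 | 35.5%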
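
The 할인율 values are free text, which is why the column is treated as categorical with 40 distinct values. If numeric rates are needed, one possible preprocessing step is to pull the percentages out with a regular expression. This is a sketch only; the function name and the handling of ranges such as `10~20%` are assumptions, not part of the report.

```python
import re

# "10~20%" style ranges: the lower bound carries no % sign of its own.
RANGE = re.compile(r"(\d+)\s*~\s*(\d+)\s*%")
SINGLE = re.compile(r"(\d+)\s*%")

def discount_percentages(text):
    """Return the sorted, de-duplicated percentages mentioned in a 할인율 string."""
    values = [int(low) for low, _high in RANGE.findall(text)]
    values.extend(int(v) for v in SINGLE.findall(text))
    return sorted(set(values))

print(discount_percentages("10~20%(비급여 일부품목)"))        # [10, 20]
print(discount_percentages("일부품목 20%(수술외모든시술5%)"))  # [5, 20]
print(discount_percentages("일부품목(임플란트 80만원, 네비게이션 임플란트 90만원)"))  # []
```

Entries that quote a price rather than a rate, like the third example, return an empty list and would need separate handling.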

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
전품목 | 36 | 17.6%
10 | 26 | 12.7%
5 | 19 | 9.3%
20 | 18 | 8.8%
일부품목 | 17 | 8.3%
제외 | 8 | 3.9%
전체품목 | 7 | 3.4%
10%(제품은 | 6 | 2.9%
일부품목30 | 6 | 2.9%
30 | 6 | 2.9%
Other values (41) | 55 | 27.0%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1012.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-07-24
2nd row: 2023-07-24
3rd row: 2023-07-24
4th row: 2023-07-24
5th row: 2023-07-24

Common Values

Value | Count | Frequency (%)
2023-07-24 | 110 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-07-24 | 110 | 100.0%

Interactions


Correlations

Variable | 연번 | 업종 | 상호 | 주소 | 전화번호 | 할인율
연번 | 1.000 | 0.871 | 0.000 | 0.000 | 0.000 | 0.746
업종 | 0.871 | 1.000 | 1.000 | 1.000 | 1.000 | 0.973
상호 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
주소 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
할인율 | 0.746 | 0.973 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 업종 | 할인율
업종 | 1.000 | 0.606
할인율 | 0.606 | 1.000

Variable | 연번 | 업종 | 할인율
연번 | 1.000 | 0.494 | 0.278
업종 | 0.494 | 1.000 | 0.606
할인율 | 0.278 | 0.606 | 1.000
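
The text does not say which coefficient each matrix reports. One common measure of association between two categorical columns such as 업종 and 할인율 is Cramér's V. The sketch below implements the plain, uncorrected form on made-up toy labels; it is an illustration of the measure, not a reproduction of the figures above.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V between two equal-length label sequences."""
    n = len(xs)
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    pair_counts = Counter(zip(xs, ys))

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, x_count in x_counts.items():
        for y, y_count in y_counts.items():
            expected = x_count * y_count / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(x_counts), len(y_counts)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Toy labels (illustrative only, not taken from the dataset).
print(cramers_v(["a", "a", "b", "b"], ["p", "p", "q", "q"]))  # 1.0
print(cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"]))  # 0.0
```

With many categories and only 110 rows, the uncorrected statistic tends to run high, so a bias-corrected variant may be preferable in practice.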

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 업종 | 상호 | 주소 | 전화번호 | 할인율 | 데이터기준일자
0 | 1 | 병원 | 푸른세상안과 | 인천광역시 남동구 인하로 497-5 | 032-431-9999 | 10~20%(비급여 일부품목) | 2023-07-24
1 | 2 | 병원 | 구월백세플란트치과의원 | 인천광역시 남동구 인하로 507번길 74 | 032-441-2282 | 10%(비급여일부) | 2023-07-24
2 | 3 | 병원 | 예온치과 | 인천광역시 남동구 구월로 274 3층 | 032-469-2870 | 일부품목(임플란트 80만원, 네비게이션 임플란트 90만원) | 2023-07-24
3 | 4 | 병원 | 서울바른척도병원 | 인천광역시 남동구 구월로233 (구월동SJ세종프라자) | 1522-9988 | 일부품목 10%(비급여) | 2023-07-24
4 | 5 | 병원 | 이젠성형외과 | 인천광역시 남동구 남동대로 899 4~5층 | 032-863-3993 | 일부품목 20%(수술외모든시술5%) | 2023-07-24
5 | 6 | 약국 | 한마음약국 | 인천광역시 남동구 만수로 14-70 | 032-461-0972 | 10%(일부품목, 처방전 제외) | 2023-07-24
6 | 7 | 보청기 | 금강보청기 인천구월센터 | 인천광역시 남동구 구월로 263 퍼스트하임프라자 2층 | 032-466-0006 | 25%(보청기) | 2023-07-24
7 | 8 | 보청기 | 조현난청연구소 벨톤보청기 | 인천광역시 남동구 구월로 223 위너스프라자 304호 | 032-432-1114 | 40% | 2023-07-24
8 | 9 | 안경 | 하이눈안경 | 인천광역시 남동구 석정로 507 | 032-425-1001 | 전품목 20% | 2023-07-24
9 | 10 | 안경 | 밝은안경콘택트 | 인천광역시 남동구 백범로 312번길 79 | 032-433-9789 | 전품목 20% | 2023-07-24

Last rows

Index | 연번 | 업종 | 상호 | 주소 | 전화번호 | 할인율 | 데이터기준일자
100 | 101 | 사진관 | 에칭스토리 | 인천광역시 남동구 구월2동 1248-44 | 032-271-2180 | 10%(사진,조각품) | 2023-07-24
101 | 102 | 자동차 정비 | 오토오아시스 논현신도시점 | 인천광역시 남동구 청릉대로 542 | 032-432-0129 | 일부품목30% | 2023-07-24
102 | 103 | 자동차 정비 | 오토오아시스오일점 | 인천광역시 남동구 구월동 1264-14 | 032-464-5154 | 일부품목30% | 2023-07-24
103 | 104 | 자동차 정비 | 오토오아시스 홈플러스구월점 | 인천광역시 남동구 예술로 198 | 032-432-7466 | 일부품목30% | 2023-07-24
104 | 105 | 부동산 | 주공공인 중개사 사무소 | 인천광역시 남동구 논현동 603-1 휴먼시아 상가 동산마을아파트 103호 | 032-446-9600 | 5% | 2023-07-24
105 | 106 | 이동통신 | (주)신이텔레콤(KT대리점) | 인천광역시 남동구 구월2동 1241-16 위너플라자 104, 105호 | 032-439-8010 | 30% | 2023-07-24
106 | 107 | 판촉물 | 평인상사 | 인천광역시 남동구 남동대로 898번길 3 | 032-426-2085 | 10% | 2023-07-24
107 | 108 | 도자기 | 해량도자기 | 인천광역시 남동구 구월2동 힐캐슬프라자 210호 | 032-461-1175 | 10%(일부품목) | 2023-07-24
108 | 109 | 자동차정비 | M.S motors(모토쓰) | 남동구 간석4동 237-2 | 032-428-8833 | 전품목 5% | 2023-07-24
109 | 110 | 기타/소독 | 클린존세븐 | 인천광역시 남동구 장승남로47번길 32-1, 202호(만수동) | <NA> | 일부품목 20%(가정집바퀴벌레소독) | 2023-07-24