Overview

Dataset statistics

Number of variables: 5
Number of observations: 52
Missing cells: 16
Missing cells (%): 6.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.2 KiB
Average record size in memory: 43.5 B
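
The missing-cell percentage can be re-derived from the other figures in this table. The following is a minimal sketch, assuming the percentage is simply missing cells divided by total cells (variables × observations); the variable names are illustrative and not taken from the profiling tool.

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 52
missing_cells = 16

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of cells that are empty

print(f"{missing_pct:.1f}%")
```

All 16 missing cells belong to 소재지전화번호, which is why that single column reports 30.8% missing (16 of 52) while the dataset as a whole reports 6.2%.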

Variable types

Numeric: 1
Text: 3
DateTime: 1

Dataset

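Description: Data on the status of telephone solicitation sales (telemarketing) businesses in Namdong-gu, Incheon Metropolitan City. It provides the following fields: serial number (연번), business name (상호명), location address (소재지주소), location phone number (소재지전화번호), and data reference date (데이터기준일자).
Author: Namdong-gu, Incheon Metropolitan City (인천광역시 남동구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15067145&srcSe=7661IVAWM27C61E190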

Alerts

데이터기준일자 has constant value "2023-05-11" (Constant)
소재지전화번호 has 16 (30.8%) missing values (Missing)
번호 has unique values (Unique)
상호명 has unique values (Unique)
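
The three alert types above (Constant, Missing, Unique) follow from simple per-column counts. The sketch below is one plausible way to derive them and is not ydata-profiling's actual implementation; `column_alerts` is a hypothetical helper, and `None` is assumed to mark a missing cell.

```python
def column_alerts(values):
    """Return alert labels for one column; None marks a missing cell."""
    alerts = []
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    if n_missing:
        alerts.append(f"Missing ({100 * n_missing / len(values):.1f}%)")
    distinct = set(present)
    if len(distinct) == 1:
        # Only one value ever appears in the column.
        alerts.append("Constant")
    elif present and len(distinct) == len(values):
        # No missing cells and no repeated values.
        alerts.append("Unique")
    return alerts


# First five rows of the sample table in this report (a subset, not all 52 rows).
columns = {
    "번호": [1, 2, 3, 4, 5],
    "상호명": ["이룸컴퍼니", "주식회사 드림솔루텍", "주식회사 리치인입찰",
               "아울렛네트웍스", "코드컴퍼니"],
    "소재지전화번호": [None, "032-422-1065", None, None, "032-212-3900"],
    "데이터기준일자": ["2023-05-11"] * 5,
}

for name, values in columns.items():
    print(name, column_alerts(values))
```

Because only five rows are used here, the missing percentage printed for 소재지전화번호 differs from the 30.8% reported for the full dataset.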

Reproduction

Analysis started: 2024-04-17 10:35:47.247069
Analysis finished: 2024-04-17 10:35:47.673968
Duration: 0.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 52
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26.5
Minimum: 1
Maximum: 52
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 600.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.55
Q1: 13.75
Median: 26.5
Q3: 39.25
95-th percentile: 49.45
Maximum: 52
Range: 51
Interquartile range (IQR): 25.5

Descriptive statistics

Standard deviation: 15.154757
Coefficient of variation (CV): 0.57187763
Kurtosis: -1.2
Mean: 26.5
Median Absolute Deviation (MAD): 13
Skewness: 0
Sum: 1378
Variance: 229.66667
Monotonicity: Strictly increasing
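
These figures are what one would expect if 번호 is simply the sequence 1 to 52. The sketch below checks that with the Python standard library. It assumes the sample (n − 1) variance and linear-interpolation percentiles, which match the reported numbers; `percentile` is an illustrative helper, not a function of the profiling tool.

```python
import statistics

# Assumption: 번호 holds exactly the integers 1, 2, ..., 52.
values = list(range(1, 53))


def percentile(sorted_values, p):
    """Linear-interpolation percentile for p in [0, 1] on sorted data."""
    pos = p * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divides by n - 1
std = statistics.stdev(values)
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
cv = std / mean

print(mean, round(variance, 5), round(std, 6), round(cv, 8), mad)
print(percentile(values, 0.25), percentile(values, 0.75))
```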
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.9%
28 | 1 | 1.9%
30 | 1 | 1.9%
31 | 1 | 1.9%
32 | 1 | 1.9%
33 | 1 | 1.9%
34 | 1 | 1.9%
35 | 1 | 1.9%
36 | 1 | 1.9%
37 | 1 | 1.9%
Other values (42) | 42 | 80.8%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.9%
2 | 1 | 1.9%
3 | 1 | 1.9%
4 | 1 | 1.9%
5 | 1 | 1.9%
6 | 1 | 1.9%
7 | 1 | 1.9%
8 | 1 | 1.9%
9 | 1 | 1.9%
10 | 1 | 1.9%
Maximum 10 values
Value | Count | Frequency (%)
52 | 1 | 1.9%
51 | 1 | 1.9%
50 | 1 | 1.9%
49 | 1 | 1.9%
48 | 1 | 1.9%
47 | 1 | 1.9%
46 | 1 | 1.9%
45 | 1 | 1.9%
44 | 1 | 1.9%
43 | 1 | 1.9%

상호명
Text

UNIQUE 

Distinct: 52
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 16
Median length: 11.5
Mean length: 7.4615385
Min length: 2

Characters and Unicode

Total characters: 388
Distinct characters: 137
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
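
As an illustration of those character properties, the sketch below counts the Unicode general category of each character in one business name from this dataset, using Python's standard `unicodedata` module. It is not the profiling tool's own code, and `category_counts` is a hypothetical helper.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


name = "(주)파트너스 지원센터"  # a business name from the sample rows
counts = category_counts(name)
print(counts)
```

The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter (Hangul syllables fall here), `Zs` is Space Separator, `Ps` and `Pe` are Open and Close Punctuation, `Nd` is Decimal Number, `Pd` is Dash Punctuation, and `Lu` and `Ll` are Uppercase and Lowercase Letter.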

Unique

Unique: 52
Unique (%): 100.0%

Sample

1st row: 이룸컴퍼니
2nd row: 주식회사 드림솔루텍
3rd row: 주식회사 리치인입찰
4th row: 아울렛네트웍스
5th row: 코드컴퍼니
Value | Count | Frequency (%)
주식회사 | 18 | 24.0%
이룸컴퍼니 | 1 | 1.3%
인천전기통신 | 1 | 1.3%
셀리턴홀딩스 | 1 | 1.3%
에스티지24 | 1 | 1.3%
고미텔레콤 | 1 | 1.3%
듀오정보(주 | 1 | 1.3%
인천 | 1 | 1.3%
경기지사 | 1 | 1.3%
애드어네스 | 1 | 1.3%
Other values (48) | 48 | 64.0%

Most occurring characters

Value | Count | Frequency (%)
(space) | 24 | 6.2%
[?] | 23 | 5.9%
[?] | 21 | 5.4%
[?] | 19 | 4.9%
[?] | 19 | 4.9%
[?] | 14 | 3.6%
[?] | 12 | 3.1%
[?] | 10 | 2.6%
[?] | 10 | 2.6%
[?] | 10 | 2.6%
Other values (127) | 226 | 58.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 345 | 88.9%
Space Separator | 24 | 6.2%
Close Punctuation | 5 | 1.3%
Open Punctuation | 5 | 1.3%
Decimal Number | 5 | 1.3%
Uppercase Letter | 2 | 0.5%
Lowercase Letter | 2 | 0.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 23 | 6.7%
[?] | 21 | 6.1%
[?] | 19 | 5.5%
[?] | 19 | 5.5%
[?] | 14 | 4.1%
[?] | 12 | 3.5%
[?] | 10 | 2.9%
[?] | 10 | 2.9%
[?] | 10 | 2.9%
[?] | 7 | 2.0%
Other values (115) | 200 | 58.0%
Decimal Number
Value | Count | Frequency (%)
4 | 1 | 20.0%
2 | 1 | 20.0%
9 | 1 | 20.0%
6 | 1 | 20.0%
3 | 1 | 20.0%
Uppercase Letter
Value | Count | Frequency (%)
S | 1 | 50.0%
B | 1 | 50.0%
Lowercase Letter
Value | Count | Frequency (%)
a | 1 | 50.0%
c | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 24 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 345 | 88.9%
Common | 39 | 10.1%
Latin | 4 | 1.0%

Most frequent character per script

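Hangul
Value | Count | Frequency (%)
[?] | 23 | 6.7%
[?] | 21 | 6.1%
[?] | 19 | 5.5%
[?] | 19 | 5.5%
[?] | 14 | 4.1%
[?] | 12 | 3.5%
[?] | 10 | 2.9%
[?] | 10 | 2.9%
[?] | 10 | 2.9%
[?] | 7 | 2.0%
Other values (115) | 200 | 58.0%
Common
Value | Count | Frequency (%)
(space) | 24 | 61.5%
) | 5 | 12.8%
( | 5 | 12.8%
4 | 1 | 2.6%
2 | 1 | 2.6%
9 | 1 | 2.6%
6 | 1 | 2.6%
3 | 1 | 2.6%
Latin
Value | Count | Frequency (%)
S | 1 | 25.0%
B | 1 | 25.0%
a | 1 | 25.0%
c | 1 | 25.0%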

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 345 | 88.9%
ASCII | 43 | 11.1%

Most frequent character per block

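ASCII
Value | Count | Frequency (%)
(space) | 24 | 55.8%
) | 5 | 11.6%
( | 5 | 11.6%
4 | 1 | 2.3%
2 | 1 | 2.3%
S | 1 | 2.3%
B | 1 | 2.3%
9 | 1 | 2.3%
6 | 1 | 2.3%
3 | 1 | 2.3%
Other values (2) | 2 | 4.7%
Hangul
Value | Count | Frequency (%)
[?] | 23 | 6.7%
[?] | 21 | 6.1%
[?] | 19 | 5.5%
[?] | 19 | 5.5%
[?] | 14 | 4.1%
[?] | 12 | 3.5%
[?] | 10 | 2.9%
[?] | 10 | 2.9%
[?] | 10 | 2.9%
[?] | 7 | 2.0%
Other values (115) | 200 | 58.0%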

소재지주소
Text

Distinct: 51
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 54
Median length: 42
Mean length: 34
Min length: 23

Characters and Unicode

Total characters: 1768
Distinct characters: 113
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 96.2%

Sample

1st row: 인천광역시 남동구 인주대로623번길 11 401-1호 (구월동)
2nd row: 인천광역시 남동구 예술로 230 201호 (구월동)
3rd row: 인천광역시 남동구 은청로 18 205호 (고잔동)
4th row: 인천광역시 남동구 선수촌공원로23번길 11 502호 내부 제7호 (구월동)
5th row: 인천광역시 남동구 미래로 14 603호 (구월동)
Value | Count | Frequency (%)
인천광역시 | 52 | 15.1%
남동구 | 52 | 15.1%
구월동 | 29 | 8.4%
간석동 | 8 | 2.3%
4층 | 7 | 2.0%
2층 | 6 | 1.7%
논현동 | 5 | 1.4%
1층 | 5 | 1.4%
만수동 | 4 | 1.2%
서창동 | 4 | 1.2%
Other values (137) | 173 | 50.1%

Most occurring characters

Value | Count | Frequency (%)
(space) | 349 | 19.7%
[?] | 116 | 6.6%
[?] | 89 | 5.0%
[?] | 67 | 3.8%
[?] | 66 | 3.7%
1 | 62 | 3.5%
[?] | 57 | 3.2%
2 | 54 | 3.1%
[?] | 53 | 3.0%
[?] | 52 | 2.9%
Other values (103) | 803 | 45.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 963 | 54.5%
Space Separator | 349 | 19.7%
Decimal Number | 333 | 18.8%
Close Punctuation | 52 | 2.9%
Open Punctuation | 52 | 2.9%
Dash Punctuation | 12 | 0.7%
Uppercase Letter | 7 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 116 | 12.0%
[?] | 89 | 9.2%
[?] | 67 | 7.0%
[?] | 66 | 6.9%
[?] | 57 | 5.9%
[?] | 53 | 5.5%
[?] | 52 | 5.4%
[?] | 52 | 5.4%
[?] | 52 | 5.4%
[?] | 37 | 3.8%
Other values (86) | 322 | 33.4%
Decimal Number
Value | Count | Frequency (%)
1 | 62 | 18.6%
2 | 54 | 16.2%
0 | 47 | 14.1%
4 | 40 | 12.0%
3 | 34 | 10.2%
5 | 26 | 7.8%
8 | 21 | 6.3%
7 | 17 | 5.1%
6 | 17 | 5.1%
9 | 15 | 4.5%
Uppercase Letter
Value | Count | Frequency (%)
K | 3 | 42.9%
T | 3 | 42.9%
B | 1 | 14.3%
Space Separator
Value | Count | Frequency (%)
(space) | 349 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 52 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 52 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 12 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 963 | 54.5%
Common | 798 | 45.1%
Latin | 7 | 0.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 116 | 12.0%
[?] | 89 | 9.2%
[?] | 67 | 7.0%
[?] | 66 | 6.9%
[?] | 57 | 5.9%
[?] | 53 | 5.5%
[?] | 52 | 5.4%
[?] | 52 | 5.4%
[?] | 52 | 5.4%
[?] | 37 | 3.8%
Other values (86) | 322 | 33.4%
Common
Value | Count | Frequency (%)
(space) | 349 | 43.7%
1 | 62 | 7.8%
2 | 54 | 6.8%
) | 52 | 6.5%
( | 52 | 6.5%
0 | 47 | 5.9%
4 | 40 | 5.0%
3 | 34 | 4.3%
5 | 26 | 3.3%
8 | 21 | 2.6%
Other values (4) | 61 | 7.6%
Latin
Value | Count | Frequency (%)
K | 3 | 42.9%
T | 3 | 42.9%
B | 1 | 14.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 963 | 54.5%
ASCII | 805 | 45.5%

Most frequent character per block

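ASCII
Value | Count | Frequency (%)
(space) | 349 | 43.4%
1 | 62 | 7.7%
2 | 54 | 6.7%
) | 52 | 6.5%
( | 52 | 6.5%
0 | 47 | 5.8%
4 | 40 | 5.0%
3 | 34 | 4.2%
5 | 26 | 3.2%
8 | 21 | 2.6%
Other values (7) | 68 | 8.4%
Hangul
Value | Count | Frequency (%)
[?] | 116 | 12.0%
[?] | 89 | 9.2%
[?] | 67 | 7.0%
[?] | 66 | 6.9%
[?] | 57 | 5.9%
[?] | 53 | 5.5%
[?] | 52 | 5.4%
[?] | 52 | 5.4%
[?] | 52 | 5.4%
[?] | 37 | 3.8%
Other values (86) | 322 | 33.4%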

소재지전화번호
Text

MISSING 

Distinct: 35
Distinct (%): 97.2%
Missing: 16
Missing (%): 30.8%
Memory size: 548.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.694444
Min length: 9

Characters and Unicode

Total characters: 421
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 94.4%

Sample

1st row: 032-422-1065
2nd row: 032-212-3900
3rd row: 032-466-4433
4th row: 032-215-7100
5th row: 032-238-1700
Value | Count | Frequency (%)
032-422-1065 | 2 | 5.6%
1566-2705 | 1 | 2.8%
032-505-8702 | 1 | 2.8%
032-715-8420 | 1 | 2.8%
032-715-6167 | 1 | 2.8%
032-715-8413 | 1 | 2.8%
070-4185-9721 | 1 | 2.8%
032-422-8300 | 1 | 2.8%
032-819-9999 | 1 | 2.8%
032-930-1742 | 1 | 2.8%
Other values (25) | 25 | 69.4%

Most occurring characters

Value | Count | Frequency (%)
0 | 71 | 16.9%
- | 67 | 15.9%
2 | 54 | 12.8%
3 | 46 | 10.9%
1 | 36 | 8.6%
6 | 31 | 7.4%
4 | 28 | 6.7%
7 | 26 | 6.2%
5 | 23 | 5.5%
9 | 23 | 5.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 354 | 84.1%
Dash Punctuation | 67 | 15.9%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 71 | 20.1%
2 | 54 | 15.3%
3 | 46 | 13.0%
1 | 36 | 10.2%
6 | 31 | 8.8%
4 | 28 | 7.9%
7 | 26 | 7.3%
5 | 23 | 6.5%
9 | 23 | 6.5%
8 | 16 | 4.5%
Dash Punctuation
Value | Count | Frequency (%)
- | 67 | 100.0%

Most occurring scripts

ValueCountFrequency (%)
Common | 421 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 71 | 16.9%
- | 67 | 15.9%
2 | 54 | 12.8%
3 | 46 | 10.9%
1 | 36 | 8.6%
6 | 31 | 7.4%
4 | 28 | 6.7%
7 | 26 | 6.2%
5 | 23 | 5.5%
9 | 23 | 5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII | 421 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 71 | 16.9%
- | 67 | 15.9%
2 | 54 | 12.8%
3 | 46 | 10.9%
1 | 36 | 8.6%
6 | 31 | 7.4%
4 | 28 | 6.7%
7 | 26 | 6.2%
5 | 23 | 5.5%
9 | 23 | 5.5%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B
Minimum: 2023-05-11 00:00:00
Maximum: 2023-05-11 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 번호 | 상호명 | 소재지주소 | 소재지전화번호
번호 | 1.000 | 1.000 | 0.929 | 0.926
상호명 | 1.000 | 1.000 | 1.000 | 1.000
소재지주소 | 0.929 | 1.000 | 1.000 | 1.000
소재지전화번호 | 0.926 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
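
Since the plots themselves are not reproduced here, the sketch below shows the idea of a nullity matrix in plain text: one line per row, one character per column, marking whether the cell is present. It uses the first five rows of this dataset reduced to two columns. `nullity_matrix` is a hypothetical helper, and `None` is assumed to mark a missing cell.

```python
# First five sample rows, reduced to two columns for brevity.
rows = [
    {"번호": 1, "소재지전화번호": None},
    {"번호": 2, "소재지전화번호": "032-422-1065"},
    {"번호": 3, "소재지전화번호": None},
    {"번호": 4, "소재지전화번호": None},
    {"번호": 5, "소재지전화번호": "032-212-3900"},
]
columns = ["번호", "소재지전화번호"]


def nullity_matrix(rows, columns):
    """One string per row: '#' where a value is present, '.' where it is missing."""
    return ["".join("#" if row[c] is not None else "." for c in columns)
            for row in rows]


matrix = nullity_matrix(rows, columns)
print("\n".join(matrix))

# Nullity by column: number of present values per column.
present = {c: sum(row[c] is not None for row in rows) for c in columns}
print(present)
```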

Sample

First 10 rows
index | 번호 | 상호명 | 소재지주소 | 소재지전화번호 | 데이터기준일자
0 | 1 | 이룸컴퍼니 | 인천광역시 남동구 인주대로623번길 11 401-1호 (구월동) | <NA> | 2023-05-11
1 | 2 | 주식회사 드림솔루텍 | 인천광역시 남동구 예술로 230 201호 (구월동) | 032-422-1065 | 2023-05-11
2 | 3 | 주식회사 리치인입찰 | 인천광역시 남동구 은청로 18 205호 (고잔동) | <NA> | 2023-05-11
3 | 4 | 아울렛네트웍스 | 인천광역시 남동구 선수촌공원로23번길 11 502호 내부 제7호 (구월동) | <NA> | 2023-05-11
4 | 5 | 코드컴퍼니 | 인천광역시 남동구 미래로 14 603호 (구월동) | 032-212-3900 | 2023-05-11
5 | 6 | 주식회사 다움 | 인천광역시 남동구 서창남로 51 1층 108호 (서창동) | 032-466-4433 | 2023-05-11
6 | 7 | 369솔루션 | 인천광역시 남동구 예술로192번길 31 5층 502호 (구월동) | <NA> | 2023-05-11
7 | 8 | 주식회사 더함코리아 | 인천광역시 남동구 용천로 3 410호 (구월동) | <NA> | 2023-05-11
8 | 9 | 큐엔컴퍼니 | 인천광역시 남동구 구월남로 148 403호 (구월동) | 032-215-7100 | 2023-05-11
9 | 10 | 아울렛컴퍼니 | 인천광역시 남동구 선수촌공원로23번길 11 504호 내부19호 (구월동) | 032-238-1700 | 2023-05-11

Last 10 rows
index | 번호 | 상호명 | 소재지주소 | 소재지전화번호 | 데이터기준일자
42 | 43 | 주식회사 리드캡 | 인천광역시 남동구 인하로 497-8 1층 (구월동) | <NA> | 2023-05-11
43 | 44 | (주)파트너스 지원센터 | 인천광역시 남동구 성리로 6 2층 (구월동) | 032-713-0000 | 2023-05-11
44 | 45 | 해움 | 인천광역시 남동구 경인로644번길 98 4층 (간석동) | 070-8666-6545 | 2023-05-11
45 | 46 | 주식회사 드림네트웍스 | 인천광역시 남동구 예술로 230 청진네오스빌 202 203 204호 (구월동) | 032-422-1065 | 2023-05-11
46 | 47 | 주식회사에스엔쏠루션 | 인천광역시 남동구 석산로 112 (간석동) | <NA> | 2023-05-11
47 | 48 | 케이앤티 | 인천광역시 남동구 예술로 206 3층 제비305호 (구월동 구월중앙프라자) | 1600-8584 | 2023-05-11
48 | 49 | 스마트비컴퍼니 | 인천광역시 남동구 인주대로591번길 64 305호 (구월동) | 070-4617-1550 | 2023-05-11
49 | 50 | (주)리더엘컴퍼니 | 인천광역시 남동구 예술로192번길 31 5층 (구월동 대건빌딩) | 1644-2173 | 2023-05-11
50 | 51 | (주)제이에이네트웍스 | 인천광역시 남동구 백범로 443-1 4층 (간석동 청한빌딩) | 032-819-9999 | 2023-05-11
51 | 52 | 노무법인 인천삼신 | 인천광역시 남동구 문화로 135 4층 (구월동 삼성빌딩) | 032-434-5400 | 2023-05-11