Overview

Dataset statistics

Number of variables: 5
Number of observations: 26
Missing cells: 14
Missing cells (%): 10.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 KiB
Average record size in memory: 47.1 B
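
The missing-cell percentage follows directly from the counts in this report. A minimal sketch of the arithmetic, using only figures stated here (the variable names are illustrative, not part of ydata-profiling):

```python
# Figures copied from this report.
n_rows = 26          # Number of observations
n_cols = 5           # Number of variables
missing_cells = 14   # every missing cell is in the 전화번호 column

# Share of missing cells over the whole table (rows x columns).
missing_pct = 100 * missing_cells / (n_rows * n_cols)

# Share of missing values within the 전화번호 column alone.
phone_missing_pct = 100 * missing_cells / n_rows

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"전화번호 missing (%): {phone_missing_pct:.1f}%")
```

The same 14 cells give 10.8% for the whole table and 53.8% for the single column, because the denominators differ (130 cells versus 26 rows).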

Variable types

Numeric: 2
Text: 3

Dataset

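Description: Data on travel agencies operating in Gyeyang-gu, Incheon Metropolitan City. It provides the serial number (연번), business name (업체명), location as a road-name address (소재지), telephone number (전화번호), postal code (우편번호), and so on.
Author: Gyeyang-gu, Incheon Metropolitan City (인천광역시 계양구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15038928&srcSe=7661IVAWM27C61E190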

Alerts

전화번호 has 14 (53.8%) missing values [Missing]
연번 has unique values [Unique]
업체명 has unique values [Unique]
소재지 has unique values [Unique]

Reproduction

Analysis started: 2024-01-28 12:22:41.869344
Analysis finished: 2024-01-28 12:22:42.469209
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.5
Minimum: 1
Maximum: 26
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.25
Q1: 7.25
Median: 13.5
Q3: 19.75
95-th percentile: 24.75
Maximum: 26
Range: 25
Interquartile range (IQR): 12.5

Descriptive statistics

Standard deviation: 7.6485293
Coefficient of variation (CV): 0.56655772
Kurtosis: -1.2
Mean: 13.5
Median Absolute Deviation (MAD): 6.5
Skewness: 0
Sum: 351
Variance: 58.5
Monotonicity: Strictly increasing
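
The figures above are consistent with 연번 being the integers 1 through 26 (26 distinct values, minimum 1, maximum 26, sum 351, strictly increasing). Under that assumption, the following sketch recomputes several of the statistics with Python's standard library. It is an illustration, not the code ydata-profiling runs internally. The sample (n - 1) variance and linearly interpolated quartiles are assumptions chosen because they reproduce the reported values.

```python
import math
import statistics

# Assumption: 연번 is simply 1, 2, ..., 26.
values = list(range(1, 27))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of distances from the median.
mad = statistics.median(abs(v - median) for v in values)
# Quartiles with linear interpolation between the closest ranks.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

print(mean, variance, round(std, 7), round(cv, 8), mad, q1, q3, iqr)
```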
Histogram with fixed size bins (bins=26)
Value | Count | Frequency (%)
1 | 1 | 3.8%
15 | 1 | 3.8%
26 | 1 | 3.8%
25 | 1 | 3.8%
24 | 1 | 3.8%
23 | 1 | 3.8%
22 | 1 | 3.8%
21 | 1 | 3.8%
20 | 1 | 3.8%
19 | 1 | 3.8%
Other values (16) | 16 | 61.5%

Value | Count | Frequency (%)
1 | 1 | 3.8%
2 | 1 | 3.8%
3 | 1 | 3.8%
4 | 1 | 3.8%
5 | 1 | 3.8%
6 | 1 | 3.8%
7 | 1 | 3.8%
8 | 1 | 3.8%
9 | 1 | 3.8%
10 | 1 | 3.8%

Value | Count | Frequency (%)
26 | 1 | 3.8%
25 | 1 | 3.8%
24 | 1 | 3.8%
23 | 1 | 3.8%
22 | 1 | 3.8%
21 | 1 | 3.8%
20 | 1 | 3.8%
19 | 1 | 3.8%
18 | 1 | 3.8%
17 | 1 | 3.8%

업체명
Text

UNIQUE 

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 16
Median length: 12
Mean length: 7.7692308
Min length: 3

Characters and Unicode

Total characters: 202
Distinct characters: 99
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
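
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module on the first sample value of 업체명. This is not necessarily how ydata-profiling computes its tables. The Hangul check based on the character name is a simplified stand-in for a real script lookup, which the standard library does not provide.

```python
import unicodedata
from collections import Counter

name = "(주)화이트투어"  # first sample value of 업체명 in this report

# General category per character, e.g. Lo = Other Letter,
# Ps = Open Punctuation, Pe = Close Punctuation, Zs = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in name)

# Simplified script check: Hangul syllables have names starting with "HANGUL".
hangul_count = sum(
    unicodedata.name(ch, "").startswith("HANGUL") for ch in name
)

print(dict(categories))   # counts per general category
print(hangul_count)       # number of Hangul characters
```

In the tables that follow, the full names (Other Letter, Open Punctuation, and so on) correspond to these two-letter category codes.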

Unique

Unique: 26
Unique (%): 100.0%

Sample

1st row: (주)화이트투어
2nd row: 이지관광여행사
3rd row: (주) 좋은세상바라기
4th row: 유아굿
5th row: 벤츠투어
Value | Count | Frequency (%)
주식회사 | 3 | 8.8%
여행사 | 2 | 5.9%
주)화이트투어 | 1 | 2.9%
케이월드투어 | 1 | 2.9%
새인천 | 1 | 2.9%
주)소풍관광 | 1 | 2.9%
스마일시스템 | 1 | 2.9%
flutter(플루터 | 1 | 2.9%
tjstory | 1 | 2.9%
디에고투어 | 1 | 2.9%
Other values (21) | 21 | 61.8%

Most occurring characters

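Value | Count | Frequency (%)
(not preserved) | 13 | 6.4%
( | 11 | 5.4%
) | 11 | 5.4%
(space) | 8 | 4.0%
(not preserved) | 7 | 3.5%
(not preserved) | 7 | 3.5%
(not preserved) | 7 | 3.5%
(not preserved) | 6 | 3.0%
(not preserved) | 5 | 2.5%
(not preserved) | 5 | 2.5%
Other values (89) | 122 | 60.4%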

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 149 | 73.8%
Uppercase Letter | 18 | 8.9%
Open Punctuation | 11 | 5.4%
Close Punctuation | 11 | 5.4%
Space Separator | 8 | 4.0%
Lowercase Letter | 5 | 2.5%

Most frequent character per category

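Other Letter
Value | Count | Frequency (%)
(not preserved) | 13 | 8.7%
(not preserved) | 7 | 4.7%
(not preserved) | 7 | 4.7%
(not preserved) | 7 | 4.7%
(not preserved) | 6 | 4.0%
(not preserved) | 5 | 3.4%
(not preserved) | 5 | 3.4%
(not preserved) | 4 | 2.7%
(not preserved) | 4 | 2.7%
(not preserved) | 4 | 2.7%
Other values (68) | 87 | 58.4%

Uppercase Letter
Value | Count | Frequency (%)
T | 4 | 22.2%
U | 2 | 11.1%
R | 2 | 11.1%
N | 1 | 5.6%
A | 1 | 5.6%
H | 1 | 5.6%
S | 1 | 5.6%
Y | 1 | 5.6%
O | 1 | 5.6%
E | 1 | 5.6%
Other values (3) | 3 | 16.7%

Lowercase Letter
Value | Count | Frequency (%)
y | 1 | 20.0%
r | 1 | 20.0%
o | 1 | 20.0%
t | 1 | 20.0%
s | 1 | 20.0%

Open Punctuation
Value | Count | Frequency (%)
( | 11 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 11 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 8 | 100.0%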

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 149 | 73.8%
Common | 30 | 14.9%
Latin | 23 | 11.4%

Most frequent character per script

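Hangul
Value | Count | Frequency (%)
(not preserved) | 13 | 8.7%
(not preserved) | 7 | 4.7%
(not preserved) | 7 | 4.7%
(not preserved) | 7 | 4.7%
(not preserved) | 6 | 4.0%
(not preserved) | 5 | 3.4%
(not preserved) | 5 | 3.4%
(not preserved) | 4 | 2.7%
(not preserved) | 4 | 2.7%
(not preserved) | 4 | 2.7%
Other values (68) | 87 | 58.4%

Latin
Value | Count | Frequency (%)
T | 4 | 17.4%
U | 2 | 8.7%
R | 2 | 8.7%
N | 1 | 4.3%
A | 1 | 4.3%
H | 1 | 4.3%
S | 1 | 4.3%
Y | 1 | 4.3%
O | 1 | 4.3%
E | 1 | 4.3%
Other values (8) | 8 | 34.8%

Common
Value | Count | Frequency (%)
( | 11 | 36.7%
) | 11 | 36.7%
(space) | 8 | 26.7%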

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 149 | 73.8%
ASCII | 53 | 26.2%

Most frequent character per block

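Hangul
Value | Count | Frequency (%)
(not preserved) | 13 | 8.7%
(not preserved) | 7 | 4.7%
(not preserved) | 7 | 4.7%
(not preserved) | 7 | 4.7%
(not preserved) | 6 | 4.0%
(not preserved) | 5 | 3.4%
(not preserved) | 5 | 3.4%
(not preserved) | 4 | 2.7%
(not preserved) | 4 | 2.7%
(not preserved) | 4 | 2.7%
Other values (68) | 87 | 58.4%

ASCII
Value | Count | Frequency (%)
( | 11 | 20.8%
) | 11 | 20.8%
(space) | 8 | 15.1%
T | 4 | 7.5%
U | 2 | 3.8%
R | 2 | 3.8%
N | 1 | 1.9%
A | 1 | 1.9%
H | 1 | 1.9%
S | 1 | 1.9%
Other values (11) | 11 | 20.8%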

소재지
Text

UNIQUE 

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 48
Median length: 38
Mean length: 34.076923
Min length: 23

Characters and Unicode

Total characters: 886
Distinct characters: 96
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 100.0%

Sample

1st row: 인천광역시 계양구 계산새로 85, 302호 (용종동, 보람비지니스프라자)
2nd row: 인천광역시 계양구 계산로112번길 1, 302호 (계산동)
3rd row: 인천광역시 계양구 경명대로1017번길 25 (계산동)
4th row: 인천광역시 계양구 계산새로 71, 비동 1122호 (계산동, 하이베라스)
5th row: 인천광역시 계양구 계양문화로 54, 10층 94호 (계산동, 대산월드프라자)
Value | Count | Frequency (%)
인천광역시 | 26 | 15.1%
계양구 | 26 | 15.1%
계산동 | 14 | 8.1%
작전동 | 6 | 3.5%
계산새로 | 3 | 1.7%
302호 | 3 | 1.7%
계양대로 | 3 | 1.7%
계양문화로 | 3 | 1.7%
하이베라스 | 2 | 1.2%
3층 | 2 | 1.2%
Other values (74) | 84 | 48.8%

Most occurring characters

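Value | Count | Frequency (%)
(space) | 146 | 16.5%
(not preserved) | 57 | 6.4%
1 | 35 | 4.0%
(not preserved) | 35 | 4.0%
(not preserved) | 30 | 3.4%
(not preserved) | 28 | 3.2%
) | 27 | 3.0%
(not preserved) | 27 | 3.0%
( | 27 | 3.0%
(not preserved) | 26 | 2.9%
Other values (86) | 448 | 50.6%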

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 516 | 58.2%
Space Separator | 146 | 16.5%
Decimal Number | 143 | 16.1%
Close Punctuation | 27 | 3.0%
Open Punctuation | 27 | 3.0%
Other Punctuation | 25 | 2.8%
Dash Punctuation | 1 | 0.1%
Uppercase Letter | 1 | 0.1%

Most frequent character per category

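Other Letter
Value | Count | Frequency (%)
(not preserved) | 57 | 11.0%
(not preserved) | 35 | 6.8%
(not preserved) | 30 | 5.8%
(not preserved) | 28 | 5.4%
(not preserved) | 27 | 5.2%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
Other values (70) | 209 | 40.5%

Decimal Number
Value | Count | Frequency (%)
1 | 35 | 24.5%
2 | 21 | 14.7%
0 | 19 | 13.3%
4 | 15 | 10.5%
5 | 14 | 9.8%
3 | 11 | 7.7%
7 | 9 | 6.3%
8 | 9 | 6.3%
6 | 7 | 4.9%
9 | 3 | 2.1%

Space Separator
Value | Count | Frequency (%)
(space) | 146 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 27 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 27 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 25 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
C | 1 | 100.0%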

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 516 | 58.2%
Common | 369 | 41.6%
Latin | 1 | 0.1%

Most frequent character per script

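Hangul
Value | Count | Frequency (%)
(not preserved) | 57 | 11.0%
(not preserved) | 35 | 6.8%
(not preserved) | 30 | 5.8%
(not preserved) | 28 | 5.4%
(not preserved) | 27 | 5.2%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
Other values (70) | 209 | 40.5%

Common
Value | Count | Frequency (%)
(space) | 146 | 39.6%
1 | 35 | 9.5%
) | 27 | 7.3%
( | 27 | 7.3%
, | 25 | 6.8%
2 | 21 | 5.7%
0 | 19 | 5.1%
4 | 15 | 4.1%
5 | 14 | 3.8%
3 | 11 | 3.0%
Other values (5) | 29 | 7.9%

Latin
Value | Count | Frequency (%)
C | 1 | 100.0%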

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 516 | 58.2%
ASCII | 370 | 41.8%

Most frequent character per block

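ASCII
Value | Count | Frequency (%)
(space) | 146 | 39.5%
1 | 35 | 9.5%
) | 27 | 7.3%
( | 27 | 7.3%
, | 25 | 6.8%
2 | 21 | 5.7%
0 | 19 | 5.1%
4 | 15 | 4.1%
5 | 14 | 3.8%
3 | 11 | 3.0%
Other values (6) | 30 | 8.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 57 | 11.0%
(not preserved) | 35 | 6.8%
(not preserved) | 30 | 5.8%
(not preserved) | 28 | 5.4%
(not preserved) | 27 | 5.2%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
(not preserved) | 26 | 5.0%
Other values (70) | 209 | 40.5%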

전화번호
Text

MISSING 

Distinct: 11
Distinct (%): 91.7%
Missing: 14
Missing (%): 53.8%
Memory size: 340.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.083333
Min length: 12

Characters and Unicode

Total characters: 145
Distinct characters: 10
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10
Unique (%): 83.3%

Sample

1st row: 032-555-2229
2nd row: 032-555-2223
3rd row: 032-545-5550
4th row: 032-555-5511
5th row: 032-555-5511
Value | Count | Frequency (%)
032-555-5511 | 2 | 16.7%
032-555-2229 | 1 | 8.3%
032-555-2223 | 1 | 8.3%
032-545-5550 | 1 | 8.3%
032-555-7007 | 1 | 8.3%
032-544-5900 | 1 | 8.3%
032-541-5959 | 1 | 8.3%
032-553-7288 | 1 | 8.3%
032-551-7520 | 1 | 8.3%
070-4848-1000 | 1 | 8.3%

Most occurring characters

Value | Count | Frequency (%)
5 | 35 | 24.1%
- | 24 | 16.6%
0 | 22 | 15.2%
2 | 20 | 13.8%
3 | 13 | 9.0%
1 | 8 | 5.5%
7 | 8 | 5.5%
4 | 6 | 4.1%
8 | 5 | 3.4%
9 | 4 | 2.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 121 | 83.4%
Dash Punctuation | 24 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 35 | 28.9%
0 | 22 | 18.2%
2 | 20 | 16.5%
3 | 13 | 10.7%
1 | 8 | 6.6%
7 | 8 | 6.6%
4 | 6 | 5.0%
8 | 5 | 4.1%
9 | 4 | 3.3%

Dash Punctuation
Value | Count | Frequency (%)
- | 24 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 145 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 35 | 24.1%
- | 24 | 16.6%
0 | 22 | 15.2%
2 | 20 | 13.8%
3 | 13 | 9.0%
1 | 8 | 5.5%
7 | 8 | 5.5%
4 | 6 | 4.1%
8 | 5 | 3.4%
9 | 4 | 2.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 145 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 35 | 24.1%
- | 24 | 16.6%
0 | 22 | 15.2%
2 | 20 | 13.8%
3 | 13 | 9.0%
1 | 8 | 5.5%
7 | 8 | 5.5%
4 | 6 | 4.1%
8 | 5 | 3.4%
9 | 4 | 2.8%

우편번호
Real number (ℝ)

Distinct: 16
Distinct (%): 61.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21068.423
Minimum: 21006
Maximum: 21122
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 21006
5-th percentile: 21032.5
Q1: 21049.25
Median: 21067
Q3: 21086.75
95-th percentile: 21111.75
Maximum: 21122
Range: 116
Interquartile range (IQR): 37.5

Descriptive statistics

Standard deviation: 27.812476
Coefficient of variation (CV): 0.0013201024
Kurtosis: -0.089836965
Mean: 21068.423
Median Absolute Deviation (MAD): 19
Skewness: 0.072737295
Sum: 547779
Variance: 773.53385
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=16)
Value | Count | Frequency (%)
21069 | 4 | 15.4%
21048 | 3 | 11.5%
21065 | 2 | 7.7%
21111 | 2 | 7.7%
21091 | 2 | 7.7%
21060 | 2 | 7.7%
21037 | 2 | 7.7%
21087 | 1 | 3.8%
21086 | 1 | 3.8%
21053 | 1 | 3.8%
Other values (6) | 6 | 23.1%

Value | Count | Frequency (%)
21006 | 1 | 3.8%
21031 | 1 | 3.8%
21037 | 2 | 7.7%
21048 | 3 | 11.5%
21053 | 1 | 3.8%
21054 | 1 | 3.8%
21060 | 2 | 7.7%
21065 | 2 | 7.7%
21069 | 4 | 15.4%
21080 | 1 | 3.8%

Value | Count | Frequency (%)
21122 | 1 | 3.8%
21112 | 1 | 3.8%
21111 | 2 | 7.7%
21091 | 2 | 7.7%
21087 | 1 | 3.8%
21086 | 1 | 3.8%
21080 | 1 | 3.8%
21069 | 4 | 15.4%
21065 | 2 | 7.7%
21060 | 2 | 7.7%
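
Taken together, the three frequency tables for 우편번호 list all 16 distinct values, and their counts add up to the 26 observations. That makes it possible to rebuild the column and cross-check the reported statistics. The sketch below does so with Python's standard library. The merged frequency dictionary is assembled by hand from the tables. The sample (n - 1) variance and linearly interpolated quartiles are assumptions chosen because they reproduce the reported values.

```python
import statistics

# value -> count, merged by hand from the three 우편번호 frequency tables.
freq = {
    21006: 1, 21031: 1, 21037: 2, 21048: 3, 21053: 1, 21054: 1,
    21060: 2, 21065: 2, 21069: 4, 21080: 1, 21086: 1, 21087: 1,
    21091: 2, 21111: 2, 21112: 1, 21122: 1,
}

# Expand the counts back into a sorted list of 26 observations.
values = sorted(v for v, c in freq.items() for _ in range(c))

total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(len(values), len(freq), total)
print(round(mean, 3), median, round(variance, 5), q1, q3, q3 - q1)
```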

Interactions

[Figure: four interaction plots]

Correlations

Variable | 연번 | 업체명 | 소재지 | 전화번호 | 우편번호
연번 | 1.000 | 1.000 | 1.000 | 0.834 | 0.523
업체명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 0.834 | 1.000 | 1.000 | 1.000 | 0.911
우편번호 | 0.523 | 1.000 | 1.000 | 0.911 | 1.000
Variable | 연번 | 우편번호
연번 | 1.000 | 0.004
우편번호 | 0.004 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

Index | 연번 | 업체명 | 소재지 | 전화번호 | 우편번호
0 | 1 | (주)화이트투어 | 인천광역시 계양구 계산새로 85, 302호 (용종동, 보람비지니스프라자) | 032-555-2229 | 21065
1 | 2 | 이지관광여행사 | 인천광역시 계양구 계산로112번길 1, 302호 (계산동) | 032-555-2223 | 21086
2 | 3 | (주) 좋은세상바라기 | 인천광역시 계양구 경명대로1017번길 25 (계산동) | 032-545-5550 | 21037
3 | 4 | 유아굿 | 인천광역시 계양구 계산새로 71, 비동 1122호 (계산동, 하이베라스) | <NA> | 21060
4 | 5 | 벤츠투어 | 인천광역시 계양구 계양문화로 54, 10층 94호 (계산동, 대산월드프라자) | 032-555-5511 | 21069
5 | 6 | (주)청우관광여행사 | 인천광역시 계양구 경명대로 1016, 302호 (계산동) | 032-555-5511 | 21048
6 | 7 | 아주관광닷컴 | 인천광역시 계양구 주부토로 467, 406호 (작전동) | <NA> | 21087
7 | 8 | (주)참존여행 | 인천광역시 계양구 계양대로 185 (계산동) | 032-555-7007 | 21048
8 | 9 | 행복드림 | 인천광역시 계양구 계양대로 27 (작전동) | 032-544-5900 | 21111
9 | 10 | (주)현대항공 | 인천광역시 계양구 계양대로 82 (작전동) | <NA> | 21091

Index | 연번 | 업체명 | 소재지 | 전화번호 | 우편번호
16 | 17 | 동성전자 주식회사 | 인천광역시 계양구 계산천서로 34 (계산동) | 032-551-7520 | 21054
17 | 18 | 샤니투어(SHANY TOUR) | 인천광역시 계양구 황어로134번길 8, 102호 (장기동) | <NA> | 21006
18 | 19 | (주)투어랜드 | 인천광역시 계양구 경명대로1029번길 5-4, 202호 (계산동) | 070-4848-1000 | 21037
19 | 20 | 디에고투어 | 인천광역시 계양구 경명대로 1143, 3층 (임학동) | <NA> | 21031
20 | 21 | TJstory | 인천광역시 계양구 계양문화로 54, 대산월드프라자 10층 123호 (계산동) | <NA> | 21069
21 | 22 | FLUTTER(플루터) | 인천광역시 계양구 계산새로87번길 15, 강남오피스텔 714호 (용종동) | <NA> | 21065
22 | 23 | 스마일시스템 주식회사 | 인천광역시 계양구 아나지로 195, 4층 (효성동) | <NA> | 21112
23 | 24 | (주)소풍관광 | 인천광역시 계양구 계양대로205번길 17, 광백오피스텔 602호 (계산동) | 032-715-7728 | 21048
24 | 25 | 새인천 여행사 | 인천광역시 계양구 새벌로 112, 현대프라자 410호 (효성동) | <NA> | 21111
25 | 26 | 주식회사 미르국제 | 인천광역시 계양구 계산새로 71, 하이베라스 C동 510호 (계산동) | <NA> | 21060