Overview

Dataset statistics

Number of variables5
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory43.9 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description대구지역 국제회의업 현황정보(업종중분류, 업체명, 소재지 등)일부 업체의 연락처는 개인정보(휴대전화)가 포함되어 제공되지 않음을 양해바랍니다.
Author대구광역시
URLhttps://www.data.go.kr/data/15054189/fileData.do

Alerts

연번 is highly overall correlated with 구군High correlation
구군 is highly overall correlated with 연번High correlation
업종중분류 is highly imbalanced (84.6%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:20:17.131291
Analysis finished2024-04-21 01:20:19.043676
Duration1.91 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2024-04-21T10:20:19.118200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2024-04-21T10:20:19.247452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

구군
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Memory size492.0 B
북구
14 
수성구
13 
중구
달서구
동구
Other values (2)

Length

Max length3
Median length2
Mean length2.4222222
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
북구 14
31.1%
수성구 13
28.9%
중구 6
13.3%
달서구 4
 
8.9%
동구 3
 
6.7%
남구 3
 
6.7%
달성군 2
 
4.4%

Length

2024-04-21T10:20:19.360307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:20:19.463730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
북구 14
31.1%
수성구 13
28.9%
중구 6
13.3%
달서구 4
 
8.9%
동구 3
 
6.7%
남구 3
 
6.7%
달성군 2
 
4.4%

업종중분류
Categorical

IMBALANCE 

Distinct2
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size492.0 B
국제회의기획업
44 
국제회의시설업
 
1

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique1 ?
Unique (%)2.2%

Sample

1st row국제회의기획업
2nd row국제회의기획업
3rd row국제회의기획업
4th row국제회의기획업
5th row국제회의기획업

Common Values

ValueCountFrequency (%)
국제회의기획업 44
97.8%
국제회의시설업 1
 
2.2%

Length

2024-04-21T10:20:19.568258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:20:19.652070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국제회의기획업 44
97.8%
국제회의시설업 1
 
2.2%
Distinct44
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size492.0 B
2024-04-21T10:20:19.829910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length11
Mean length8.1555556
Min length3

Characters and Unicode

Total characters367
Distinct characters128
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)95.6%

Sample

1st row(주)코리아 커뮤니케이션즈
2nd row주식회사 소스
3rd row㈜삼성여행사
4th row마이스코㈜
5th row여행스케치 주식회사
ValueCountFrequency (%)
주식회사 8
 
14.3%
주)엑스코 2
 
3.6%
㈜레드컴 1
 
1.8%
특수법인 1
 
1.8%
대구광역시관광협회 1
 
1.8%
주)덱스코 1
 
1.8%
더파워 1
 
1.8%
대구엠비씨미디컴(주 1
 
1.8%
주)문화뱅크 1
 
1.8%
더블유관광협동조합 1
 
1.8%
Other values (38) 38
67.9%
2024-04-21T10:20:20.185531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
6.5%
( 17
 
4.6%
) 17
 
4.6%
15
 
4.1%
12
 
3.3%
12
 
3.3%
11
 
3.0%
10
 
2.7%
10
 
2.7%
9
 
2.5%
Other values (118) 230
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 299
81.5%
Open Punctuation 17
 
4.6%
Close Punctuation 17
 
4.6%
Other Symbol 12
 
3.3%
Space Separator 11
 
3.0%
Lowercase Letter 8
 
2.2%
Uppercase Letter 3
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
8.0%
15
 
5.0%
12
 
4.0%
10
 
3.3%
10
 
3.3%
9
 
3.0%
8
 
2.7%
8
 
2.7%
6
 
2.0%
5
 
1.7%
Other values (104) 192
64.2%
Lowercase Letter
ValueCountFrequency (%)
o 2
25.0%
n 1
12.5%
i 1
12.5%
t 1
12.5%
u 1
12.5%
l 1
12.5%
s 1
12.5%
Uppercase Letter
ValueCountFrequency (%)
T 1
33.3%
M 1
33.3%
I 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Other Symbol
ValueCountFrequency (%)
12
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 311
84.7%
Common 45
 
12.3%
Latin 11
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
7.7%
15
 
4.8%
12
 
3.9%
12
 
3.9%
10
 
3.2%
10
 
3.2%
9
 
2.9%
8
 
2.6%
8
 
2.6%
6
 
1.9%
Other values (105) 197
63.3%
Latin
ValueCountFrequency (%)
o 2
18.2%
n 1
9.1%
i 1
9.1%
t 1
9.1%
u 1
9.1%
l 1
9.1%
s 1
9.1%
T 1
9.1%
M 1
9.1%
I 1
9.1%
Common
ValueCountFrequency (%)
( 17
37.8%
) 17
37.8%
11
24.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 299
81.5%
ASCII 56
 
15.3%
None 12
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
8.0%
15
 
5.0%
12
 
4.0%
10
 
3.3%
10
 
3.3%
9
 
3.0%
8
 
2.7%
8
 
2.7%
6
 
2.0%
5
 
1.7%
Other values (104) 192
64.2%
ASCII
ValueCountFrequency (%)
( 17
30.4%
) 17
30.4%
11
19.6%
o 2
 
3.6%
n 1
 
1.8%
i 1
 
1.8%
t 1
 
1.8%
u 1
 
1.8%
l 1
 
1.8%
s 1
 
1.8%
Other values (3) 3
 
5.4%
None
ValueCountFrequency (%)
12
100.0%
Distinct43
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Memory size492.0 B
2024-04-21T10:20:20.470220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length33
Mean length28.6
Min length17

Characters and Unicode

Total characters1287
Distinct characters120
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)91.1%

Sample

1st row대구광역시 중구 대봉로 200 (대봉동)
2nd row대구광역시 중구 봉산문화길 38 (봉산동)
3rd row대구광역시 중구 국채보상로 515, 2층 (서문로2가, 갑을빌딩)
4th row대구광역시 중구 서성로 26, 501호(계산동2가, 정무빌딩)
5th row대구광역시 중구 서성로26, 정무빌딩 501호
ValueCountFrequency (%)
대구광역시 45
 
18.3%
북구 14
 
5.7%
수성구 13
 
5.3%
산격동 8
 
3.3%
중구 6
 
2.4%
달서구 4
 
1.6%
동대구로 4
 
1.6%
3층 4
 
1.6%
동구 3
 
1.2%
남구 3
 
1.2%
Other values (119) 142
57.7%
2024-04-21T10:20:20.860949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
204
 
15.9%
96
 
7.5%
59
 
4.6%
53
 
4.1%
49
 
3.8%
45
 
3.5%
45
 
3.5%
1 44
 
3.4%
) 44
 
3.4%
( 44
 
3.4%
Other values (110) 604
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 745
57.9%
Decimal Number 205
 
15.9%
Space Separator 204
 
15.9%
Close Punctuation 44
 
3.4%
Open Punctuation 44
 
3.4%
Other Punctuation 31
 
2.4%
Uppercase Letter 8
 
0.6%
Dash Punctuation 5
 
0.4%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
 
12.9%
59
 
7.9%
53
 
7.1%
49
 
6.6%
45
 
6.0%
45
 
6.0%
43
 
5.8%
24
 
3.2%
20
 
2.7%
20
 
2.7%
Other values (89) 291
39.1%
Decimal Number
ValueCountFrequency (%)
1 44
21.5%
3 33
16.1%
2 32
15.6%
0 26
12.7%
8 15
 
7.3%
5 15
 
7.3%
4 13
 
6.3%
6 12
 
5.9%
7 11
 
5.4%
9 4
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
A 2
25.0%
B 2
25.0%
T 2
25.0%
C 2
25.0%
Other Punctuation
ValueCountFrequency (%)
, 29
93.5%
. 2
 
6.5%
Space Separator
ValueCountFrequency (%)
204
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 745
57.9%
Common 534
41.5%
Latin 8
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
 
12.9%
59
 
7.9%
53
 
7.1%
49
 
6.6%
45
 
6.0%
45
 
6.0%
43
 
5.8%
24
 
3.2%
20
 
2.7%
20
 
2.7%
Other values (89) 291
39.1%
Common
ValueCountFrequency (%)
204
38.2%
1 44
 
8.2%
) 44
 
8.2%
( 44
 
8.2%
3 33
 
6.2%
2 32
 
6.0%
, 29
 
5.4%
0 26
 
4.9%
8 15
 
2.8%
5 15
 
2.8%
Other values (7) 48
 
9.0%
Latin
ValueCountFrequency (%)
A 2
25.0%
B 2
25.0%
T 2
25.0%
C 2
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 745
57.9%
ASCII 542
42.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
204
37.6%
1 44
 
8.1%
) 44
 
8.1%
( 44
 
8.1%
3 33
 
6.1%
2 32
 
5.9%
, 29
 
5.4%
0 26
 
4.8%
8 15
 
2.8%
5 15
 
2.8%
Other values (11) 56
 
10.3%
Hangul
ValueCountFrequency (%)
96
 
12.9%
59
 
7.9%
53
 
7.1%
49
 
6.6%
45
 
6.0%
45
 
6.0%
43
 
5.8%
24
 
3.2%
20
 
2.7%
20
 
2.7%
Other values (89) 291
39.1%

Interactions

2024-04-21T10:20:18.768621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:20:20.947273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구군업종중분류업체명소재지
연번1.0000.8980.0001.0000.967
구군0.8981.0000.0001.0001.000
업종중분류0.0000.0001.0000.0000.000
업체명1.0001.0000.0001.0001.000
소재지0.9671.0000.0001.0001.000
2024-04-21T10:20:21.039499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구군업종중분류
구군1.0000.000
업종중분류0.0001.000
2024-04-21T10:20:21.113720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구군업종중분류
연번1.0000.7160.000
구군0.7161.0000.000
업종중분류0.0000.0001.000

Missing values

2024-04-21T10:20:18.909139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:20:19.002022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구군업종중분류업체명소재지
01중구국제회의기획업(주)코리아 커뮤니케이션즈대구광역시 중구 대봉로 200 (대봉동)
12중구국제회의기획업주식회사 소스대구광역시 중구 봉산문화길 38 (봉산동)
23중구국제회의기획업㈜삼성여행사대구광역시 중구 국채보상로 515, 2층 (서문로2가, 갑을빌딩)
34중구국제회의기획업마이스코㈜대구광역시 중구 서성로 26, 501호(계산동2가, 정무빌딩)
45중구국제회의기획업여행스케치 주식회사대구광역시 중구 서성로26, 정무빌딩 501호
56중구국제회의기획업주식회사 픽쇼코리아대구광역시 중구 달구벌대로 2204, 1층 (대봉동)
67동구국제회의기획업㈜예일커뮤니케이션즈대구광역시 동구 화랑로37길 25-4(효목동)
78동구국제회의기획업주식회사 제이비스퀘어대구광역시 동구 동대구로 475, 8층(신천동)
89동구국제회의기획업청년기획대구광역시 동구 이노밸리로56길 3-2 (신서동)
910남구국제회의기획업한국애드대구광역시 남구 이천로 142(이천동)
연번구군업종중분류업체명소재지
3536수성구국제회의기획업글로벌비즈니스센터(주)대구광역시 수성구 국채보상로 186길 127(범어동)
3637수성구국제회의기획업아이엠티솔루션(IMT solution)대구광역시 수성구 청수로38길 30, A동 501호(지산동)
3738수성구국제회의기획업㈜에이시티대구광역시 수성구 지산로 48(지산동)
3839수성구국제회의기획업㈜덱스코커뮤니케이션즈대구광역시 수성구 화랑로8길 11-13 (만촌동,성화빌딩6층)
3940달서구국제회의기획업(주)코드대구광역시 달서구 장기로36 안길28(감삼동)
4041달서구국제회의기획업한국국제교류사업단대구광역시 달서구 성서로 406, 대구성서우체국 3층 301호(이곡동)
4142달서구국제회의기획업㈜비아이이인터내셔널대구광역시 달서구 송현동길 30
4243달서구국제회의기획업대성기획대구광역시 달서구 감삼남3길 24, 1층 (감삼동)
4344달성군국제회의기획업마루기획대구광역시 달성군 가창면 가창로 1110. 1층
4445달성군국제회의기획업주식회사 미미대구광역시 달성군 가창면 가창로 213길 36. (2~3층)