Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 3065 |
Missing cells | 15348 |
Missing cells (%) | 55.6% |
Duplicate rows | 4 |
Duplicate rows (%) | 0.1% |
Total size in memory | 233.6 KiB |
Average record size in memory | 78.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 2 |
Unsupported | 5 |
DateTime | 1 |
Dataset
Description | 정보통신공사업체 현황 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=EC4ZK8N0T3SDGCI849GM32015987&infSeq=1 |
Dataset has 4 (0.1%) duplicate rows | Duplicates |
전화번호 has 3065 (100.0%) missing values | Missing |
정제도로명주소 has 3065 (100.0%) missing values | Missing |
정제지번주소 has 3065 (100.0%) missing values | Missing |
정제WGS84위도 has 3065 (100.0%) missing values | Missing |
정제WGS84경도 has 3065 (100.0%) missing values | Missing |
전화번호 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
정제도로명주소 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
정제지번주소 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
정제WGS84위도 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
정제WGS84경도 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-03-12 23:35:17.389573 |
---|---|
Analysis finished | 2024-03-12 23:35:17.947551 |
Duration | 0.56 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
등록번호
Real number (ℝ)
Distinct | 3060 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 270747.84 |
Minimum | 110003 |
---|---|
Maximum | 640110 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 27.1 KiB |
Quantile statistics
Minimum | 110003 |
---|---|
5-th percentile | 110823.2 |
Q1 | 201252 |
median | 311188 |
Q3 | 312127 |
95-th percentile | 350465.8 |
Maximum | 640110 |
Range | 530107 |
Interquartile range (IQR) | 110875 |
Descriptive statistics
Standard deviation | 98750.692 |
---|---|
Coefficient of variation (CV) | 0.36473307 |
Kurtosis | 0.65648261 |
Mean | 270747.84 |
Median Absolute Deviation (MAD) | 1161 |
Skewness | -0.055993395 |
Sum | 8.2984213 × 108 |
Variance | 9.7516991 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
310590 | 3 | 0.1% |
202912 | 2 | 0.1% |
550186 | 2 | 0.1% |
530173 | 2 | 0.1% |
311656 | 1 | < 0.1% |
311659 | 1 | < 0.1% |
203072 | 1 | < 0.1% |
203071 | 1 | < 0.1% |
311658 | 1 | < 0.1% |
350348 | 1 | < 0.1% |
Other values (3050) | 3050 |
Value | Count | Frequency (%) |
110003 | 1 | |
110013 | 1 | |
110018 | 1 | |
110019 | 1 | |
110022 | 1 | |
110026 | 1 | |
110031 | 1 | |
110037 | 1 | |
110041 | 1 | |
110045 | 1 |
Value | Count | Frequency (%) |
640110 | 1 | |
640107 | 1 | |
640086 | 1 | |
640044 | 1 | |
640011 | 1 | |
630231 | 1 | |
630187 | 1 | |
630169 | 1 | |
630010 | 1 | |
620293 | 1 |
상호명
Text
Distinct | 3047 |
---|---|
Distinct (%) | 99.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 24.1 KiB |
Length
Max length | 58 |
---|---|
Median length | 53 |
Mean length | 9.5960848 |
Min length | 2 |
Characters and Unicode
Total characters | 29412 |
---|---|
Distinct characters | 551 |
Distinct categories | 10 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 3030 ? |
---|---|
Unique (%) | 98.9% |
Sample
1st row | (주)아이티언 |
---|---|
2nd row | (주)성문텔레콤 |
3rd row | (주)메인 |
4th row | (주)호맘코리아 |
5th row | 중앙아이.엔.티.(주) |
Value | Count | Frequency (%) |
주식회사 | 1150 | 25.4% |
ltd | 46 | 1.0% |
co | 45 | 1.0% |
co.,ltd | 40 | 0.9% |
inc | 16 | 0.4% |
사회적협동조합 | 7 | 0.2% |
사단법인 | 4 | 0.1% |
system | 4 | 0.1% |
3 | 0.1% | |
가온정보통신 | 3 | 0.1% |
Other values (3163) | 3201 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 2956 | 10.1% |
( | 1878 | 6.4% |
) | 1877 | 6.4% |
1454 | 4.9% | |
사 | 1239 | 4.2% |
회 | 1200 | 4.1% |
식 | 1181 | 4.0% |
이 | 1098 | 3.7% |
스 | 931 | 3.2% |
에 | 511 | 1.7% |
Other values (541) | 15087 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 22307 | |
Open Punctuation | 1878 | 6.4% |
Close Punctuation | 1877 | 6.4% |
Space Separator | 1454 | 4.9% |
Uppercase Letter | 867 | 2.9% |
Lowercase Letter | 729 | 2.5% |
Other Punctuation | 294 | 1.0% |
Dash Punctuation | 4 | < 0.1% |
Other Symbol | 1 | < 0.1% |
Decimal Number | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 2956 | 13.3% |
사 | 1239 | 5.6% |
회 | 1200 | 5.4% |
식 | 1181 | 5.3% |
이 | 1098 | 4.9% |
스 | 931 | 4.2% |
에 | 511 | 2.3% |
신 | 474 | 2.1% |
씨 | 375 | 1.7% |
보 | 372 | 1.7% |
Other values (485) | 11970 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 135 | |
L | 110 | |
N | 68 | 7.8% |
O | 65 | 7.5% |
T | 62 | 7.2% |
I | 56 | 6.5% |
S | 52 | 6.0% |
E | 52 | 6.0% |
A | 42 | 4.8% |
D | 38 | 4.4% |
Other values (15) | 187 |
Lowercase Letter
Value | Count | Frequency (%) |
o | 129 | |
t | 110 | |
d | 81 | |
n | 70 | |
e | 53 | |
i | 47 | 6.4% |
c | 39 | 5.3% |
a | 37 | 5.1% |
r | 35 | 4.8% |
s | 24 | 3.3% |
Other values (12) | 104 |
Other Punctuation
Value | Count | Frequency (%) |
. | 189 | |
, | 94 | |
& | 11 | 3.7% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1878 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1877 |
Space Separator
Value | Count | Frequency (%) |
1454 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Decimal Number
Value | Count | Frequency (%) |
2 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 22292 | |
Common | 5508 | 18.7% |
Latin | 1596 | 5.4% |
Han | 16 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 2956 | 13.3% |
사 | 1239 | 5.6% |
회 | 1200 | 5.4% |
식 | 1181 | 5.3% |
이 | 1098 | 4.9% |
스 | 931 | 4.2% |
에 | 511 | 2.3% |
신 | 474 | 2.1% |
씨 | 375 | 1.7% |
보 | 372 | 1.7% |
Other values (470) | 11955 |
Latin
Value | Count | Frequency (%) |
C | 135 | 8.5% |
o | 129 | 8.1% |
t | 110 | 6.9% |
L | 110 | 6.9% |
d | 81 | 5.1% |
n | 70 | 4.4% |
N | 68 | 4.3% |
O | 65 | 4.1% |
T | 62 | 3.9% |
I | 56 | 3.5% |
Other values (37) | 710 |
Han
Value | Count | Frequency (%) |
我 | 1 | 6.2% |
到 | 1 | 6.2% |
信 | 1 | 6.2% |
元 | 1 | 6.2% |
綜 | 1 | 6.2% |
合 | 1 | 6.2% |
開 | 1 | 6.2% |
發 | 1 | 6.2% |
株 | 1 | 6.2% |
式 | 1 | 6.2% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
( | 1878 | |
) | 1877 | |
1454 | ||
. | 189 | 3.4% |
, | 94 | 1.7% |
& | 11 | 0.2% |
- | 4 | 0.1% |
2 | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 22291 | |
ASCII | 7104 | 24.2% |
CJK | 16 | 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 2956 | 13.3% |
사 | 1239 | 5.6% |
회 | 1200 | 5.4% |
식 | 1181 | 5.3% |
이 | 1098 | 4.9% |
스 | 931 | 4.2% |
에 | 511 | 2.3% |
신 | 474 | 2.1% |
씨 | 375 | 1.7% |
보 | 372 | 1.7% |
Other values (469) | 11954 |
ASCII
Value | Count | Frequency (%) |
( | 1878 | |
) | 1877 | |
1454 | ||
. | 189 | 2.7% |
C | 135 | 1.9% |
o | 129 | 1.8% |
t | 110 | 1.5% |
L | 110 | 1.5% |
, | 94 | 1.3% |
d | 81 | 1.1% |
Other values (45) | 1047 |
CJK
Value | Count | Frequency (%) |
我 | 1 | 6.2% |
到 | 1 | 6.2% |
信 | 1 | 6.2% |
元 | 1 | 6.2% |
綜 | 1 | 6.2% |
合 | 1 | 6.2% |
開 | 1 | 6.2% |
發 | 1 | 6.2% |
株 | 1 | 6.2% |
式 | 1 | 6.2% |
Other values (6) | 6 |
None
Value | Count | Frequency (%) |
㈜ | 1 |
전화번호
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 3065 |
---|---|
Missing (%) | 100.0% |
Memory size | 27.1 KiB |
팩스번호
Text
Distinct | 2939 |
---|---|
Distinct (%) | 96.6% |
Missing | 23 |
Missing (%) | 0.8% |
Memory size | 24.1 KiB |
Length
Max length | 14 |
---|---|
Median length | 12 |
Mean length | 12.092373 |
Min length | 11 |
Characters and Unicode
Total characters | 36785 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2840 ? |
---|---|
Unique (%) | 93.4% |
Sample
1st row | 031-8040-9980 |
---|---|
2nd row | 031-503-0099 |
3rd row | 02-6499-0833 |
4th row | 031-520-6793 |
5th row | 031-420-4429 |
Value | Count | Frequency (%) |
031-235-6336 | 3 | 0.1% |
031-658-4904 | 3 | 0.1% |
031-541-8255 | 3 | 0.1% |
031-871-2771 | 3 | 0.1% |
031-339-9191 | 2 | 0.1% |
031-904-8981 | 2 | 0.1% |
031-968-3748 | 2 | 0.1% |
050-5304-2020 | 2 | 0.1% |
031-477-3407 | 2 | 0.1% |
031-921-0933 | 2 | 0.1% |
Other values (2929) | 3018 |
Most occurring characters
Value | Count | Frequency (%) |
- | 6084 | |
0 | 5687 | |
3 | 4534 | |
1 | 4055 | |
2 | 2979 | |
7 | 2424 | 6.6% |
4 | 2342 | 6.4% |
5 | 2318 | 6.3% |
8 | 2134 | 5.8% |
9 | 2114 | 5.7% |
Other values (2) | 2114 | 5.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 30690 | |
Dash Punctuation | 6084 | 16.5% |
Other Punctuation | 11 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 5687 | |
3 | 4534 | |
1 | 4055 | |
2 | 2979 | |
7 | 2424 | |
4 | 2342 | |
5 | 2318 | |
8 | 2134 | 7.0% |
9 | 2114 | 6.9% |
6 | 2103 | 6.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6084 |
Other Punctuation
Value | Count | Frequency (%) |
* | 11 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 36785 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 6084 | |
0 | 5687 | |
3 | 4534 | |
1 | 4055 | |
2 | 2979 | |
7 | 2424 | 6.6% |
4 | 2342 | 6.4% |
5 | 2318 | 6.3% |
8 | 2134 | 5.8% |
9 | 2114 | 5.7% |
Other values (2) | 2114 | 5.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 36785 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 6084 | |
0 | 5687 | |
3 | 4534 | |
1 | 4055 | |
2 | 2979 | |
7 | 2424 | 6.6% |
4 | 2342 | 6.4% |
5 | 2318 | 6.3% |
8 | 2134 | 5.8% |
9 | 2114 | 5.7% |
Other values (2) | 2114 | 5.7% |
정제도로명주소
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 3065 |
---|---|
Missing (%) | 100.0% |
Memory size | 27.1 KiB |
정제지번주소
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 3065 |
---|---|
Missing (%) | 100.0% |
Memory size | 27.1 KiB |
정제WGS84위도
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 3065 |
---|---|
Missing (%) | 100.0% |
Memory size | 27.1 KiB |
정제WGS84경도
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 3065 |
---|---|
Missing (%) | 100.0% |
Memory size | 27.1 KiB |
등록일자
Date
Distinct | 2150 |
---|---|
Distinct (%) | 70.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 24.1 KiB |
Minimum | 1971-11-27 00:00:00 |
---|---|
Maximum | 2023-10-06 00:00:00 |
등록번호 | 상호명 | 전화번호 | 팩스번호 | 정제도로명주소 | 정제지번주소 | 정제WGS84위도 | 정제WGS84경도 | 등록일자 | |
---|---|---|---|---|---|---|---|---|---|
0 | 112761 | (주)아이티언 | <NA> | 031-8040-9980 | <NA> | <NA> | <NA> | <NA> | 2002-10-10 |
1 | 112742 | (주)성문텔레콤 | <NA> | 031-503-0099 | <NA> | <NA> | <NA> | <NA> | 2002-09-27 |
2 | 112744 | (주)메인 | <NA> | 02-6499-0833 | <NA> | <NA> | <NA> | <NA> | 2002-09-27 |
3 | 112731 | (주)호맘코리아 | <NA> | 031-520-6793 | <NA> | <NA> | <NA> | <NA> | 2002-09-19 |
4 | 112732 | 중앙아이.엔.티.(주) | <NA> | 031-420-4429 | <NA> | <NA> | <NA> | <NA> | 2002-09-19 |
5 | 112726 | (주)천일정보통신 | <NA> | 031-785-4303 | <NA> | <NA> | <NA> | <NA> | 2002-09-19 |
6 | 150580 | (주)대륜통신 | <NA> | 031-658-3796 | <NA> | <NA> | <NA> | <NA> | 2002-09-18 |
7 | 112716 | (주)위드텍 | <NA> | 031-321-6009 | <NA> | <NA> | <NA> | <NA> | 2002-09-09 |
8 | 112700 | (주)이지넷콤 | <NA> | 031-681-2926 | <NA> | <NA> | <NA> | <NA> | 2002-08-26 |
9 | 112684 | (주)에스에이티 | <NA> | 031-450-1300 | <NA> | <NA> | <NA> | <NA> | 2002-08-20 |
등록번호 | 상호명 | 전화번호 | 팩스번호 | 정제도로명주소 | 정제지번주소 | 정제WGS84위도 | 정제WGS84경도 | 등록일자 | |
---|---|---|---|---|---|---|---|---|---|
3055 | 120449 | 주식회사 빛이라 | <NA> | 031-323-3681 | <NA> | <NA> | <NA> | <NA> | 2000-08-28 |
3056 | 111559 | 지에스앤티(주) | <NA> | 031-768-6503 | <NA> | <NA> | <NA> | <NA> | 2000-08-21 |
3057 | 111553 | (주)삼정보안시스템 | <NA> | 031-211-1643 | <NA> | <NA> | <NA> | <NA> | 2000-08-21 |
3058 | 111560 | 삼일씨티에스(주) | <NA> | 031-720-5166 | <NA> | <NA> | <NA> | <NA> | 2000-08-21 |
3059 | 111551 | 주식회사 경도 | <NA> | 031-901-5506 | <NA> | <NA> | <NA> | <NA> | 2000-08-11 |
3060 | 111540 | 건영네트웍스(주) | <NA> | 031-211-2100 | <NA> | <NA> | <NA> | <NA> | 2000-08-07 |
3061 | 111507 | 우주미디어정보통신(주) | <NA> | 031-207-4104 | <NA> | <NA> | <NA> | <NA> | 2000-08-02 |
3062 | 111502 | (주)쏠리드(Solid, Inc.) | <NA> | 031-627-6009 | <NA> | <NA> | <NA> | <NA> | 2000-07-25 |
3063 | 350024 | (주)경기방재 | <NA> | 031-871-2771 | <NA> | <NA> | <NA> | <NA> | 2000-07-24 |
3064 | 111482 | (주)승진정보 | <NA> | 031-872-4202 | <NA> | <NA> | <NA> | <NA> | 2000-07-05 |
Most frequently occurring
등록번호 | 상호명 | 팩스번호 | 등록일자 | # duplicates | |
---|---|---|---|---|---|
0 | 202912 | 주식회사 엔씨씨디지탈 | 031-553-9317 | 2018-06-15 | 2 |
1 | 310590 | 주식회사 태원 | 031-541-8255 | 2009-09-29 | 2 |
2 | 530173 | 영한산업 주식회사 | 02-922-3533 | 2009-07-17 | 2 |
3 | 550186 | 주식회사 이에스(ES Co., Ltd.) | 053-715-1372 | 2012-03-29 | 2 |