Amino acid dipepetide frequency for Corynebacterium argentoratense DSM 44202

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.433AlaAla: 16.433 ± 0.222
1.125AlaCys: 1.125 ± 0.046
7.167AlaAsp: 7.167 ± 0.116
7.605AlaGlu: 7.605 ± 0.145
3.565AlaPhe: 3.565 ± 0.088
9.937AlaGly: 9.937 ± 0.144
2.679AlaHis: 2.679 ± 0.072
5.569AlaIle: 5.569 ± 0.1
3.728AlaLys: 3.728 ± 0.096
11.732AlaLeu: 11.732 ± 0.167
2.781AlaMet: 2.781 ± 0.081
2.677AlaAsn: 2.677 ± 0.073
5.211AlaPro: 5.211 ± 0.109
4.903AlaGln: 4.903 ± 0.113
7.141AlaArg: 7.141 ± 0.122
6.441AlaSer: 6.441 ± 0.109
7.003AlaThr: 7.003 ± 0.105
9.805AlaVal: 9.805 ± 0.159
1.484AlaTrp: 1.484 ± 0.052
2.413AlaTyr: 2.413 ± 0.066
0.0AlaXaa: 0.0 ± 0.0
Cys
1.132CysAla: 1.132 ± 0.05
0.101CysCys: 0.101 ± 0.012
0.602CysAsp: 0.602 ± 0.034
0.552CysGlu: 0.552 ± 0.028
0.305CysPhe: 0.305 ± 0.021
0.912CysGly: 0.912 ± 0.038
0.196CysHis: 0.196 ± 0.016
0.399CysIle: 0.399 ± 0.026
0.166CysLys: 0.166 ± 0.018
0.65CysLeu: 0.65 ± 0.03
0.174CysMet: 0.174 ± 0.016
0.234CysAsn: 0.234 ± 0.022
0.525CysPro: 0.525 ± 0.029
0.227CysGln: 0.227 ± 0.018
0.413CysArg: 0.413 ± 0.027
0.633CysSer: 0.633 ± 0.034
0.509CysThr: 0.509 ± 0.029
0.82CysVal: 0.82 ± 0.04
0.104CysTrp: 0.104 ± 0.015
0.179CysTyr: 0.179 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
8.125AspAla: 8.125 ± 0.134
0.562AspCys: 0.562 ± 0.032
3.905AspAsp: 3.905 ± 0.093
3.711AspGlu: 3.711 ± 0.089
1.946AspPhe: 1.946 ± 0.058
5.152AspGly: 5.152 ± 0.105
1.26AspHis: 1.26 ± 0.048
3.163AspIle: 3.163 ± 0.078
2.113AspLys: 2.113 ± 0.07
5.329AspLeu: 5.329 ± 0.089
1.324AspMet: 1.324 ± 0.05
2.022AspAsn: 2.022 ± 0.071
3.771AspPro: 3.771 ± 0.084
1.727AspGln: 1.727 ± 0.055
3.29AspArg: 3.29 ± 0.079
3.514AspSer: 3.514 ± 0.078
3.371AspThr: 3.371 ± 0.084
5.801AspVal: 5.801 ± 0.099
0.706AspTrp: 0.706 ± 0.036
1.568AspTyr: 1.568 ± 0.058
0.0AspXaa: 0.0 ± 0.0
Glu
6.801GluAla: 6.801 ± 0.124
0.457GluCys: 0.457 ± 0.026
3.438GluAsp: 3.438 ± 0.085
4.048GluGlu: 4.048 ± 0.099
2.067GluPhe: 2.067 ± 0.06
4.623GluGly: 4.623 ± 0.098
1.797GluHis: 1.797 ± 0.06
2.941GluIle: 2.941 ± 0.072
2.102GluLys: 2.102 ± 0.076
6.624GluLeu: 6.624 ± 0.12
1.213GluMet: 1.213 ± 0.046
1.401GluAsn: 1.401 ± 0.058
2.669GluPro: 2.669 ± 0.088
3.219GluGln: 3.219 ± 0.087
3.92GluArg: 3.92 ± 0.087
2.773GluSer: 2.773 ± 0.071
2.65GluThr: 2.65 ± 0.071
4.853GluVal: 4.853 ± 0.089
0.656GluTrp: 0.656 ± 0.032
1.353GluTyr: 1.353 ± 0.049
0.0GluXaa: 0.0 ± 0.0
Phe
3.769PheAla: 3.769 ± 0.075
0.308PheCys: 0.308 ± 0.025
2.498PheAsp: 2.498 ± 0.071
1.813PheGlu: 1.813 ± 0.057
1.364PhePhe: 1.364 ± 0.055
3.363PheGly: 3.363 ± 0.078
0.641PheHis: 0.641 ± 0.036
1.641PheIle: 1.641 ± 0.057
0.893PheLys: 0.893 ± 0.039
2.652PheLeu: 2.652 ± 0.075
0.658PheMet: 0.658 ± 0.036
1.139PheAsn: 1.139 ± 0.049
1.377PhePro: 1.377 ± 0.052
0.762PheGln: 0.762 ± 0.036
1.55PheArg: 1.55 ± 0.044
2.12PheSer: 2.12 ± 0.063
1.901PheThr: 1.901 ± 0.053
2.677PheVal: 2.677 ± 0.068
0.401PheTrp: 0.401 ± 0.029
0.827PheTyr: 0.827 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
9.049GlyAla: 9.049 ± 0.129
0.834GlyCys: 0.834 ± 0.04
4.716GlyAsp: 4.716 ± 0.096
4.717GlyGlu: 4.717 ± 0.096
3.207GlyPhe: 3.207 ± 0.071
7.021GlyGly: 7.021 ± 0.136
1.948GlyHis: 1.948 ± 0.057
4.512GlyIle: 4.512 ± 0.092
2.992GlyLys: 2.992 ± 0.075
8.107GlyLeu: 8.107 ± 0.126
2.21GlyMet: 2.21 ± 0.058
2.246GlyAsn: 2.246 ± 0.059
3.35GlyPro: 3.35 ± 0.071
2.824GlyGln: 2.824 ± 0.074
5.061GlyArg: 5.061 ± 0.09
5.13GlySer: 5.13 ± 0.115
4.999GlyThr: 4.999 ± 0.1
7.905GlyVal: 7.905 ± 0.114
1.305GlyTrp: 1.305 ± 0.042
2.185GlyTyr: 2.185 ± 0.07
0.0GlyXaa: 0.0 ± 0.0
His
2.412HisAla: 2.412 ± 0.067
0.252HisCys: 0.252 ± 0.022
1.348HisAsp: 1.348 ± 0.054
1.157HisGlu: 1.157 ± 0.038
0.696HisPhe: 0.696 ± 0.034
1.959HisGly: 1.959 ± 0.058
0.617HisHis: 0.617 ± 0.039
1.107HisIle: 1.107 ± 0.052
0.545HisLys: 0.545 ± 0.031
1.808HisLeu: 1.808 ± 0.059
0.509HisMet: 0.509 ± 0.029
0.759HisAsn: 0.759 ± 0.038
1.628HisPro: 1.628 ± 0.057
0.721HisGln: 0.721 ± 0.038
1.349HisArg: 1.349 ± 0.054
1.328HisSer: 1.328 ± 0.047
1.472HisThr: 1.472 ± 0.062
1.654HisVal: 1.654 ± 0.052
0.273HisTrp: 0.273 ± 0.022
0.564HisTyr: 0.564 ± 0.035
0.0HisXaa: 0.0 ± 0.0
Ile
6.388IleAla: 6.388 ± 0.121
0.431IleCys: 0.431 ± 0.027
4.117IleAsp: 4.117 ± 0.073
3.149IleGlu: 3.149 ± 0.072
1.421IlePhe: 1.421 ± 0.06
4.406IleGly: 4.406 ± 0.098
0.995IleHis: 0.995 ± 0.044
2.649IleIle: 2.649 ± 0.077
1.527IleLys: 1.527 ± 0.06
3.824IleLeu: 3.824 ± 0.089
0.92IleMet: 0.92 ± 0.04
1.75IleAsn: 1.75 ± 0.059
2.478IlePro: 2.478 ± 0.069
1.223IleGln: 1.223 ± 0.043
2.586IleArg: 2.586 ± 0.065
2.821IleSer: 2.821 ± 0.069
3.184IleThr: 3.184 ± 0.074
4.528IleVal: 4.528 ± 0.085
0.365IleTrp: 0.365 ± 0.027
0.932IleTyr: 0.932 ± 0.034
0.0IleXaa: 0.0 ± 0.0
Lys
3.867LysAla: 3.867 ± 0.112
0.128LysCys: 0.128 ± 0.015
2.039LysAsp: 2.039 ± 0.071
1.933LysGlu: 1.933 ± 0.061
0.855LysPhe: 0.855 ± 0.036
2.43LysGly: 2.43 ± 0.074
0.738LysHis: 0.738 ± 0.035
1.507LysIle: 1.507 ± 0.055
1.706LysLys: 1.706 ± 0.078
2.979LysLeu: 2.979 ± 0.075
0.767LysMet: 0.767 ± 0.038
1.066LysAsn: 1.066 ± 0.048
1.817LysPro: 1.817 ± 0.077
1.427LysGln: 1.427 ± 0.053
1.989LysArg: 1.989 ± 0.064
1.585LysSer: 1.585 ± 0.066
1.911LysThr: 1.911 ± 0.07
2.712LysVal: 2.712 ± 0.077
0.32LysTrp: 0.32 ± 0.022
0.62LysTyr: 0.62 ± 0.042
0.0LysXaa: 0.0 ± 0.0
Leu
11.628LeuAla: 11.628 ± 0.16
0.865LeuCys: 0.865 ± 0.038
6.249LeuAsp: 6.249 ± 0.109
5.074LeuGlu: 5.074 ± 0.112
2.808LeuPhe: 2.808 ± 0.074
8.285LeuGly: 8.285 ± 0.119
1.868LeuHis: 1.868 ± 0.066
4.615LeuIle: 4.615 ± 0.087
3.159LeuLys: 3.159 ± 0.082
8.745LeuLeu: 8.745 ± 0.146
2.011LeuMet: 2.011 ± 0.057
2.573LeuAsn: 2.573 ± 0.08
5.026LeuPro: 5.026 ± 0.098
2.685LeuGln: 2.685 ± 0.066
6.065LeuArg: 6.065 ± 0.106
6.294LeuSer: 6.294 ± 0.12
5.243LeuThr: 5.243 ± 0.095
7.456LeuVal: 7.456 ± 0.128
1.114LeuTrp: 1.114 ± 0.043
1.757LeuTyr: 1.757 ± 0.05
0.002LeuXaa: 0.002 ± 0.002
Met
2.533MetAla: 2.533 ± 0.067
0.207MetCys: 0.207 ± 0.018
1.188MetAsp: 1.188 ± 0.047
1.107MetGlu: 1.107 ± 0.045
0.751MetPhe: 0.751 ± 0.037
1.832MetGly: 1.832 ± 0.061
0.535MetHis: 0.535 ± 0.031
1.124MetIle: 1.124 ± 0.044
0.792MetLys: 0.792 ± 0.037
2.377MetLeu: 2.377 ± 0.072
0.555MetMet: 0.555 ± 0.032
0.744MetAsn: 0.744 ± 0.038
1.246MetPro: 1.246 ± 0.045
0.661MetGln: 0.661 ± 0.032
1.414MetArg: 1.414 ± 0.051
1.732MetSer: 1.732 ± 0.055
1.452MetThr: 1.452 ± 0.042
1.838MetVal: 1.838 ± 0.058
0.252MetTrp: 0.252 ± 0.021
0.409MetTyr: 0.409 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
3.022AsnAla: 3.022 ± 0.07
0.23AsnCys: 0.23 ± 0.018
1.653AsnAsp: 1.653 ± 0.062
1.421AsnGlu: 1.421 ± 0.056
0.897AsnPhe: 0.897 ± 0.041
2.191AsnGly: 2.191 ± 0.066
0.58AsnHis: 0.58 ± 0.034
1.419AsnIle: 1.419 ± 0.053
1.067AsnLys: 1.067 ± 0.046
2.533AsnLeu: 2.533 ± 0.074
0.666AsnMet: 0.666 ± 0.032
1.175AsnAsn: 1.175 ± 0.054
2.19AsnPro: 2.19 ± 0.069
1.018AsnGln: 1.018 ± 0.044
1.484AsnArg: 1.484 ± 0.052
1.664AsnSer: 1.664 ± 0.06
1.852AsnThr: 1.852 ± 0.07
2.176AsnVal: 2.176 ± 0.064
0.361AsnTrp: 0.361 ± 0.023
0.776AsnTyr: 0.776 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
5.8ProAla: 5.8 ± 0.123
0.332ProCys: 0.332 ± 0.024
3.241ProAsp: 3.241 ± 0.088
4.056ProGlu: 4.056 ± 0.102
1.533ProPhe: 1.533 ± 0.05
4.565ProGly: 4.565 ± 0.095
1.195ProHis: 1.195 ± 0.052
2.208ProIle: 2.208 ± 0.05
1.697ProLys: 1.697 ± 0.07
4.248ProLeu: 4.248 ± 0.082
1.157ProMet: 1.157 ± 0.038
1.56ProAsn: 1.56 ± 0.059
1.896ProPro: 1.896 ± 0.074
2.035ProGln: 2.035 ± 0.063
2.747ProArg: 2.747 ± 0.077
3.033ProSer: 3.033 ± 0.083
3.317ProThr: 3.317 ± 0.076
4.376ProVal: 4.376 ± 0.074
0.759ProTrp: 0.759 ± 0.041
1.106ProTyr: 1.106 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
4.437GlnAla: 4.437 ± 0.112
0.259GlnCys: 0.259 ± 0.021
1.578GlnAsp: 1.578 ± 0.051
1.798GlnGlu: 1.798 ± 0.05
0.985GlnPhe: 0.985 ± 0.055
2.622GlnGly: 2.622 ± 0.078
0.855GlnHis: 0.855 ± 0.039
1.641GlnIle: 1.641 ± 0.05
0.975GlnLys: 0.975 ± 0.042
4.073GlnLeu: 4.073 ± 0.093
0.822GlnMet: 0.822 ± 0.034
0.704GlnAsn: 0.704 ± 0.035
2.118GlnPro: 2.118 ± 0.071
2.072GlnGln: 2.072 ± 0.072
2.762GlnArg: 2.762 ± 0.072
1.702GlnSer: 1.702 ± 0.05
1.444GlnThr: 1.444 ± 0.055
2.621GlnVal: 2.621 ± 0.061
0.749GlnTrp: 0.749 ± 0.038
0.635GlnTyr: 0.635 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
6.557ArgAla: 6.557 ± 0.136
0.494ArgCys: 0.494 ± 0.027
3.536ArgAsp: 3.536 ± 0.085
3.86ArgGlu: 3.86 ± 0.078
2.009ArgPhe: 2.009 ± 0.063
4.779ArgGly: 4.779 ± 0.088
1.318ArgHis: 1.318 ± 0.047
3.154ArgIle: 3.154 ± 0.068
1.903ArgLys: 1.903 ± 0.063
5.424ArgLeu: 5.424 ± 0.115
1.596ArgMet: 1.596 ± 0.051
1.735ArgAsn: 1.735 ± 0.058
2.849ArgPro: 2.849 ± 0.079
2.107ArgGln: 2.107 ± 0.062
4.736ArgArg: 4.736 ± 0.115
3.512ArgSer: 3.512 ± 0.081
3.544ArgThr: 3.544 ± 0.08
4.941ArgVal: 4.941 ± 0.11
0.9ArgTrp: 0.9 ± 0.043
1.545ArgTyr: 1.545 ± 0.05
0.0ArgXaa: 0.0 ± 0.0
Ser
6.471SerAla: 6.471 ± 0.111
0.444SerCys: 0.444 ± 0.029
3.501SerAsp: 3.501 ± 0.08
3.34SerGlu: 3.34 ± 0.08
2.009SerPhe: 2.009 ± 0.065
5.543SerGly: 5.543 ± 0.106
1.164SerHis: 1.164 ± 0.048
2.796SerIle: 2.796 ± 0.075
1.838SerLys: 1.838 ± 0.063
5.157SerLeu: 5.157 ± 0.091
1.497SerMet: 1.497 ± 0.049
1.687SerAsn: 1.687 ± 0.057
2.921SerPro: 2.921 ± 0.073
1.865SerGln: 1.865 ± 0.051
3.539SerArg: 3.539 ± 0.075
4.023SerSer: 4.023 ± 0.107
4.046SerThr: 4.046 ± 0.094
4.766SerVal: 4.766 ± 0.089
0.827SerTrp: 0.827 ± 0.035
1.319SerTyr: 1.319 ± 0.053
0.0SerXaa: 0.0 ± 0.0
Thr
6.4ThrAla: 6.4 ± 0.119
0.539ThrCys: 0.539 ± 0.035
3.153ThrAsp: 3.153 ± 0.077
2.916ThrGlu: 2.916 ± 0.069
1.918ThrPhe: 1.918 ± 0.054
4.969ThrGly: 4.969 ± 0.102
1.387ThrHis: 1.387 ± 0.053
3.275ThrIle: 3.275 ± 0.074
1.757ThrLys: 1.757 ± 0.067
5.646ThrLeu: 5.646 ± 0.1
1.293ThrMet: 1.293 ± 0.04
1.651ThrAsn: 1.651 ± 0.057
4.003ThrPro: 4.003 ± 0.085
1.973ThrGln: 1.973 ± 0.058
3.126ThrArg: 3.126 ± 0.071
3.38ThrSer: 3.38 ± 0.082
3.998ThrThr: 3.998 ± 0.089
5.172ThrVal: 5.172 ± 0.1
0.834ThrTrp: 0.834 ± 0.039
1.377ThrTyr: 1.377 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
10.764ValAla: 10.764 ± 0.172
0.89ValCys: 0.89 ± 0.036
6.32ValAsp: 6.32 ± 0.109
5.359ValGlu: 5.359 ± 0.111
2.758ValPhe: 2.758 ± 0.078
6.707ValGly: 6.707 ± 0.123
1.712ValHis: 1.712 ± 0.054
4.338ValIle: 4.338 ± 0.101
2.367ValLys: 2.367 ± 0.076
8.175ValLeu: 8.175 ± 0.144
1.832ValMet: 1.832 ± 0.061
2.185ValAsn: 2.185 ± 0.062
4.031ValPro: 4.031 ± 0.094
2.161ValGln: 2.161 ± 0.064
5.017ValArg: 5.017 ± 0.095
4.935ValSer: 4.935 ± 0.086
4.842ValThr: 4.842 ± 0.102
8.495ValVal: 8.495 ± 0.154
0.998ValTrp: 0.998 ± 0.045
1.596ValTyr: 1.596 ± 0.052
0.0ValXaa: 0.0 ± 0.0
Trp
1.384TrpAla: 1.384 ± 0.053
0.171TrpCys: 0.171 ± 0.018
0.673TrpAsp: 0.673 ± 0.032
0.723TrpGlu: 0.723 ± 0.035
0.511TrpPhe: 0.511 ± 0.03
0.946TrpGly: 0.946 ± 0.039
0.259TrpHis: 0.259 ± 0.02
0.671TrpIle: 0.671 ± 0.033
0.438TrpLys: 0.438 ± 0.027
1.435TrpLeu: 1.435 ± 0.061
0.378TrpMet: 0.378 ± 0.025
0.401TrpAsn: 0.401 ± 0.029
0.545TrpPro: 0.545 ± 0.036
0.507TrpGln: 0.507 ± 0.028
0.875TrpArg: 0.875 ± 0.039
0.812TrpSer: 0.812 ± 0.034
0.641TrpThr: 0.641 ± 0.03
1.135TrpVal: 1.135 ± 0.047
0.32TrpTrp: 0.32 ± 0.024
0.252TrpTyr: 0.252 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.428TyrAla: 2.428 ± 0.075
0.204TyrCys: 0.204 ± 0.018
1.442TyrAsp: 1.442 ± 0.048
1.263TyrGlu: 1.263 ± 0.045
0.753TyrPhe: 0.753 ± 0.036
2.075TyrGly: 2.075 ± 0.055
0.416TyrHis: 0.416 ± 0.027
0.965TyrIle: 0.965 ± 0.038
0.63TyrLys: 0.63 ± 0.032
2.002TyrLeu: 2.002 ± 0.061
0.386TyrMet: 0.386 ± 0.024
0.665TyrAsn: 0.665 ± 0.031
1.208TyrPro: 1.208 ± 0.045
0.733TyrGln: 0.733 ± 0.036
1.437TyrArg: 1.437 ± 0.059
1.319TyrSer: 1.319 ± 0.043
1.386TyrThr: 1.386 ± 0.052
1.75TyrVal: 1.75 ± 0.051
0.35TyrTrp: 0.35 ± 0.025
0.525TyrTyr: 0.525 ± 0.034
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.002XaaGlu: 0.002 ± 0.002
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1870 proteins (603294 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski