Amino acid dipeptide frequency for Deinococcus aerius

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
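As a rough illustration of how such a frequency can be computed, here is a minimal Python sketch. It assumes one-letter protein sequences, counts overlapping adjacent pairs within each protein, and maps any non-standard letter to X (Xaa). The function name dipeptide_permille and the toy sequences are made up for illustration; the exact counting procedure used for this page may differ.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_permille(["MAAL", "ALG"])
print(freqs)
```

The two toy sequences contain five adjacent pairs in total, two of which are AL, so AL accounts for two fifths of all pairs.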

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every dipeptide equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
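The arithmetic behind this expectation can be sketched in a few lines of Python. The helper names below are hypothetical, and comparing each value against the uniform expectation is an assumption about how the red/blue marking is decided; the page does not state the exact threshold it uses.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (in per mille) if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"    # would be marked red
    if observed_permille < expected_permille:
        return "under"   # would be marked blue
    return "as expected"

expected = uniform_expectation_permille()        # 20 x 20 = 400 dipeptides
expected_xaa = uniform_expectation_permille(21)  # 21 x 21 = 441 dipeptides

# Two values taken from the table: AlaAla and CysCys.
print(expected, classify(16.615, expected), classify(0.069, expected))
```

With 400 dipeptides the expectation is 2.5‰ (0.25%); with Xaa included it drops to 1000/441, roughly 2.27‰.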

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Each entry reads FirstSecond: frequency ± error (‰). Entries are grouped by first residue; within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 16.615 ± 0.175
AlaCys: 0.884 ± 0.032
AlaAsp: 5.685 ± 0.067
AlaGlu: 6.836 ± 0.086
AlaPhe: 4.244 ± 0.062
AlaGly: 11.76 ± 0.103
AlaHis: 2.674 ± 0.047
AlaIle: 3.256 ± 0.053
AlaLys: 2.026 ± 0.049
AlaLeu: 16.802 ± 0.192
AlaMet: 2.12 ± 0.043
AlaAsn: 2.312 ± 0.045
AlaPro: 6.577 ± 0.088
AlaGln: 4.833 ± 0.066
AlaArg: 11.725 ± 0.119
AlaSer: 5.54 ± 0.063
AlaThr: 5.557 ± 0.084
AlaVal: 9.04 ± 0.079
AlaTrp: 1.92 ± 0.04
AlaTyr: 2.848 ± 0.046
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.711 ± 0.026
CysCys: 0.069 ± 0.007
CysAsp: 0.292 ± 0.014
CysGlu: 0.305 ± 0.018
CysPhe: 0.162 ± 0.012
CysGly: 0.743 ± 0.025
CysHis: 0.14 ± 0.011
CysIle: 0.183 ± 0.013
CysLys: 0.089 ± 0.009
CysLeu: 0.58 ± 0.022
CysMet: 0.089 ± 0.008
CysAsn: 0.124 ± 0.011
CysPro: 0.437 ± 0.017
CysGln: 0.151 ± 0.01
CysArg: 0.418 ± 0.018
CysSer: 0.275 ± 0.014
CysThr: 0.374 ± 0.017
CysVal: 0.437 ± 0.017
CysTrp: 0.071 ± 0.008
CysTyr: 0.132 ± 0.01
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.174 ± 0.069
AspCys: 0.241 ± 0.013
AspAsp: 2.276 ± 0.054
AspGlu: 3.08 ± 0.051
AspPhe: 1.833 ± 0.038
AspGly: 4.603 ± 0.063
AspHis: 1.103 ± 0.029
AspIle: 1.567 ± 0.038
AspLys: 0.867 ± 0.031
AspLeu: 7.046 ± 0.095
AspMet: 0.755 ± 0.026
AspAsn: 0.886 ± 0.029
AspPro: 3.932 ± 0.063
AspGln: 1.329 ± 0.031
AspArg: 3.516 ± 0.059
AspSer: 1.737 ± 0.039
AspThr: 2.61 ± 0.052
AspVal: 4.311 ± 0.055
AspTrp: 0.802 ± 0.025
AspTyr: 1.144 ± 0.032
AspXaa: 0.0 ± 0.0

Glu
GluAla: 8.059 ± 0.108
GluCys: 0.258 ± 0.015
GluAsp: 2.827 ± 0.052
GluGlu: 3.644 ± 0.065
GluPhe: 1.643 ± 0.035
GluGly: 5.461 ± 0.078
GluHis: 1.322 ± 0.029
GluIle: 1.831 ± 0.045
GluLys: 1.408 ± 0.042
GluLeu: 6.039 ± 0.078
GluMet: 1.042 ± 0.033
GluAsn: 1.309 ± 0.031
GluPro: 2.618 ± 0.049
GluGln: 2.171 ± 0.045
GluArg: 6.038 ± 0.078
GluSer: 1.974 ± 0.039
GluThr: 2.98 ± 0.058
GluVal: 5.419 ± 0.075
GluTrp: 0.895 ± 0.028
GluTyr: 1.42 ± 0.037
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.64 ± 0.057
PheCys: 0.199 ± 0.011
PheAsp: 1.848 ± 0.04
PheGlu: 1.744 ± 0.035
PhePhe: 1.021 ± 0.032
PheGly: 3.255 ± 0.05
PheHis: 0.622 ± 0.021
PheIle: 1.062 ± 0.031
PheLys: 0.678 ± 0.025
PheLeu: 3.52 ± 0.058
PheMet: 0.493 ± 0.018
PheAsn: 0.778 ± 0.028
PhePro: 1.855 ± 0.037
PheGln: 0.995 ± 0.028
PheArg: 2.249 ± 0.044
PheSer: 1.69 ± 0.039
PheThr: 2.127 ± 0.039
PheVal: 2.554 ± 0.045
PheTrp: 0.514 ± 0.021
PheTyr: 0.765 ± 0.025
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.346 ± 0.098
GlyCys: 0.579 ± 0.022
GlyAsp: 4.558 ± 0.065
GlyGlu: 6.423 ± 0.079
GlyPhe: 3.031 ± 0.054
GlyGly: 9.524 ± 0.134
GlyHis: 2.096 ± 0.038
GlyIle: 3.032 ± 0.057
GlyLys: 2.667 ± 0.06
GlyLeu: 10.899 ± 0.114
GlyMet: 2.012 ± 0.043
GlyAsn: 2.311 ± 0.048
GlyPro: 4.161 ± 0.067
GlyGln: 4.039 ± 0.074
GlyArg: 7.616 ± 0.085
GlySer: 4.387 ± 0.064
GlyThr: 6.086 ± 0.097
GlyVal: 8.382 ± 0.087
GlyTrp: 1.698 ± 0.043
GlyTyr: 2.487 ± 0.047
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.773 ± 0.059
HisCys: 0.148 ± 0.011
HisAsp: 1.098 ± 0.031
HisGlu: 1.136 ± 0.036
HisPhe: 0.749 ± 0.025
HisGly: 1.966 ± 0.039
HisHis: 0.681 ± 0.024
HisIle: 0.643 ± 0.023
HisLys: 0.33 ± 0.016
HisLeu: 2.947 ± 0.054
HisMet: 0.289 ± 0.015
HisAsn: 0.424 ± 0.02
HisPro: 1.845 ± 0.043
HisGln: 0.519 ± 0.023
HisArg: 1.52 ± 0.033
HisSer: 0.815 ± 0.028
HisThr: 1.147 ± 0.032
HisVal: 1.702 ± 0.036
HisTrp: 0.314 ± 0.015
HisTyr: 0.553 ± 0.02
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.589 ± 0.056
IleCys: 0.208 ± 0.012
IleAsp: 1.681 ± 0.038
IleGlu: 2.0 ± 0.052
IlePhe: 0.957 ± 0.029
IleGly: 3.092 ± 0.059
IleHis: 0.73 ± 0.027
IleIle: 1.133 ± 0.036
IleLys: 0.706 ± 0.03
IleLeu: 3.36 ± 0.048
IleMet: 0.479 ± 0.023
IleAsn: 0.829 ± 0.029
IlePro: 1.97 ± 0.037
IleGln: 1.013 ± 0.028
IleArg: 2.395 ± 0.049
IleSer: 1.628 ± 0.037
IleThr: 1.95 ± 0.047
IleVal: 2.61 ± 0.043
IleTrp: 0.317 ± 0.016
IleTyr: 0.735 ± 0.025
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.609 ± 0.058
LysCys: 0.087 ± 0.01
LysAsp: 1.074 ± 0.032
LysGlu: 0.998 ± 0.035
LysPhe: 0.645 ± 0.027
LysGly: 1.801 ± 0.046
LysHis: 0.397 ± 0.018
LysIle: 0.775 ± 0.03
LysLys: 0.793 ± 0.037
LysLeu: 2.184 ± 0.046
LysMet: 0.456 ± 0.02
LysAsn: 0.701 ± 0.027
LysPro: 1.387 ± 0.037
LysGln: 0.669 ± 0.025
LysArg: 1.534 ± 0.036
LysSer: 0.95 ± 0.03
LysThr: 1.445 ± 0.033
LysVal: 1.886 ± 0.048
LysTrp: 0.24 ± 0.013
LysTyr: 0.596 ± 0.023
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 15.699 ± 0.168
LeuCys: 0.664 ± 0.023
LeuAsp: 6.289 ± 0.077
LeuGlu: 6.421 ± 0.083
LeuPhe: 2.983 ± 0.06
LeuGly: 11.424 ± 0.138
LeuHis: 2.655 ± 0.052
LeuIle: 4.052 ± 0.06
LeuLys: 2.58 ± 0.05
LeuLeu: 13.655 ± 0.178
LeuMet: 1.845 ± 0.034
LeuAsn: 2.979 ± 0.045
LeuPro: 7.797 ± 0.08
LeuGln: 2.9 ± 0.048
LeuArg: 9.873 ± 0.11
LeuSer: 6.47 ± 0.074
LeuThr: 8.214 ± 0.094
LeuVal: 7.886 ± 0.093
LeuTrp: 1.455 ± 0.037
LeuTyr: 2.475 ± 0.051
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.662 ± 0.043
MetCys: 0.066 ± 0.007
MetAsp: 0.782 ± 0.024
MetGlu: 0.767 ± 0.028
MetPhe: 0.454 ± 0.022
MetGly: 1.422 ± 0.035
MetHis: 0.344 ± 0.017
MetIle: 0.658 ± 0.023
MetLys: 0.593 ± 0.021
MetLeu: 1.853 ± 0.031
MetMet: 0.301 ± 0.016
MetAsn: 0.681 ± 0.02
MetPro: 1.062 ± 0.028
MetGln: 0.602 ± 0.022
MetArg: 1.449 ± 0.034
MetSer: 1.025 ± 0.029
MetThr: 1.843 ± 0.035
MetVal: 1.078 ± 0.029
MetTrp: 0.185 ± 0.011
MetTyr: 0.365 ± 0.016
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.966 ± 0.048
AsnCys: 0.152 ± 0.011
AsnAsp: 1.02 ± 0.03
AsnGlu: 0.988 ± 0.026
AsnPhe: 0.887 ± 0.029
AsnGly: 2.104 ± 0.047
AsnHis: 0.442 ± 0.018
AsnIle: 0.937 ± 0.027
AsnLys: 0.511 ± 0.021
AsnLeu: 2.924 ± 0.051
AsnMet: 0.409 ± 0.018
AsnAsn: 0.607 ± 0.029
AsnPro: 1.901 ± 0.042
AsnGln: 0.578 ± 0.023
AsnArg: 1.485 ± 0.032
AsnSer: 1.041 ± 0.034
AsnThr: 1.402 ± 0.043
AsnVal: 2.18 ± 0.047
AsnTrp: 0.371 ± 0.019
AsnTyr: 0.688 ± 0.024
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.475 ± 0.104
ProCys: 0.293 ± 0.019
ProAsp: 4.087 ± 0.065
ProGlu: 4.738 ± 0.076
ProPhe: 1.907 ± 0.04
ProGly: 7.083 ± 0.091
ProHis: 1.452 ± 0.035
ProIle: 1.614 ± 0.037
ProLys: 1.104 ± 0.033
ProLeu: 6.611 ± 0.087
ProMet: 1.004 ± 0.029
ProAsn: 1.431 ± 0.037
ProPro: 3.832 ± 0.068
ProGln: 2.068 ± 0.046
ProArg: 3.992 ± 0.061
ProSer: 2.828 ± 0.049
ProThr: 3.343 ± 0.059
ProVal: 4.78 ± 0.073
ProTrp: 0.808 ± 0.025
ProTyr: 1.338 ± 0.032
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.831 ± 0.079
GlnCys: 0.147 ± 0.011
GlnAsp: 1.724 ± 0.041
GlnGlu: 1.972 ± 0.044
GlnPhe: 0.966 ± 0.03
GlnGly: 3.418 ± 0.057
GlnHis: 0.626 ± 0.022
GlnIle: 1.09 ± 0.031
GlnLys: 0.882 ± 0.029
GlnLeu: 2.91 ± 0.045
GlnMet: 0.632 ± 0.025
GlnAsn: 0.938 ± 0.031
GlnPro: 2.244 ± 0.051
GlnGln: 1.15 ± 0.033
GlnArg: 2.296 ± 0.044
GlnSer: 1.377 ± 0.034
GlnThr: 1.972 ± 0.037
GlnVal: 2.962 ± 0.049
GlnTrp: 0.354 ± 0.019
GlnTyr: 0.673 ± 0.024
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 10.538 ± 0.121
ArgCys: 0.385 ± 0.018
ArgAsp: 4.244 ± 0.067
ArgGlu: 6.215 ± 0.081
ArgPhe: 2.563 ± 0.049
ArgGly: 6.863 ± 0.088
ArgHis: 1.846 ± 0.037
ArgIle: 2.368 ± 0.042
ArgLys: 1.526 ± 0.041
ArgLeu: 9.58 ± 0.093
ArgMet: 1.616 ± 0.037
ArgAsn: 1.633 ± 0.036
ArgPro: 4.717 ± 0.066
ArgGln: 2.777 ± 0.048
ArgArg: 6.589 ± 0.081
ArgSer: 3.289 ± 0.046
ArgThr: 4.319 ± 0.058
ArgVal: 7.469 ± 0.083
ArgTrp: 1.203 ± 0.031
ArgTyr: 2.016 ± 0.04
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.429 ± 0.065
SerCys: 0.3 ± 0.017
SerAsp: 2.044 ± 0.043
SerGlu: 2.24 ± 0.042
SerPhe: 1.601 ± 0.038
SerGly: 5.573 ± 0.086
SerHis: 0.903 ± 0.024
SerIle: 1.428 ± 0.034
SerLys: 0.933 ± 0.028
SerLeu: 5.193 ± 0.063
SerMet: 0.796 ± 0.022
SerAsn: 1.042 ± 0.032
SerPro: 3.342 ± 0.057
SerGln: 1.35 ± 0.035
SerArg: 3.573 ± 0.051
SerSer: 2.414 ± 0.054
SerThr: 2.431 ± 0.051
SerVal: 3.797 ± 0.052
SerTrp: 0.596 ± 0.023
SerTyr: 1.058 ± 0.03
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.612 ± 0.1
ThrCys: 0.374 ± 0.019
ThrAsp: 2.781 ± 0.042
ThrGlu: 2.5 ± 0.04
ThrPhe: 2.261 ± 0.043
ThrGly: 5.695 ± 0.077
ThrHis: 1.297 ± 0.038
ThrIle: 1.635 ± 0.041
ThrLys: 0.91 ± 0.033
ThrLeu: 8.501 ± 0.101
ThrMet: 0.745 ± 0.024
ThrAsn: 1.245 ± 0.038
ThrPro: 4.925 ± 0.073
ThrGln: 1.668 ± 0.041
ThrArg: 4.77 ± 0.053
ThrSer: 2.767 ± 0.054
ThrThr: 3.302 ± 0.082
ThrVal: 5.247 ± 0.074
ThrTrp: 0.903 ± 0.028
ThrTyr: 1.542 ± 0.037
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.83 ± 0.083
ValCys: 0.491 ± 0.018
ValAsp: 3.567 ± 0.057
ValGlu: 4.415 ± 0.065
ValPhe: 2.472 ± 0.04
ValGly: 6.874 ± 0.077
ValHis: 1.382 ± 0.032
ValIle: 3.154 ± 0.056
ValLys: 1.884 ± 0.04
ValLeu: 9.316 ± 0.115
ValMet: 1.53 ± 0.034
ValAsn: 2.297 ± 0.05
ValPro: 5.004 ± 0.067
ValGln: 2.95 ± 0.05
ValArg: 7.103 ± 0.076
ValSer: 4.112 ± 0.059
ValThr: 6.071 ± 0.08
ValVal: 6.309 ± 0.079
ValTrp: 1.212 ± 0.031
ValTyr: 1.942 ± 0.045
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.611 ± 0.032
TrpCys: 0.09 ± 0.008
TrpAsp: 0.667 ± 0.023
TrpGlu: 0.741 ± 0.023
TrpPhe: 0.429 ± 0.017
TrpGly: 1.213 ± 0.036
TrpHis: 0.353 ± 0.016
TrpIle: 0.365 ± 0.017
TrpLys: 0.294 ± 0.015
TrpLeu: 1.757 ± 0.043
TrpMet: 0.306 ± 0.016
TrpAsn: 0.499 ± 0.018
TrpPro: 0.813 ± 0.022
TrpGln: 0.643 ± 0.023
TrpArg: 1.423 ± 0.033
TrpSer: 0.611 ± 0.021
TrpThr: 0.967 ± 0.028
TrpVal: 1.133 ± 0.034
TrpTrp: 0.312 ± 0.017
TrpTyr: 0.292 ± 0.015
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.087 ± 0.053
TyrCys: 0.148 ± 0.013
TyrAsp: 1.266 ± 0.03
TyrGlu: 1.194 ± 0.034
TyrPhe: 0.805 ± 0.025
TyrGly: 2.458 ± 0.046
TyrHis: 0.533 ± 0.024
TyrIle: 0.596 ± 0.022
TyrLys: 0.447 ± 0.021
TyrLeu: 2.638 ± 0.05
TyrMet: 0.277 ± 0.016
TyrAsn: 0.567 ± 0.021
TyrPro: 1.373 ± 0.033
TyrGln: 0.758 ± 0.027
TyrArg: 2.19 ± 0.042
TyrSer: 1.112 ± 0.031
TyrThr: 1.54 ± 0.037
TyrVal: 1.732 ± 0.038
TyrTrp: 0.348 ± 0.016
TyrTyr: 0.611 ± 0.024
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4445 proteins (1341196 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
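A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is reported as the error. The function name bootstrap_error, the fixed seed, and the choice of standard deviation as the spread measure are assumptions, not details confirmed by the database.

```python
import random
from collections import Counter
from statistics import stdev

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Bootstrap error (per mille) of one dipeptide frequency, resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Count adjacent pairs once per protein; resampling then reuses these counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Toy proteome (hypothetical sequences).
toy = ["MAAL", "ALG", "GGAL", "MKV"]
err_al = bootstrap_error(toy, "AL")
print(err_al)
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, so the estimate reflects variation between proteins.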

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski