Amino acid dipepetide frequency for Pontibacillus litoralis JSM 072002

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.938AlaAla: 4.938 ± 0.096
0.621AlaCys: 0.621 ± 0.024
3.147AlaAsp: 3.147 ± 0.061
4.337AlaGlu: 4.337 ± 0.077
3.305AlaPhe: 3.305 ± 0.068
4.665AlaGly: 4.665 ± 0.091
1.473AlaHis: 1.473 ± 0.044
6.089AlaIle: 6.089 ± 0.097
4.566AlaLys: 4.566 ± 0.076
7.062AlaLeu: 7.062 ± 0.113
2.254AlaMet: 2.254 ± 0.047
3.006AlaAsn: 3.006 ± 0.061
2.062AlaPro: 2.062 ± 0.057
2.567AlaGln: 2.567 ± 0.055
2.524AlaArg: 2.524 ± 0.058
4.289AlaSer: 4.289 ± 0.07
3.932AlaThr: 3.932 ± 0.064
4.876AlaVal: 4.876 ± 0.072
0.565AlaTrp: 0.565 ± 0.027
2.555AlaTyr: 2.555 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
0.464CysAla: 0.464 ± 0.024
0.093CysCys: 0.093 ± 0.01
0.425CysAsp: 0.425 ± 0.023
0.407CysGlu: 0.407 ± 0.022
0.317CysPhe: 0.317 ± 0.016
0.641CysGly: 0.641 ± 0.028
0.214CysHis: 0.214 ± 0.017
0.539CysIle: 0.539 ± 0.025
0.393CysLys: 0.393 ± 0.02
0.603CysLeu: 0.603 ± 0.028
0.217CysMet: 0.217 ± 0.016
0.322CysAsn: 0.322 ± 0.017
0.337CysPro: 0.337 ± 0.021
0.258CysGln: 0.258 ± 0.019
0.243CysArg: 0.243 ± 0.017
0.445CysSer: 0.445 ± 0.022
0.445CysThr: 0.445 ± 0.024
0.503CysVal: 0.503 ± 0.022
0.053CysTrp: 0.053 ± 0.008
0.261CysTyr: 0.261 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
3.407AspAla: 3.407 ± 0.058
0.377AspCys: 0.377 ± 0.02
2.648AspAsp: 2.648 ± 0.065
4.527AspGlu: 4.527 ± 0.084
2.213AspPhe: 2.213 ± 0.05
3.187AspGly: 3.187 ± 0.07
1.206AspHis: 1.206 ± 0.036
4.235AspIle: 4.235 ± 0.075
3.228AspLys: 3.228 ± 0.06
4.514AspLeu: 4.514 ± 0.077
1.457AspMet: 1.457 ± 0.048
1.984AspAsn: 1.984 ± 0.052
1.928AspPro: 1.928 ± 0.052
2.113AspGln: 2.113 ± 0.055
2.145AspArg: 2.145 ± 0.048
2.395AspSer: 2.395 ± 0.053
2.48AspThr: 2.48 ± 0.049
4.257AspVal: 4.257 ± 0.069
0.673AspTrp: 0.673 ± 0.029
2.272AspTyr: 2.272 ± 0.052
0.0AspXaa: 0.0 ± 0.0
Glu
5.766GluAla: 5.766 ± 0.09
0.367GluCys: 0.367 ± 0.021
4.101GluAsp: 4.101 ± 0.071
7.933GluGlu: 7.933 ± 0.131
2.229GluPhe: 2.229 ± 0.053
4.52GluGly: 4.52 ± 0.071
1.886GluHis: 1.886 ± 0.048
5.013GluIle: 5.013 ± 0.083
5.767GluLys: 5.767 ± 0.094
6.492GluLeu: 6.492 ± 0.099
2.539GluMet: 2.539 ± 0.054
3.261GluAsn: 3.261 ± 0.072
1.915GluPro: 1.915 ± 0.049
4.959GluGln: 4.959 ± 0.085
3.854GluArg: 3.854 ± 0.076
3.428GluSer: 3.428 ± 0.067
3.75GluThr: 3.75 ± 0.069
5.434GluVal: 5.434 ± 0.09
0.892GluTrp: 0.892 ± 0.034
2.211GluTyr: 2.211 ± 0.055
0.0GluXaa: 0.0 ± 0.0
Phe
2.933PheAla: 2.933 ± 0.061
0.3PheCys: 0.3 ± 0.02
2.188PheAsp: 2.188 ± 0.051
2.627PheGlu: 2.627 ± 0.058
2.137PhePhe: 2.137 ± 0.06
3.153PheGly: 3.153 ± 0.078
1.119PheHis: 1.119 ± 0.035
3.834PheIle: 3.834 ± 0.08
2.017PheLys: 2.017 ± 0.051
4.22PheLeu: 4.22 ± 0.088
1.311PheMet: 1.311 ± 0.037
1.682PheAsn: 1.682 ± 0.044
1.626PhePro: 1.626 ± 0.042
1.834PheGln: 1.834 ± 0.052
1.513PheArg: 1.513 ± 0.039
2.93PheSer: 2.93 ± 0.057
2.599PheThr: 2.599 ± 0.054
3.127PheVal: 3.127 ± 0.066
0.395PheTrp: 0.395 ± 0.022
1.574PheTyr: 1.574 ± 0.045
0.0PheXaa: 0.0 ± 0.0
Gly
4.691GlyAla: 4.691 ± 0.091
0.609GlyCys: 0.609 ± 0.025
3.336GlyAsp: 3.336 ± 0.075
4.661GlyGlu: 4.661 ± 0.081
3.094GlyPhe: 3.094 ± 0.057
4.549GlyGly: 4.549 ± 0.078
1.384GlyHis: 1.384 ± 0.041
5.492GlyIle: 5.492 ± 0.084
4.573GlyLys: 4.573 ± 0.075
5.966GlyLeu: 5.966 ± 0.096
2.181GlyMet: 2.181 ± 0.044
2.581GlyAsn: 2.581 ± 0.057
1.641GlyPro: 1.641 ± 0.049
2.202GlyGln: 2.202 ± 0.051
2.457GlyArg: 2.457 ± 0.057
3.797GlySer: 3.797 ± 0.061
3.875GlyThr: 3.875 ± 0.066
5.205GlyVal: 5.205 ± 0.096
0.72GlyTrp: 0.72 ± 0.029
2.773GlyTyr: 2.773 ± 0.064
0.0GlyXaa: 0.0 ± 0.0
His
1.513HisAla: 1.513 ± 0.044
0.201HisCys: 0.201 ± 0.017
1.234HisAsp: 1.234 ± 0.043
1.57HisGlu: 1.57 ± 0.04
1.179HisPhe: 1.179 ± 0.036
1.355HisGly: 1.355 ± 0.039
0.879HisHis: 0.879 ± 0.035
1.896HisIle: 1.896 ± 0.042
1.258HisLys: 1.258 ± 0.039
2.347HisLeu: 2.347 ± 0.052
0.643HisMet: 0.643 ± 0.028
1.01HisAsn: 1.01 ± 0.031
1.238HisPro: 1.238 ± 0.037
1.108HisGln: 1.108 ± 0.038
1.041HisArg: 1.041 ± 0.037
1.336HisSer: 1.336 ± 0.038
1.282HisThr: 1.282 ± 0.04
1.828HisVal: 1.828 ± 0.042
0.228HisTrp: 0.228 ± 0.016
1.082HisTyr: 1.082 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
6.02IleAla: 6.02 ± 0.1
0.584IleCys: 0.584 ± 0.024
4.334IleAsp: 4.334 ± 0.075
5.75IleGlu: 5.75 ± 0.095
2.962IlePhe: 2.962 ± 0.067
5.979IleGly: 5.979 ± 0.101
1.995IleHis: 1.995 ± 0.052
5.773IleIle: 5.773 ± 0.109
3.9IleLys: 3.9 ± 0.073
6.305IleLeu: 6.305 ± 0.098
1.922IleMet: 1.922 ± 0.042
3.004IleAsn: 3.004 ± 0.061
3.399IlePro: 3.399 ± 0.063
3.571IleGln: 3.571 ± 0.06
2.889IleArg: 2.889 ± 0.053
4.661IleSer: 4.661 ± 0.084
4.502IleThr: 4.502 ± 0.079
5.938IleVal: 5.938 ± 0.086
0.626IleTrp: 0.626 ± 0.028
2.439IleTyr: 2.439 ± 0.055
0.0IleXaa: 0.0 ± 0.0
Lys
4.469LysAla: 4.469 ± 0.086
0.31LysCys: 0.31 ± 0.02
3.415LysAsp: 3.415 ± 0.059
6.796LysGlu: 6.796 ± 0.111
1.494LysPhe: 1.494 ± 0.039
4.307LysGly: 4.307 ± 0.075
1.588LysHis: 1.588 ± 0.043
3.565LysIle: 3.565 ± 0.066
5.166LysLys: 5.166 ± 0.094
5.147LysLeu: 5.147 ± 0.095
2.067LysMet: 2.067 ± 0.05
2.668LysAsn: 2.668 ± 0.059
2.169LysPro: 2.169 ± 0.051
4.256LysGln: 4.256 ± 0.08
3.546LysArg: 3.546 ± 0.065
3.242LysSer: 3.242 ± 0.066
3.126LysThr: 3.126 ± 0.06
4.498LysVal: 4.498 ± 0.069
0.826LysTrp: 0.826 ± 0.029
2.019LysTyr: 2.019 ± 0.052
0.0LysXaa: 0.0 ± 0.0
Leu
6.815LeuAla: 6.815 ± 0.098
0.663LeuCys: 0.663 ± 0.026
4.669LeuAsp: 4.669 ± 0.081
6.282LeuGlu: 6.282 ± 0.107
4.572LeuPhe: 4.572 ± 0.098
5.819LeuGly: 5.819 ± 0.091
2.386LeuHis: 2.386 ± 0.052
6.51LeuIle: 6.51 ± 0.119
5.606LeuLys: 5.606 ± 0.078
9.406LeuLeu: 9.406 ± 0.148
2.629LeuMet: 2.629 ± 0.057
4.048LeuAsn: 4.048 ± 0.073
3.679LeuPro: 3.679 ± 0.059
4.501LeuGln: 4.501 ± 0.087
3.566LeuArg: 3.566 ± 0.068
6.456LeuSer: 6.456 ± 0.093
5.576LeuThr: 5.576 ± 0.073
5.84LeuVal: 5.84 ± 0.09
0.778LeuTrp: 0.778 ± 0.034
3.198LeuTyr: 3.198 ± 0.057
0.0LeuXaa: 0.0 ± 0.0
Met
2.063MetAla: 2.063 ± 0.048
0.172MetCys: 0.172 ± 0.014
1.785MetAsp: 1.785 ± 0.05
2.394MetGlu: 2.394 ± 0.052
1.172MetPhe: 1.172 ± 0.037
1.778MetGly: 1.778 ± 0.044
0.588MetHis: 0.588 ± 0.023
2.119MetIle: 2.119 ± 0.043
2.749MetLys: 2.749 ± 0.061
2.735MetLeu: 2.735 ± 0.063
1.015MetMet: 1.015 ± 0.032
1.819MetAsn: 1.819 ± 0.046
1.047MetPro: 1.047 ± 0.038
1.244MetGln: 1.244 ± 0.041
1.15MetArg: 1.15 ± 0.034
1.705MetSer: 1.705 ± 0.043
1.714MetThr: 1.714 ± 0.042
1.985MetVal: 1.985 ± 0.05
0.231MetTrp: 0.231 ± 0.014
0.861MetTyr: 0.861 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
2.785AsnAla: 2.785 ± 0.051
0.263AsnCys: 0.263 ± 0.016
2.398AsnAsp: 2.398 ± 0.05
3.681AsnGlu: 3.681 ± 0.072
1.34AsnPhe: 1.34 ± 0.037
3.311AsnGly: 3.311 ± 0.06
1.134AsnHis: 1.134 ± 0.037
3.261AsnIle: 3.261 ± 0.06
3.101AsnLys: 3.101 ± 0.065
3.207AsnLeu: 3.207 ± 0.059
1.292AsnMet: 1.292 ± 0.035
2.019AsnAsn: 2.019 ± 0.055
1.978AsnPro: 1.978 ± 0.049
2.192AsnGln: 2.192 ± 0.053
2.043AsnArg: 2.043 ± 0.052
2.032AsnSer: 2.032 ± 0.048
2.169AsnThr: 2.169 ± 0.055
3.337AsnVal: 3.337 ± 0.06
0.487AsnTrp: 0.487 ± 0.021
1.476AsnTyr: 1.476 ± 0.045
0.0AsnXaa: 0.0 ± 0.0
Pro
2.111ProAla: 2.111 ± 0.044
0.231ProCys: 0.231 ± 0.016
1.786ProAsp: 1.786 ± 0.049
2.683ProGlu: 2.683 ± 0.057
2.042ProPhe: 2.042 ± 0.043
2.112ProGly: 2.112 ± 0.058
0.852ProHis: 0.852 ± 0.035
3.008ProIle: 3.008 ± 0.063
2.108ProLys: 2.108 ± 0.051
3.379ProLeu: 3.379 ± 0.057
0.929ProMet: 0.929 ± 0.032
1.822ProAsn: 1.822 ± 0.042
0.986ProPro: 0.986 ± 0.043
1.217ProGln: 1.217 ± 0.038
1.06ProArg: 1.06 ± 0.031
2.386ProSer: 2.386 ± 0.052
2.108ProThr: 2.108 ± 0.055
2.731ProVal: 2.731 ± 0.053
0.373ProTrp: 0.373 ± 0.021
1.496ProTyr: 1.496 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
3.365GlnAla: 3.365 ± 0.068
0.283GlnCys: 0.283 ± 0.017
1.995GlnAsp: 1.995 ± 0.048
3.497GlnGlu: 3.497 ± 0.067
1.787GlnPhe: 1.787 ± 0.044
2.614GlnGly: 2.614 ± 0.059
1.222GlnHis: 1.222 ± 0.037
2.749GlnIle: 2.749 ± 0.055
2.735GlnLys: 2.735 ± 0.068
5.18GlnLeu: 5.18 ± 0.087
1.555GlnMet: 1.555 ± 0.047
1.615GlnAsn: 1.615 ± 0.048
1.522GlnPro: 1.522 ± 0.047
3.339GlnGln: 3.339 ± 0.084
1.884GlnArg: 1.884 ± 0.05
2.82GlnSer: 2.82 ± 0.06
2.484GlnThr: 2.484 ± 0.056
3.053GlnVal: 3.053 ± 0.059
0.564GlnTrp: 0.564 ± 0.025
1.751GlnTyr: 1.751 ± 0.046
0.0GlnXaa: 0.0 ± 0.0
Arg
2.501ArgAla: 2.501 ± 0.049
0.25ArgCys: 0.25 ± 0.016
1.95ArgAsp: 1.95 ± 0.046
3.085ArgGlu: 3.085 ± 0.065
1.866ArgPhe: 1.866 ± 0.047
2.222ArgGly: 2.222 ± 0.052
0.938ArgHis: 0.938 ± 0.033
3.141ArgIle: 3.141 ± 0.056
3.184ArgLys: 3.184 ± 0.072
3.841ArgLeu: 3.841 ± 0.083
1.391ArgMet: 1.391 ± 0.037
2.053ArgAsn: 2.053 ± 0.052
1.232ArgPro: 1.232 ± 0.036
1.706ArgGln: 1.706 ± 0.043
1.753ArgArg: 1.753 ± 0.048
2.218ArgSer: 2.218 ± 0.054
2.14ArgThr: 2.14 ± 0.054
2.736ArgVal: 2.736 ± 0.051
0.389ArgTrp: 0.389 ± 0.02
1.652ArgTyr: 1.652 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
3.452SerAla: 3.452 ± 0.058
0.406SerCys: 0.406 ± 0.023
2.665SerAsp: 2.665 ± 0.053
3.592SerGlu: 3.592 ± 0.069
3.297SerPhe: 3.297 ± 0.066
3.984SerGly: 3.984 ± 0.071
1.305SerHis: 1.305 ± 0.038
5.312SerIle: 5.312 ± 0.078
3.654SerLys: 3.654 ± 0.073
5.918SerLeu: 5.918 ± 0.083
1.875SerMet: 1.875 ± 0.044
2.717SerAsn: 2.717 ± 0.056
2.068SerPro: 2.068 ± 0.044
2.04SerGln: 2.04 ± 0.049
2.066SerArg: 2.066 ± 0.05
3.91SerSer: 3.91 ± 0.082
3.328SerThr: 3.328 ± 0.061
4.151SerVal: 4.151 ± 0.064
0.553SerTrp: 0.553 ± 0.027
2.338SerTyr: 2.338 ± 0.054
0.0SerXaa: 0.0 ± 0.0
Thr
3.575ThrAla: 3.575 ± 0.07
0.421ThrCys: 0.421 ± 0.023
2.663ThrAsp: 2.663 ± 0.061
3.63ThrGlu: 3.63 ± 0.068
2.854ThrPhe: 2.854 ± 0.061
3.903ThrGly: 3.903 ± 0.077
1.132ThrHis: 1.132 ± 0.038
5.007ThrIle: 5.007 ± 0.076
3.468ThrLys: 3.468 ± 0.063
5.574ThrLeu: 5.574 ± 0.077
1.486ThrMet: 1.486 ± 0.044
2.633ThrAsn: 2.633 ± 0.058
2.278ThrPro: 2.278 ± 0.054
1.728ThrGln: 1.728 ± 0.044
1.767ThrArg: 1.767 ± 0.05
3.55ThrSer: 3.55 ± 0.07
3.267ThrThr: 3.267 ± 0.066
4.081ThrVal: 4.081 ± 0.089
0.515ThrTrp: 0.515 ± 0.026
2.28ThrTyr: 2.28 ± 0.054
0.0ThrXaa: 0.0 ± 0.0
Val
5.227ValAla: 5.227 ± 0.092
0.619ValCys: 0.619 ± 0.027
3.865ValAsp: 3.865 ± 0.071
5.206ValGlu: 5.206 ± 0.084
2.958ValPhe: 2.958 ± 0.068
4.611ValGly: 4.611 ± 0.08
1.745ValHis: 1.745 ± 0.042
5.585ValIle: 5.585 ± 0.087
4.429ValLys: 4.429 ± 0.073
6.759ValLeu: 6.759 ± 0.094
2.211ValMet: 2.211 ± 0.054
3.061ValAsn: 3.061 ± 0.06
2.658ValPro: 2.658 ± 0.046
3.104ValGln: 3.104 ± 0.061
2.793ValArg: 2.793 ± 0.055
4.474ValSer: 4.474 ± 0.071
4.468ValThr: 4.468 ± 0.081
5.454ValVal: 5.454 ± 0.081
0.653ValTrp: 0.653 ± 0.026
2.488ValTyr: 2.488 ± 0.061
0.0ValXaa: 0.0 ± 0.0
Trp
0.488TrpAla: 0.488 ± 0.023
0.082TrpCys: 0.082 ± 0.011
0.506TrpAsp: 0.506 ± 0.024
0.675TrpGlu: 0.675 ± 0.029
0.538TrpPhe: 0.538 ± 0.025
0.618TrpGly: 0.618 ± 0.022
0.209TrpHis: 0.209 ± 0.016
0.884TrpIle: 0.884 ± 0.035
0.737TrpLys: 0.737 ± 0.029
1.158TrpLeu: 1.158 ± 0.037
0.353TrpMet: 0.353 ± 0.022
0.575TrpAsn: 0.575 ± 0.028
0.261TrpPro: 0.261 ± 0.016
0.4TrpGln: 0.4 ± 0.021
0.362TrpArg: 0.362 ± 0.02
0.559TrpSer: 0.559 ± 0.028
0.488TrpThr: 0.488 ± 0.024
0.637TrpVal: 0.637 ± 0.028
0.156TrpTrp: 0.156 ± 0.013
0.393TrpTyr: 0.393 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.25TyrAla: 2.25 ± 0.052
0.326TyrCys: 0.326 ± 0.02
2.138TyrAsp: 2.138 ± 0.045
2.984TyrGlu: 2.984 ± 0.068
1.762TyrPhe: 1.762 ± 0.049
2.455TyrGly: 2.455 ± 0.055
0.976TyrHis: 0.976 ± 0.039
2.544TyrIle: 2.544 ± 0.056
2.035TyrLys: 2.035 ± 0.049
3.186TyrLeu: 3.186 ± 0.06
0.998TyrMet: 0.998 ± 0.035
1.627TyrAsn: 1.627 ± 0.05
1.398TyrPro: 1.398 ± 0.041
1.674TyrGln: 1.674 ± 0.044
1.589TyrArg: 1.589 ± 0.036
1.999TyrSer: 1.999 ± 0.043
2.079TyrThr: 2.079 ± 0.048
2.697TyrVal: 2.697 ± 0.056
0.402TyrTrp: 0.402 ± 0.02
1.548TyrTyr: 1.548 ± 0.048
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3202 proteins (917159 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski