Amino acid dipepetide frequency for Sodalis praecaptivus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.718AlaAla: 13.718 ± 0.152
1.225AlaCys: 1.225 ± 0.031
5.657AlaAsp: 5.657 ± 0.077
5.791AlaGlu: 5.791 ± 0.074
3.761AlaPhe: 3.761 ± 0.052
9.289AlaGly: 9.289 ± 0.1
2.199AlaHis: 2.199 ± 0.039
6.035AlaIle: 6.035 ± 0.066
3.24AlaLys: 3.24 ± 0.06
14.417AlaLeu: 14.417 ± 0.13
3.106AlaMet: 3.106 ± 0.045
3.028AlaAsn: 3.028 ± 0.048
4.846AlaPro: 4.846 ± 0.079
5.178AlaGln: 5.178 ± 0.076
7.305AlaArg: 7.305 ± 0.082
5.778AlaSer: 5.778 ± 0.069
5.273AlaThr: 5.273 ± 0.071
7.539AlaVal: 7.539 ± 0.079
1.577AlaTrp: 1.577 ± 0.038
2.332AlaTyr: 2.332 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
1.15CysAla: 1.15 ± 0.031
0.186CysCys: 0.186 ± 0.011
0.6CysAsp: 0.6 ± 0.022
0.514CysGlu: 0.514 ± 0.018
0.411CysPhe: 0.411 ± 0.014
1.116CysGly: 1.116 ± 0.03
0.368CysHis: 0.368 ± 0.017
0.516CysIle: 0.516 ± 0.018
0.249CysLys: 0.249 ± 0.014
1.201CysLeu: 1.201 ± 0.031
0.219CysMet: 0.219 ± 0.013
0.288CysAsn: 0.288 ± 0.014
0.554CysPro: 0.554 ± 0.021
0.469CysGln: 0.469 ± 0.017
0.824CysArg: 0.824 ± 0.028
0.601CysSer: 0.601 ± 0.023
0.417CysThr: 0.417 ± 0.019
0.669CysVal: 0.669 ± 0.022
0.193CysTrp: 0.193 ± 0.012
0.368CysTyr: 0.368 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
5.801AspAla: 5.801 ± 0.07
0.541AspCys: 0.541 ± 0.022
3.024AspAsp: 3.024 ± 0.053
3.152AspGlu: 3.152 ± 0.054
2.091AspPhe: 2.091 ± 0.043
4.086AspGly: 4.086 ± 0.06
1.082AspHis: 1.082 ± 0.031
3.468AspIle: 3.468 ± 0.05
2.049AspLys: 2.049 ± 0.045
4.523AspLeu: 4.523 ± 0.057
1.299AspMet: 1.299 ± 0.033
2.095AspAsn: 2.095 ± 0.042
2.477AspPro: 2.477 ± 0.041
1.574AspGln: 1.574 ± 0.037
3.258AspArg: 3.258 ± 0.051
2.714AspSer: 2.714 ± 0.046
2.401AspThr: 2.401 ± 0.046
3.625AspVal: 3.625 ± 0.043
0.789AspTrp: 0.789 ± 0.025
1.904AspTyr: 1.904 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
5.178GluAla: 5.178 ± 0.066
0.416GluCys: 0.416 ± 0.018
2.231GluAsp: 2.231 ± 0.048
2.751GluGlu: 2.751 ± 0.059
1.563GluPhe: 1.563 ± 0.036
3.341GluGly: 3.341 ± 0.053
1.256GluHis: 1.256 ± 0.03
2.89GluIle: 2.89 ± 0.047
2.446GluLys: 2.446 ± 0.052
5.096GluLeu: 5.096 ± 0.067
1.535GluMet: 1.535 ± 0.035
1.918GluAsn: 1.918 ± 0.041
2.049GluPro: 2.049 ± 0.038
3.025GluGln: 3.025 ± 0.052
3.89GluArg: 3.89 ± 0.058
2.467GluSer: 2.467 ± 0.046
2.77GluThr: 2.77 ± 0.049
3.423GluVal: 3.423 ± 0.051
0.667GluTrp: 0.667 ± 0.022
1.216GluTyr: 1.216 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
3.896PheAla: 3.896 ± 0.051
0.538PheCys: 0.538 ± 0.02
2.253PheAsp: 2.253 ± 0.045
1.409PheGlu: 1.409 ± 0.03
1.549PhePhe: 1.549 ± 0.04
2.983PheGly: 2.983 ± 0.048
0.851PheHis: 0.851 ± 0.026
2.484PheIle: 2.484 ± 0.04
1.024PheLys: 1.024 ± 0.033
3.273PheLeu: 3.273 ± 0.061
0.89PheMet: 0.89 ± 0.028
1.49PheAsn: 1.49 ± 0.031
1.629PhePro: 1.629 ± 0.035
1.084PheGln: 1.084 ± 0.027
1.882PheArg: 1.882 ± 0.04
2.861PheSer: 2.861 ± 0.051
2.266PheThr: 2.266 ± 0.04
2.328PheVal: 2.328 ± 0.046
0.556PheTrp: 0.556 ± 0.024
1.141PheTyr: 1.141 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
7.548GlyAla: 7.548 ± 0.083
1.011GlyCys: 1.011 ± 0.029
3.856GlyAsp: 3.856 ± 0.058
4.539GlyGlu: 4.539 ± 0.062
3.185GlyPhe: 3.185 ± 0.046
5.988GlyGly: 5.988 ± 0.075
1.798GlyHis: 1.798 ± 0.036
4.875GlyIle: 4.875 ± 0.066
3.376GlyLys: 3.376 ± 0.053
8.043GlyLeu: 8.043 ± 0.08
2.342GlyMet: 2.342 ± 0.044
2.581GlyAsn: 2.581 ± 0.042
2.396GlyPro: 2.396 ± 0.045
3.308GlyGln: 3.308 ± 0.05
4.592GlyArg: 4.592 ± 0.065
4.122GlySer: 4.122 ± 0.05
3.697GlyThr: 3.697 ± 0.052
5.91GlyVal: 5.91 ± 0.08
1.326GlyTrp: 1.326 ± 0.034
2.538GlyTyr: 2.538 ± 0.045
0.0GlyXaa: 0.0 ± 0.0
His
2.229HisAla: 2.229 ± 0.037
0.358HisCys: 0.358 ± 0.015
1.254HisAsp: 1.254 ± 0.034
1.003HisGlu: 1.003 ± 0.029
1.069HisPhe: 1.069 ± 0.031
1.884HisGly: 1.884 ± 0.037
0.835HisHis: 0.835 ± 0.028
1.335HisIle: 1.335 ± 0.03
0.652HisLys: 0.652 ± 0.022
2.463HisLeu: 2.463 ± 0.042
0.518HisMet: 0.518 ± 0.019
0.784HisAsn: 0.784 ± 0.025
1.479HisPro: 1.479 ± 0.04
1.284HisGln: 1.284 ± 0.031
1.532HisArg: 1.532 ± 0.034
1.218HisSer: 1.218 ± 0.028
1.035HisThr: 1.035 ± 0.023
1.334HisVal: 1.334 ± 0.031
0.417HisTrp: 0.417 ± 0.02
1.024HisTyr: 1.024 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
6.853IleAla: 6.853 ± 0.077
0.609IleCys: 0.609 ± 0.025
3.466IleAsp: 3.466 ± 0.053
2.816IleGlu: 2.816 ± 0.041
1.809IlePhe: 1.809 ± 0.038
4.666IleGly: 4.666 ± 0.061
1.194IleHis: 1.194 ± 0.03
3.303IleIle: 3.303 ± 0.053
2.007IleLys: 2.007 ± 0.046
4.737IleLeu: 4.737 ± 0.06
1.21IleMet: 1.21 ± 0.029
2.298IleAsn: 2.298 ± 0.037
2.603IlePro: 2.603 ± 0.042
1.681IleGln: 1.681 ± 0.034
3.007IleArg: 3.007 ± 0.047
3.393IleSer: 3.393 ± 0.058
3.396IleThr: 3.396 ± 0.054
3.653IleVal: 3.653 ± 0.063
0.563IleTrp: 0.563 ± 0.023
1.434IleTyr: 1.434 ± 0.032
0.0IleXaa: 0.0 ± 0.0
Lys
3.455LysAla: 3.455 ± 0.064
0.188LysCys: 0.188 ± 0.013
1.551LysAsp: 1.551 ± 0.039
1.714LysGlu: 1.714 ± 0.045
0.902LysPhe: 0.902 ± 0.027
2.364LysGly: 2.364 ± 0.048
0.768LysHis: 0.768 ± 0.025
2.046LysIle: 2.046 ± 0.041
1.626LysLys: 1.626 ± 0.047
3.402LysLeu: 3.402 ± 0.061
0.97LysMet: 0.97 ± 0.027
1.268LysAsn: 1.268 ± 0.032
1.717LysPro: 1.717 ± 0.042
1.649LysGln: 1.649 ± 0.039
2.341LysArg: 2.341 ± 0.047
1.856LysSer: 1.856 ± 0.04
2.066LysThr: 2.066 ± 0.041
2.378LysVal: 2.378 ± 0.044
0.341LysTrp: 0.341 ± 0.015
0.807LysTyr: 0.807 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
13.773LeuAla: 13.773 ± 0.121
1.443LeuCys: 1.443 ± 0.035
5.617LeuAsp: 5.617 ± 0.07
5.086LeuGlu: 5.086 ± 0.062
4.186LeuPhe: 4.186 ± 0.061
7.914LeuGly: 7.914 ± 0.088
2.448LeuHis: 2.448 ± 0.05
5.725LeuIle: 5.725 ± 0.078
3.621LeuLys: 3.621 ± 0.065
13.157LeuLeu: 13.157 ± 0.16
2.993LeuMet: 2.993 ± 0.048
3.777LeuAsn: 3.777 ± 0.059
6.528LeuPro: 6.528 ± 0.085
4.344LeuGln: 4.344 ± 0.063
7.206LeuArg: 7.206 ± 0.083
7.65LeuSer: 7.65 ± 0.079
6.802LeuThr: 6.802 ± 0.074
7.104LeuVal: 7.104 ± 0.077
1.398LeuTrp: 1.398 ± 0.034
2.786LeuTyr: 2.786 ± 0.044
0.0LeuXaa: 0.0 ± 0.0
Met
3.058MetAla: 3.058 ± 0.052
0.189MetCys: 0.189 ± 0.012
1.104MetAsp: 1.104 ± 0.026
1.044MetGlu: 1.044 ± 0.031
0.776MetPhe: 0.776 ± 0.029
1.751MetGly: 1.751 ± 0.039
0.499MetHis: 0.499 ± 0.018
1.45MetIle: 1.45 ± 0.034
1.136MetLys: 1.136 ± 0.029
3.093MetLeu: 3.093 ± 0.054
0.826MetMet: 0.826 ± 0.025
0.973MetAsn: 0.973 ± 0.031
1.299MetPro: 1.299 ± 0.032
1.099MetGln: 1.099 ± 0.024
1.553MetArg: 1.553 ± 0.033
1.662MetSer: 1.662 ± 0.03
1.809MetThr: 1.809 ± 0.035
1.808MetVal: 1.808 ± 0.034
0.201MetTrp: 0.201 ± 0.012
0.414MetTyr: 0.414 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
3.512AsnAla: 3.512 ± 0.056
0.292AsnCys: 0.292 ± 0.015
1.864AsnAsp: 1.864 ± 0.038
1.552AsnGlu: 1.552 ± 0.036
1.137AsnPhe: 1.137 ± 0.031
2.72AsnGly: 2.72 ± 0.047
0.769AsnHis: 0.769 ± 0.023
2.01AsnIle: 2.01 ± 0.043
1.171AsnLys: 1.171 ± 0.033
3.163AsnLeu: 3.163 ± 0.052
0.751AsnMet: 0.751 ± 0.021
1.287AsnAsn: 1.287 ± 0.032
1.899AsnPro: 1.899 ± 0.042
1.395AsnGln: 1.395 ± 0.031
2.08AsnArg: 2.08 ± 0.037
1.65AsnSer: 1.65 ± 0.04
1.684AsnThr: 1.684 ± 0.037
2.217AsnVal: 2.217 ± 0.042
0.411AsnTrp: 0.411 ± 0.019
1.037AsnTyr: 1.037 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
6.283ProAla: 6.283 ± 0.086
0.462ProCys: 0.462 ± 0.017
2.944ProAsp: 2.944 ± 0.05
2.969ProGlu: 2.969 ± 0.051
1.856ProPhe: 1.856 ± 0.034
3.841ProGly: 3.841 ± 0.056
1.184ProHis: 1.184 ± 0.032
1.936ProIle: 1.936 ± 0.041
1.174ProLys: 1.174 ± 0.03
5.974ProLeu: 5.974 ± 0.076
1.093ProMet: 1.093 ± 0.029
1.196ProAsn: 1.196 ± 0.03
2.525ProPro: 2.525 ± 0.07
2.263ProGln: 2.263 ± 0.046
2.551ProArg: 2.551 ± 0.044
2.593ProSer: 2.593 ± 0.047
2.301ProThr: 2.301 ± 0.037
3.887ProVal: 3.887 ± 0.055
0.784ProTrp: 0.784 ± 0.026
1.301ProTyr: 1.301 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
5.403GlnAla: 5.403 ± 0.075
0.371GlnCys: 0.371 ± 0.016
2.016GlnAsp: 2.016 ± 0.038
1.98GlnGlu: 1.98 ± 0.047
1.334GlnPhe: 1.334 ± 0.03
3.425GlnGly: 3.425 ± 0.055
1.259GlnHis: 1.259 ± 0.031
2.031GlnIle: 2.031 ± 0.042
1.349GlnLys: 1.349 ± 0.032
4.965GlnLeu: 4.965 ± 0.067
1.088GlnMet: 1.088 ± 0.025
1.181GlnAsn: 1.181 ± 0.029
2.468GlnPro: 2.468 ± 0.048
3.267GlnGln: 3.267 ± 0.063
3.726GlnArg: 3.726 ± 0.064
2.331GlnSer: 2.331 ± 0.041
2.305GlnThr: 2.305 ± 0.044
2.997GlnVal: 2.997 ± 0.044
0.649GlnTrp: 0.649 ± 0.024
1.024GlnTyr: 1.024 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
6.24ArgAla: 6.24 ± 0.068
0.726ArgCys: 0.726 ± 0.025
3.386ArgAsp: 3.386 ± 0.053
3.749ArgGlu: 3.749 ± 0.057
2.667ArgPhe: 2.667 ± 0.046
4.092ArgGly: 4.092 ± 0.054
2.035ArgHis: 2.035 ± 0.039
3.376ArgIle: 3.376 ± 0.046
2.004ArgLys: 2.004 ± 0.038
8.126ArgLeu: 8.126 ± 0.108
1.573ArgMet: 1.573 ± 0.038
1.828ArgAsn: 1.828 ± 0.037
2.898ArgPro: 2.898 ± 0.045
4.178ArgGln: 4.178 ± 0.065
5.054ArgArg: 5.054 ± 0.071
2.91ArgSer: 2.91 ± 0.047
2.621ArgThr: 2.621 ± 0.05
4.292ArgVal: 4.292 ± 0.059
1.157ArgTrp: 1.157 ± 0.033
2.396ArgTyr: 2.396 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
6.418SerAla: 6.418 ± 0.077
0.533SerCys: 0.533 ± 0.021
2.881SerAsp: 2.881 ± 0.055
2.691SerGlu: 2.691 ± 0.045
2.035SerPhe: 2.035 ± 0.041
5.146SerGly: 5.146 ± 0.06
1.424SerHis: 1.424 ± 0.029
2.611SerIle: 2.611 ± 0.047
1.54SerLys: 1.54 ± 0.036
6.99SerLeu: 6.99 ± 0.073
1.347SerMet: 1.347 ± 0.032
1.539SerAsn: 1.539 ± 0.034
2.872SerPro: 2.872 ± 0.049
2.408SerGln: 2.408 ± 0.048
3.641SerArg: 3.641 ± 0.052
3.306SerSer: 3.306 ± 0.069
2.742SerThr: 2.742 ± 0.048
3.977SerVal: 3.977 ± 0.043
0.806SerTrp: 0.806 ± 0.024
1.569SerTyr: 1.569 ± 0.037
0.0SerXaa: 0.0 ± 0.0
Thr
5.482ThrAla: 5.482 ± 0.067
0.484ThrCys: 0.484 ± 0.019
2.493ThrAsp: 2.493 ± 0.053
2.307ThrGlu: 2.307 ± 0.037
1.917ThrPhe: 1.917 ± 0.036
4.501ThrGly: 4.501 ± 0.064
1.254ThrHis: 1.254 ± 0.032
2.339ThrIle: 2.339 ± 0.045
1.148ThrLys: 1.148 ± 0.031
8.156ThrLeu: 8.156 ± 0.084
1.011ThrMet: 1.011 ± 0.027
1.234ThrAsn: 1.234 ± 0.035
3.537ThrPro: 3.537 ± 0.05
1.989ThrGln: 1.989 ± 0.041
3.322ThrArg: 3.322 ± 0.05
2.723ThrSer: 2.723 ± 0.047
2.728ThrThr: 2.728 ± 0.053
3.866ThrVal: 3.866 ± 0.054
0.651ThrTrp: 0.651 ± 0.022
1.139ThrTyr: 1.139 ± 0.03
0.0ThrXaa: 0.0 ± 0.0
Val
7.777ValAla: 7.777 ± 0.09
0.759ValCys: 0.759 ± 0.026
3.64ValAsp: 3.64 ± 0.052
3.511ValGlu: 3.511 ± 0.054
2.436ValPhe: 2.436 ± 0.037
4.854ValGly: 4.854 ± 0.067
1.207ValHis: 1.207 ± 0.026
4.35ValIle: 4.35 ± 0.058
2.573ValLys: 2.573 ± 0.045
7.222ValLeu: 7.222 ± 0.082
2.121ValMet: 2.121 ± 0.043
2.614ValAsn: 2.614 ± 0.045
3.208ValPro: 3.208 ± 0.042
2.272ValGln: 2.272 ± 0.046
3.913ValArg: 3.913 ± 0.058
4.366ValSer: 4.366 ± 0.052
4.126ValThr: 4.126 ± 0.055
5.347ValVal: 5.347 ± 0.071
0.881ValTrp: 0.881 ± 0.028
1.684ValTyr: 1.684 ± 0.035
0.0ValXaa: 0.0 ± 0.0
Trp
0.969TrpAla: 0.969 ± 0.028
0.182TrpCys: 0.182 ± 0.011
0.584TrpAsp: 0.584 ± 0.02
0.514TrpGlu: 0.514 ± 0.02
0.561TrpPhe: 0.561 ± 0.021
0.856TrpGly: 0.856 ± 0.027
0.485TrpHis: 0.485 ± 0.02
0.628TrpIle: 0.628 ± 0.021
0.349TrpLys: 0.349 ± 0.016
2.316TrpLeu: 2.316 ± 0.05
0.346TrpMet: 0.346 ± 0.017
0.375TrpAsn: 0.375 ± 0.017
0.708TrpPro: 0.708 ± 0.026
1.088TrpGln: 1.088 ± 0.03
1.358TrpArg: 1.358 ± 0.039
0.763TrpSer: 0.763 ± 0.024
0.506TrpThr: 0.506 ± 0.022
0.863TrpVal: 0.863 ± 0.028
0.201TrpTrp: 0.201 ± 0.013
0.351TrpTyr: 0.351 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.551TyrAla: 2.551 ± 0.042
0.402TyrCys: 0.402 ± 0.017
1.533TyrAsp: 1.533 ± 0.043
1.035TyrGlu: 1.035 ± 0.029
1.142TyrPhe: 1.142 ± 0.031
2.213TyrGly: 2.213 ± 0.042
0.794TyrHis: 0.794 ± 0.025
1.289TyrIle: 1.289 ± 0.031
0.676TyrLys: 0.676 ± 0.024
3.242TyrLeu: 3.242 ± 0.055
0.507TyrMet: 0.507 ± 0.019
0.848TyrAsn: 0.848 ± 0.023
1.424TyrPro: 1.424 ± 0.035
1.611TyrGln: 1.611 ± 0.039
2.273TyrArg: 2.273 ± 0.042
1.513TyrSer: 1.513 ± 0.037
1.336TyrThr: 1.336 ± 0.034
1.642TyrVal: 1.642 ± 0.035
0.434TyrTrp: 0.434 ± 0.02
0.901TyrTyr: 0.901 ± 0.03
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4281 proteins (1400054 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski