Amino acid dipeptide frequency for Pasteurellaceae bacterium 15-036681

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
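The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual code: it assumes sequences are given in one-letter codes, that dipeptides are counted as overlapping adjacent pairs within each protein, and that any non-standard letter maps to X (Xaa). The function names `dipeptide_permille`, `expected_uniform_permille`, and `classify` are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: dipeptides are overlapping adjacent pairs and never
    span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def expected_uniform_permille(n_residues=20):
    """Uniform expectation: 1000 / n^2 per mille (2.5 for 20 residues)."""
    return 1000.0 / n_residues ** 2


def classify(value_permille, n_residues=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_uniform_permille(n_residues)
    if value_permille > expected:
        return "over"   # shown in red on the page
    if value_permille < expected:
        return "under"  # shown in blue on the page
    return "equal"
```

For example, `classify(9.962)` labels the LeuLeu value from the table as overrepresented, while `classify(0.154)` labels CysCys as underrepresented. With 21 residue types (including Xaa), `expected_uniform_permille(21)` gives the expectation for 441 possibilities instead.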

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
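As a small worked example of the unit conversion, the helpers below turn a per mille value into a fraction or a percentage and compare it with the uniform expectation of 2.5‰. The function names are hypothetical and chosen only for illustration.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def permille_to_fraction(value_permille):
    """Multiply by 10^-3 to convert per mille to a plain fraction."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """One percent equals ten per mille."""
    return value_permille / 10.0


def fold_over_uniform(value_permille):
    """Ratio of an observed frequency to the uniform expectation."""
    return value_permille / UNIFORM_PERMILLE
```

For the LeuLeu entry (9.962‰), this gives a fraction of about 0.00996, i.e. roughly 1% of all dipeptides, which is close to four times the uniform expectation.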

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± estimated error). Rows: first amino acid; columns: second amino acid.

First residue | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.52 ± 0.154 | 1.026 ± 0.043 | 4.522 ± 0.11 | 6.475 ± 0.106 | 3.396 ± 0.079 | 5.639 ± 0.127 | 1.484 ± 0.047 | 6.253 ± 0.111 | 5.948 ± 0.098 | 8.986 ± 0.137 | 2.341 ± 0.065 | 4.033 ± 0.101 | 2.591 ± 0.097 | 4.017 ± 0.088 | 3.303 ± 0.081 | 4.286 ± 0.081 | 4.856 ± 0.133 | 6.06 ± 0.085 | 0.746 ± 0.028 | 2.326 ± 0.056 | 0.0 ± 0.0
Cys | 0.715 ± 0.033 | 0.154 ± 0.017 | 0.557 ± 0.029 | 0.588 ± 0.029 | 0.447 ± 0.026 | 0.897 ± 0.034 | 0.276 ± 0.02 | 0.553 ± 0.03 | 0.457 ± 0.03 | 0.914 ± 0.036 | 0.158 ± 0.015 | 0.357 ± 0.022 | 0.42 ± 0.03 | 0.408 ± 0.025 | 0.415 ± 0.026 | 0.633 ± 0.033 | 0.45 ± 0.028 | 0.672 ± 0.032 | 0.118 ± 0.012 | 0.345 ± 0.02 | 0.0 ± 0.0
Asp | 3.595 ± 0.089 | 0.493 ± 0.027 | 2.579 ± 0.082 | 3.805 ± 0.093 | 2.537 ± 0.057 | 3.664 ± 0.155 | 0.849 ± 0.035 | 3.794 ± 0.077 | 3.686 ± 0.089 | 5.122 ± 0.085 | 1.141 ± 0.037 | 2.619 ± 0.073 | 2.11 ± 0.08 | 1.76 ± 0.049 | 2.184 ± 0.065 | 2.844 ± 0.065 | 2.461 ± 0.09 | 3.729 ± 0.084 | 0.695 ± 0.03 | 2.179 ± 0.057 | 0.0 ± 0.0
Glu | 4.694 ± 0.101 | 0.505 ± 0.032 | 2.799 ± 0.07 | 3.853 ± 0.095 | 2.531 ± 0.066 | 3.634 ± 0.076 | 1.295 ± 0.043 | 4.928 ± 0.085 | 4.971 ± 0.094 | 7.008 ± 0.13 | 1.838 ± 0.047 | 3.572 ± 0.069 | 2.075 ± 0.087 | 4.596 ± 0.103 | 3.298 ± 0.078 | 3.156 ± 0.063 | 3.158 ± 0.07 | 4.307 ± 0.089 | 0.866 ± 0.035 | 1.915 ± 0.053 | 0.0 ± 0.0
Phe | 3.796 ± 0.08 | 0.53 ± 0.029 | 2.846 ± 0.065 | 2.711 ± 0.07 | 1.875 ± 0.055 | 3.419 ± 0.084 | 0.812 ± 0.037 | 3.187 ± 0.082 | 2.258 ± 0.059 | 3.713 ± 0.086 | 0.949 ± 0.038 | 2.214 ± 0.058 | 1.491 ± 0.044 | 1.374 ± 0.052 | 1.493 ± 0.044 | 3.257 ± 0.078 | 2.302 ± 0.064 | 2.987 ± 0.068 | 0.537 ± 0.027 | 1.491 ± 0.047 | 0.0 ± 0.0
Gly | 5.22 ± 0.099 | 0.778 ± 0.045 | 3.394 ± 0.113 | 4.361 ± 0.092 | 3.213 ± 0.077 | 4.694 ± 0.092 | 1.214 ± 0.044 | 5.232 ± 0.087 | 5.111 ± 0.121 | 6.695 ± 0.123 | 1.81 ± 0.053 | 3.172 ± 0.127 | 1.146 ± 0.047 | 2.412 ± 0.066 | 2.833 ± 0.072 | 4.046 ± 0.108 | 3.615 ± 0.115 | 5.117 ± 0.106 | 0.912 ± 0.032 | 2.5 ± 0.057 | 0.0 ± 0.0
His | 1.187 ± 0.042 | 0.314 ± 0.022 | 0.835 ± 0.035 | 0.831 ± 0.038 | 1.047 ± 0.038 | 1.216 ± 0.043 | 0.596 ± 0.04 | 1.419 ± 0.045 | 1.165 ± 0.038 | 2.156 ± 0.061 | 0.323 ± 0.018 | 1.076 ± 0.042 | 1.003 ± 0.036 | 1.112 ± 0.043 | 0.95 ± 0.039 | 1.358 ± 0.046 | 1.073 ± 0.039 | 0.847 ± 0.033 | 0.276 ± 0.019 | 0.866 ± 0.034 | 0.0 ± 0.0
Ile | 6.863 ± 0.116 | 0.757 ± 0.035 | 4.073 ± 0.075 | 4.956 ± 0.081 | 2.85 ± 0.072 | 5.134 ± 0.082 | 1.347 ± 0.044 | 4.432 ± 0.091 | 3.83 ± 0.084 | 6.083 ± 0.101 | 1.365 ± 0.05 | 3.288 ± 0.074 | 2.737 ± 0.068 | 2.783 ± 0.067 | 2.899 ± 0.075 | 4.729 ± 0.099 | 3.971 ± 0.087 | 4.567 ± 0.091 | 0.639 ± 0.032 | 2.049 ± 0.052 | 0.0 ± 0.0
Lys | 5.953 ± 0.121 | 0.378 ± 0.026 | 3.331 ± 0.108 | 4.206 ± 0.092 | 2.145 ± 0.055 | 4.168 ± 0.095 | 1.166 ± 0.046 | 3.945 ± 0.083 | 3.677 ± 0.09 | 6.144 ± 0.107 | 1.822 ± 0.051 | 2.958 ± 0.078 | 2.398 ± 0.074 | 3.467 ± 0.079 | 2.791 ± 0.06 | 3.38 ± 0.071 | 3.45 ± 0.073 | 4.436 ± 0.085 | 0.661 ± 0.028 | 1.693 ± 0.051 | 0.0 ± 0.0
Leu | 9.705 ± 0.142 | 0.92 ± 0.039 | 5.536 ± 0.089 | 6.089 ± 0.11 | 4.43 ± 0.094 | 6.866 ± 0.122 | 1.842 ± 0.053 | 6.674 ± 0.123 | 5.88 ± 0.09 | 9.962 ± 0.165 | 2.326 ± 0.072 | 5.197 ± 0.11 | 4.406 ± 0.097 | 4.023 ± 0.102 | 4.302 ± 0.1 | 7.143 ± 0.102 | 5.806 ± 0.094 | 6.599 ± 0.102 | 1.004 ± 0.04 | 2.625 ± 0.06 | 0.0 ± 0.0
Met | 2.331 ± 0.064 | 0.193 ± 0.017 | 1.008 ± 0.041 | 1.179 ± 0.041 | 0.953 ± 0.036 | 1.634 ± 0.051 | 0.389 ± 0.023 | 1.539 ± 0.052 | 1.57 ± 0.047 | 2.546 ± 0.062 | 0.726 ± 0.032 | 1.189 ± 0.04 | 1.081 ± 0.04 | 1.195 ± 0.038 | 1.077 ± 0.041 | 1.592 ± 0.045 | 1.364 ± 0.041 | 1.497 ± 0.04 | 0.237 ± 0.019 | 0.481 ± 0.024 | 0.0 ± 0.0
Asn | 4.096 ± 0.109 | 0.468 ± 0.029 | 2.254 ± 0.064 | 2.835 ± 0.067 | 1.965 ± 0.058 | 3.51 ± 0.106 | 0.999 ± 0.039 | 3.519 ± 0.087 | 2.857 ± 0.068 | 4.917 ± 0.096 | 1.033 ± 0.037 | 2.539 ± 0.081 | 2.549 ± 0.075 | 2.508 ± 0.082 | 2.18 ± 0.056 | 2.964 ± 0.084 | 2.322 ± 0.077 | 3.176 ± 0.084 | 0.647 ± 0.032 | 1.738 ± 0.049 | 0.0 ± 0.0
Pro | 2.73 ± 0.094 | 0.27 ± 0.02 | 1.984 ± 0.08 | 3.35 ± 0.078 | 1.822 ± 0.056 | 1.327 ± 0.055 | 0.804 ± 0.036 | 2.638 ± 0.07 | 2.284 ± 0.074 | 3.623 ± 0.059 | 0.943 ± 0.037 | 2.249 ± 0.069 | 0.964 ± 0.042 | 1.769 ± 0.053 | 1.318 ± 0.047 | 2.1 ± 0.064 | 2.442 ± 0.08 | 2.764 ± 0.084 | 0.351 ± 0.025 | 1.337 ± 0.045 | 0.0 ± 0.0
Gln | 4.524 ± 0.097 | 0.327 ± 0.024 | 2.042 ± 0.057 | 2.453 ± 0.061 | 2.115 ± 0.064 | 3.045 ± 0.086 | 1.089 ± 0.046 | 3.248 ± 0.077 | 2.86 ± 0.07 | 5.109 ± 0.114 | 1.061 ± 0.043 | 2.329 ± 0.077 | 1.861 ± 0.06 | 3.713 ± 0.147 | 2.233 ± 0.066 | 2.589 ± 0.077 | 2.465 ± 0.07 | 2.835 ± 0.058 | 0.63 ± 0.031 | 1.504 ± 0.054 | 0.0 ± 0.0
Arg | 2.946 ± 0.081 | 0.4 ± 0.024 | 2.203 ± 0.059 | 3.004 ± 0.093 | 2.229 ± 0.064 | 2.539 ± 0.066 | 0.93 ± 0.04 | 3.079 ± 0.065 | 2.65 ± 0.069 | 4.705 ± 0.102 | 0.977 ± 0.035 | 2.093 ± 0.053 | 1.514 ± 0.051 | 2.042 ± 0.057 | 2.049 ± 0.064 | 2.399 ± 0.065 | 2.041 ± 0.054 | 2.872 ± 0.069 | 0.496 ± 0.024 | 1.706 ± 0.056 | 0.0 ± 0.0
Ser | 5.157 ± 0.095 | 0.505 ± 0.029 | 3.171 ± 0.064 | 3.849 ± 0.068 | 2.677 ± 0.062 | 4.507 ± 0.086 | 1.341 ± 0.043 | 3.817 ± 0.085 | 3.359 ± 0.074 | 6.325 ± 0.101 | 1.369 ± 0.041 | 2.691 ± 0.063 | 2.266 ± 0.055 | 2.961 ± 0.065 | 2.517 ± 0.062 | 3.775 ± 0.086 | 3.177 ± 0.086 | 4.194 ± 0.093 | 0.662 ± 0.029 | 2.019 ± 0.062 | 0.0 ± 0.0
Thr | 4.961 ± 0.134 | 0.403 ± 0.02 | 2.799 ± 0.09 | 3.456 ± 0.078 | 2.242 ± 0.059 | 3.972 ± 0.104 | 1.07 ± 0.042 | 3.688 ± 0.089 | 2.984 ± 0.07 | 5.871 ± 0.11 | 1.079 ± 0.037 | 2.403 ± 0.083 | 2.366 ± 0.072 | 2.507 ± 0.066 | 1.964 ± 0.049 | 3.023 ± 0.086 | 3.08 ± 0.111 | 4.053 ± 0.153 | 0.503 ± 0.024 | 1.497 ± 0.046 | 0.0 ± 0.0
Val | 6.516 ± 0.095 | 0.624 ± 0.031 | 3.755 ± 0.082 | 4.826 ± 0.104 | 2.625 ± 0.068 | 4.745 ± 0.103 | 1.11 ± 0.04 | 4.792 ± 0.088 | 4.329 ± 0.102 | 6.466 ± 0.124 | 1.681 ± 0.057 | 3.192 ± 0.082 | 2.525 ± 0.065 | 2.498 ± 0.059 | 2.85 ± 0.063 | 4.478 ± 0.093 | 3.872 ± 0.162 | 5.27 ± 0.102 | 0.66 ± 0.03 | 1.889 ± 0.048 | 0.0 ± 0.0
Trp | 0.816 ± 0.035 | 0.143 ± 0.013 | 0.527 ± 0.03 | 0.592 ± 0.025 | 0.553 ± 0.023 | 0.735 ± 0.034 | 0.266 ± 0.02 | 0.753 ± 0.032 | 0.719 ± 0.03 | 1.554 ± 0.046 | 0.215 ± 0.018 | 0.52 ± 0.025 | 0.15 ± 0.014 | 0.9 ± 0.039 | 0.524 ± 0.029 | 0.581 ± 0.029 | 0.457 ± 0.026 | 0.808 ± 0.036 | 0.138 ± 0.013 | 0.288 ± 0.021 | 0.0 ± 0.0
Tyr | 2.481 ± 0.056 | 0.342 ± 0.023 | 1.627 ± 0.05 | 1.58 ± 0.05 | 1.589 ± 0.056 | 2.129 ± 0.058 | 0.772 ± 0.034 | 1.864 ± 0.049 | 1.575 ± 0.048 | 3.437 ± 0.08 | 0.626 ± 0.028 | 1.385 ± 0.05 | 1.419 ± 0.048 | 1.985 ± 0.059 | 1.695 ± 0.05 | 2.008 ± 0.057 | 1.56 ± 0.049 | 1.911 ± 0.06 | 0.464 ± 0.023 | 1.103 ± 0.042 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2341 proteins (739913 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
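One way such a protein-level bootstrap could look is sketched below. This is an illustration under stated assumptions, not the procedure used by the database: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function name `bootstrap_dipeptide_error` and its parameters are hypothetical.

```python
import random
import statistics
from collections import Counter


def bootstrap_dipeptide_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so all dipeptides of a protein enter or leave the sample together.
    """
    rng = random.Random(seed)
    # Count overlapping adjacent pairs once per protein and reuse the counts.
    per_protein = [Counter(a + b for a, b in zip(s, s[1:])) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual dipeptides keeps the dependence between residues of the same protein intact, which is the usual reason for bootstrapping at that level. A fixed `seed` makes the sketch reproducible.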

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski