Amino acid dipeptide frequency for Peptostreptococcus russellii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and underrepresented ones are marked in blue, in the table.
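The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: it assumes one-letter sequences, counts overlapping residue pairs within each protein, maps any non-standard residue to X (Xaa), and normalizes by the total number of pairs. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumption: the table
# itself uses three-letter codes; one-letter codes keep the sketch short).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Pairs are taken within one protein; they never span two proteins.
        for i in range(len(seq) - 1):
            a, b = seq[i], seq[i + 1]
            # Any non-standard or unknown residue is counted as X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation for the 400 standard dipeptides, in per mille.
EXPECTED_UNIFORM = 1000.0 / 400  # 0.25% = 2.5 per mille

freqs = dipeptide_per_mille(["MKKLAE", "MKEIL"])  # toy sequences, not real data
overrepresented = sorted(dp for dp, f in freqs.items() if f > EXPECTED_UNIFORM)
```

With real data, `sequences` would be the protein sequences of the proteome; comparing each value with `EXPECTED_UNIFORM` mirrors the red/blue marking, under the assumption that the uniform 0.25% expectation is the threshold used.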

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
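As a worked example of the unit conversion, the short Python sketch below converts the AlaAla value from the table (4.096‰) to a fraction and to percent, and compares it with the 0.25% (2.5‰) expected under a uniform distribution. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.096           # AlaAla value from the table, in per mille
expected_uniform = 2.5    # 0.25% expressed in per mille

fraction = per_mille_to_fraction(ala_ala)   # about 0.004096
percent = per_mille_to_percent(ala_ala)     # about 0.4096 %
ratio = ala_ala / expected_uniform          # above 1 means overrepresented
```

A ratio above 1 corresponds to a dipeptide that occurs more often than the uniform expectation, and a ratio below 1 to one that is underrepresented.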

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.096AlaAla: 4.096 ± 0.109
0.755AlaCys: 0.755 ± 0.037
3.264AlaAsp: 3.264 ± 0.086
4.084AlaGlu: 4.084 ± 0.102
2.567AlaPhe: 2.567 ± 0.081
4.295AlaGly: 4.295 ± 0.103
0.781AlaHis: 0.781 ± 0.041
6.115AlaIle: 6.115 ± 0.11
5.205AlaLys: 5.205 ± 0.101
5.556AlaLeu: 5.556 ± 0.105
1.986AlaMet: 1.986 ± 0.064
2.958AlaAsn: 2.958 ± 0.075
1.454AlaPro: 1.454 ± 0.054
1.459AlaGln: 1.459 ± 0.051
2.362AlaArg: 2.362 ± 0.066
3.649AlaSer: 3.649 ± 0.084
2.925AlaThr: 2.925 ± 0.08
4.414AlaVal: 4.414 ± 0.106
0.335AlaTrp: 0.335 ± 0.024
2.076AlaTyr: 2.076 ± 0.064
0.0AlaXaa: 0.0 ± 0.0
Cys
0.646CysAla: 0.646 ± 0.04
0.177CysCys: 0.177 ± 0.021
0.653CysAsp: 0.653 ± 0.036
0.781CysGlu: 0.781 ± 0.036
0.443CysPhe: 0.443 ± 0.028
1.096CysGly: 1.096 ± 0.046
0.204CysHis: 0.204 ± 0.017
0.964CysIle: 0.964 ± 0.042
0.897CysLys: 0.897 ± 0.042
0.78CysLeu: 0.78 ± 0.039
0.293CysMet: 0.293 ± 0.023
0.564CysAsn: 0.564 ± 0.033
0.46CysPro: 0.46 ± 0.03
0.244CysGln: 0.244 ± 0.021
0.398CysArg: 0.398 ± 0.027
0.678CysSer: 0.678 ± 0.037
0.487CysThr: 0.487 ± 0.029
0.659CysVal: 0.659 ± 0.03
0.079CysTrp: 0.079 ± 0.013
0.378CysTyr: 0.378 ± 0.028
0.0CysXaa: 0.0 ± 0.0
Asp
3.142AspAla: 3.142 ± 0.082
0.663AspCys: 0.663 ± 0.036
3.355AspAsp: 3.355 ± 0.084
5.354AspGlu: 5.354 ± 0.119
3.073AspPhe: 3.073 ± 0.061
3.825AspGly: 3.825 ± 0.112
0.636AspHis: 0.636 ± 0.033
6.406AspIle: 6.406 ± 0.118
5.864AspLys: 5.864 ± 0.107
4.988AspLeu: 4.988 ± 0.097
1.877AspMet: 1.877 ± 0.051
3.075AspAsn: 3.075 ± 0.073
1.512AspPro: 1.512 ± 0.054
0.914AspGln: 0.914 ± 0.036
2.453AspArg: 2.453 ± 0.06
3.828AspSer: 3.828 ± 0.093
2.536AspThr: 2.536 ± 0.071
3.883AspVal: 3.883 ± 0.08
0.381AspTrp: 0.381 ± 0.028
2.826AspTyr: 2.826 ± 0.068
0.0AspXaa: 0.0 ± 0.0
Glu
4.536GluAla: 4.536 ± 0.097
0.661GluCys: 0.661 ± 0.033
5.098GluAsp: 5.098 ± 0.113
8.101GluGlu: 8.101 ± 0.16
3.246GluPhe: 3.246 ± 0.081
4.414GluGly: 4.414 ± 0.104
0.965GluHis: 0.965 ± 0.039
7.716GluIle: 7.716 ± 0.135
9.46GluLys: 9.46 ± 0.177
6.878GluLeu: 6.878 ± 0.137
2.317GluMet: 2.317 ± 0.059
5.779GluAsn: 5.779 ± 0.112
1.211GluPro: 1.211 ± 0.051
1.516GluGln: 1.516 ± 0.048
2.976GluArg: 2.976 ± 0.075
3.949GluSer: 3.949 ± 0.087
3.053GluThr: 3.053 ± 0.079
4.989GluVal: 4.989 ± 0.104
0.383GluTrp: 0.383 ± 0.026
3.391GluTyr: 3.391 ± 0.092
0.0GluXaa: 0.0 ± 0.0
Phe
2.638PheAla: 2.638 ± 0.082
0.539PheCys: 0.539 ± 0.03
2.804PheAsp: 2.804 ± 0.073
3.229PheGlu: 3.229 ± 0.081
1.968PhePhe: 1.968 ± 0.068
2.849PheGly: 2.849 ± 0.083
0.463PheHis: 0.463 ± 0.024
4.203PheIle: 4.203 ± 0.104
3.498PheLys: 3.498 ± 0.072
3.733PheLeu: 3.733 ± 0.1
1.303PheMet: 1.303 ± 0.049
2.356PheAsn: 2.356 ± 0.069
1.211PhePro: 1.211 ± 0.042
0.778PheGln: 0.778 ± 0.033
1.357PheArg: 1.357 ± 0.049
3.089PheSer: 3.089 ± 0.078
2.128PheThr: 2.128 ± 0.063
2.796PheVal: 2.796 ± 0.071
0.258PheTrp: 0.258 ± 0.021
1.613PheTyr: 1.613 ± 0.06
0.0PheXaa: 0.0 ± 0.0
Gly
4.171GlyAla: 4.171 ± 0.106
0.868GlyCys: 0.868 ± 0.046
3.472GlyAsp: 3.472 ± 0.075
4.435GlyGlu: 4.435 ± 0.097
2.983GlyPhe: 2.983 ± 0.074
4.37GlyGly: 4.37 ± 0.113
1.027GlyHis: 1.027 ± 0.045
6.426GlyIle: 6.426 ± 0.12
6.07GlyLys: 6.07 ± 0.108
5.173GlyLeu: 5.173 ± 0.113
2.003GlyMet: 2.003 ± 0.062
3.201GlyAsn: 3.201 ± 0.075
1.298GlyPro: 1.298 ± 0.052
1.506GlyGln: 1.506 ± 0.057
2.478GlyArg: 2.478 ± 0.061
3.537GlySer: 3.537 ± 0.079
3.11GlyThr: 3.11 ± 0.076
4.892GlyVal: 4.892 ± 0.099
0.462GlyTrp: 0.462 ± 0.036
2.986GlyTyr: 2.986 ± 0.068
0.0GlyXaa: 0.0 ± 0.0
His
0.689HisAla: 0.689 ± 0.033
0.182HisCys: 0.182 ± 0.018
0.708HisAsp: 0.708 ± 0.037
0.873HisGlu: 0.873 ± 0.039
0.596HisPhe: 0.596 ± 0.031
0.952HisGly: 0.952 ± 0.039
0.263HisHis: 0.263 ± 0.025
1.298HisIle: 1.298 ± 0.047
1.021HisLys: 1.021 ± 0.049
1.103HisLeu: 1.103 ± 0.046
0.356HisMet: 0.356 ± 0.024
0.691HisAsn: 0.691 ± 0.031
0.569HisPro: 0.569 ± 0.034
0.316HisGln: 0.316 ± 0.023
0.509HisArg: 0.509 ± 0.032
0.939HisSer: 0.939 ± 0.04
0.694HisThr: 0.694 ± 0.036
0.738HisVal: 0.738 ± 0.036
0.097HisTrp: 0.097 ± 0.013
0.473HisTyr: 0.473 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.385IleAla: 6.385 ± 0.122
1.064IleCys: 1.064 ± 0.049
6.329IleAsp: 6.329 ± 0.106
7.624IleGlu: 7.624 ± 0.136
3.949IlePhe: 3.949 ± 0.097
5.919IleGly: 5.919 ± 0.114
1.114IleHis: 1.114 ± 0.039
7.996IleIle: 7.996 ± 0.158
7.994IleLys: 7.994 ± 0.118
8.367IleLeu: 8.367 ± 0.148
2.27IleMet: 2.27 ± 0.066
5.145IleAsn: 5.145 ± 0.117
3.008IlePro: 3.008 ± 0.07
1.855IleGln: 1.855 ± 0.065
3.16IleArg: 3.16 ± 0.078
6.871IleSer: 6.871 ± 0.132
4.126IleThr: 4.126 ± 0.087
6.733IleVal: 6.733 ± 0.108
0.418IleTrp: 0.418 ± 0.028
3.405IleTyr: 3.405 ± 0.073
0.0IleXaa: 0.0 ± 0.0
Lys
5.049LysAla: 5.049 ± 0.095
0.684LysCys: 0.684 ± 0.039
6.199LysAsp: 6.199 ± 0.114
9.125LysGlu: 9.125 ± 0.151
3.319LysPhe: 3.319 ± 0.069
4.653LysGly: 4.653 ± 0.09
1.169LysHis: 1.169 ± 0.046
8.34LysIle: 8.34 ± 0.134
10.223LysLys: 10.223 ± 0.149
7.517LysLeu: 7.517 ± 0.124
3.085LysMet: 3.085 ± 0.074
6.575LysAsn: 6.575 ± 0.12
1.978LysPro: 1.978 ± 0.067
1.814LysGln: 1.814 ± 0.058
3.333LysArg: 3.333 ± 0.079
5.468LysSer: 5.468 ± 0.101
4.189LysThr: 4.189 ± 0.103
5.94LysVal: 5.94 ± 0.105
0.529LysTrp: 0.529 ± 0.033
4.27LysTyr: 4.27 ± 0.093
0.0LysXaa: 0.0 ± 0.0
Leu
5.491LeuAla: 5.491 ± 0.114
0.895LeuCys: 0.895 ± 0.035
5.402LeuAsp: 5.402 ± 0.105
7.045LeuGlu: 7.045 ± 0.136
3.371LeuPhe: 3.371 ± 0.096
5.896LeuGly: 5.896 ± 0.095
1.019LeuHis: 1.019 ± 0.042
6.922LeuIle: 6.922 ± 0.108
7.813LeuLys: 7.813 ± 0.143
7.221LeuLeu: 7.221 ± 0.129
2.369LeuMet: 2.369 ± 0.064
4.698LeuAsn: 4.698 ± 0.081
2.712LeuPro: 2.712 ± 0.074
1.717LeuGln: 1.717 ± 0.063
3.105LeuArg: 3.105 ± 0.079
6.376LeuSer: 6.376 ± 0.115
3.949LeuThr: 3.949 ± 0.082
5.531LeuVal: 5.531 ± 0.093
0.463LeuTrp: 0.463 ± 0.027
2.894LeuTyr: 2.894 ± 0.06
0.0LeuXaa: 0.0 ± 0.0
Met
2.003MetAla: 2.003 ± 0.068
0.315MetCys: 0.315 ± 0.022
1.907MetAsp: 1.907 ± 0.059
1.983MetGlu: 1.983 ± 0.055
1.156MetPhe: 1.156 ± 0.048
2.009MetGly: 2.009 ± 0.059
0.331MetHis: 0.331 ± 0.023
2.352MetIle: 2.352 ± 0.062
2.893MetLys: 2.893 ± 0.062
2.448MetLeu: 2.448 ± 0.064
0.865MetMet: 0.865 ± 0.043
1.685MetAsn: 1.685 ± 0.052
0.897MetPro: 0.897 ± 0.034
0.673MetGln: 0.673 ± 0.033
1.089MetArg: 1.089 ± 0.049
2.016MetSer: 2.016 ± 0.054
1.397MetThr: 1.397 ± 0.048
1.805MetVal: 1.805 ± 0.054
0.167MetTrp: 0.167 ± 0.017
0.954MetTyr: 0.954 ± 0.043
0.0MetXaa: 0.0 ± 0.0
Asn
2.899AsnAla: 2.899 ± 0.076
0.592AsnCys: 0.592 ± 0.034
2.856AsnAsp: 2.856 ± 0.065
3.846AsnGlu: 3.846 ± 0.086
2.367AsnPhe: 2.367 ± 0.07
3.406AsnGly: 3.406 ± 0.088
0.736AsnHis: 0.736 ± 0.036
6.338AsnIle: 6.338 ± 0.123
5.538AsnLys: 5.538 ± 0.108
4.798AsnLeu: 4.798 ± 0.107
1.686AsnMet: 1.686 ± 0.049
3.525AsnAsn: 3.525 ± 0.109
2.158AsnPro: 2.158 ± 0.049
1.308AsnGln: 1.308 ± 0.052
2.245AsnArg: 2.245 ± 0.055
3.835AsnSer: 3.835 ± 0.088
2.766AsnThr: 2.766 ± 0.079
3.336AsnVal: 3.336 ± 0.072
0.345AsnTrp: 0.345 ± 0.026
2.208AsnTyr: 2.208 ± 0.063
0.0AsnXaa: 0.0 ± 0.0
Pro
1.635ProAla: 1.635 ± 0.063
0.315ProCys: 0.315 ± 0.028
1.635ProAsp: 1.635 ± 0.054
2.471ProGlu: 2.471 ± 0.071
1.285ProPhe: 1.285 ± 0.053
1.755ProGly: 1.755 ± 0.055
0.435ProHis: 0.435 ± 0.027
2.5ProIle: 2.5 ± 0.063
2.219ProLys: 2.219 ± 0.067
2.132ProLeu: 2.132 ± 0.063
0.724ProMet: 0.724 ± 0.036
1.425ProAsn: 1.425 ± 0.055
0.52ProPro: 0.52 ± 0.029
0.761ProGln: 0.761 ± 0.038
0.875ProArg: 0.875 ± 0.041
1.732ProSer: 1.732 ± 0.05
1.384ProThr: 1.384 ± 0.046
2.115ProVal: 2.115 ± 0.06
0.219ProTrp: 0.219 ± 0.022
1.124ProTyr: 1.124 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
1.412GlnAla: 1.412 ± 0.055
0.226GlnCys: 0.226 ± 0.021
1.079GlnAsp: 1.079 ± 0.044
1.666GlnGlu: 1.666 ± 0.054
0.863GlnPhe: 0.863 ± 0.045
1.405GlnGly: 1.405 ± 0.054
0.298GlnHis: 0.298 ± 0.021
2.045GlnIle: 2.045 ± 0.057
1.989GlnLys: 1.989 ± 0.059
1.815GlnLeu: 1.815 ± 0.06
0.661GlnMet: 0.661 ± 0.032
1.191GlnAsn: 1.191 ± 0.042
0.497GlnPro: 0.497 ± 0.031
0.472GlnGln: 0.472 ± 0.031
0.917GlnArg: 0.917 ± 0.044
1.303GlnSer: 1.303 ± 0.046
0.987GlnThr: 0.987 ± 0.038
1.409GlnVal: 1.409 ± 0.047
0.169GlnTrp: 0.169 ± 0.016
0.863GlnTyr: 0.863 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
2.234ArgAla: 2.234 ± 0.061
0.368ArgCys: 0.368 ± 0.029
2.132ArgAsp: 2.132 ± 0.057
3.498ArgGlu: 3.498 ± 0.078
1.643ArgPhe: 1.643 ± 0.049
2.135ArgGly: 2.135 ± 0.064
0.522ArgHis: 0.522 ± 0.029
3.256ArgIle: 3.256 ± 0.071
3.619ArgLys: 3.619 ± 0.074
3.082ArgLeu: 3.082 ± 0.071
1.099ArgMet: 1.099 ± 0.043
2.207ArgAsn: 2.207 ± 0.06
0.997ArgPro: 0.997 ± 0.044
0.942ArgGln: 0.942 ± 0.045
1.63ArgArg: 1.63 ± 0.064
1.758ArgSer: 1.758 ± 0.053
1.625ArgThr: 1.625 ± 0.058
2.603ArgVal: 2.603 ± 0.071
0.234ArgTrp: 0.234 ± 0.021
1.641ArgTyr: 1.641 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
3.564SerAla: 3.564 ± 0.075
0.647SerCys: 0.647 ± 0.041
3.607SerAsp: 3.607 ± 0.085
4.586SerGlu: 4.586 ± 0.092
2.911SerPhe: 2.911 ± 0.082
4.675SerGly: 4.675 ± 0.096
0.915SerHis: 0.915 ± 0.036
6.261SerIle: 6.261 ± 0.122
6.098SerLys: 6.098 ± 0.111
5.483SerLeu: 5.483 ± 0.097
1.824SerMet: 1.824 ± 0.062
3.423SerAsn: 3.423 ± 0.1
1.628SerPro: 1.628 ± 0.05
1.653SerGln: 1.653 ± 0.049
2.498SerArg: 2.498 ± 0.065
4.546SerSer: 4.546 ± 0.111
2.997SerThr: 2.997 ± 0.076
4.082SerVal: 4.082 ± 0.087
0.417SerTrp: 0.417 ± 0.024
2.592SerTyr: 2.592 ± 0.068
0.0SerXaa: 0.0 ± 0.0
Thr
2.981ThrAla: 2.981 ± 0.082
0.447ThrCys: 0.447 ± 0.026
2.645ThrAsp: 2.645 ± 0.074
3.107ThrGlu: 3.107 ± 0.07
1.924ThrPhe: 1.924 ± 0.057
3.629ThrGly: 3.629 ± 0.086
0.713ThrHis: 0.713 ± 0.035
4.26ThrIle: 4.26 ± 0.09
3.378ThrLys: 3.378 ± 0.077
3.98ThrLeu: 3.98 ± 0.092
1.251ThrMet: 1.251 ± 0.048
2.172ThrAsn: 2.172 ± 0.071
1.735ThrPro: 1.735 ± 0.066
1.049ThrGln: 1.049 ± 0.046
1.608ThrArg: 1.608 ± 0.053
3.115ThrSer: 3.115 ± 0.075
2.364ThrThr: 2.364 ± 0.073
3.736ThrVal: 3.736 ± 0.081
0.328ThrTrp: 0.328 ± 0.025
1.603ThrTyr: 1.603 ± 0.049
0.0ThrXaa: 0.0 ± 0.0
Val
4.379ValAla: 4.379 ± 0.108
0.897ValCys: 0.897 ± 0.037
4.598ValAsp: 4.598 ± 0.089
5.494ValGlu: 5.494 ± 0.099
3.082ValPhe: 3.082 ± 0.071
4.337ValGly: 4.337 ± 0.096
0.88ValHis: 0.88 ± 0.048
5.908ValIle: 5.908 ± 0.116
5.439ValLys: 5.439 ± 0.099
6.003ValLeu: 6.003 ± 0.099
1.747ValMet: 1.747 ± 0.059
3.341ValAsn: 3.341 ± 0.075
2.008ValPro: 2.008 ± 0.063
1.223ValGln: 1.223 ± 0.044
2.356ValArg: 2.356 ± 0.067
4.673ValSer: 4.673 ± 0.106
3.1ValThr: 3.1 ± 0.072
4.84ValVal: 4.84 ± 0.114
0.392ValTrp: 0.392 ± 0.024
2.587ValTyr: 2.587 ± 0.062
0.0ValXaa: 0.0 ± 0.0
Trp
0.351TrpAla: 0.351 ± 0.029
0.102TrpCys: 0.102 ± 0.011
0.397TrpAsp: 0.397 ± 0.028
0.383TrpGlu: 0.383 ± 0.026
0.261TrpPhe: 0.261 ± 0.021
0.388TrpGly: 0.388 ± 0.024
0.105TrpHis: 0.105 ± 0.014
0.606TrpIle: 0.606 ± 0.031
0.514TrpLys: 0.514 ± 0.031
0.509TrpLeu: 0.509 ± 0.03
0.179TrpMet: 0.179 ± 0.017
0.348TrpAsn: 0.348 ± 0.025
0.127TrpPro: 0.127 ± 0.015
0.191TrpGln: 0.191 ± 0.017
0.191TrpArg: 0.191 ± 0.018
0.331TrpSer: 0.331 ± 0.023
0.283TrpThr: 0.283 ± 0.023
0.376TrpVal: 0.376 ± 0.027
0.067TrpTrp: 0.067 ± 0.01
0.269TrpTyr: 0.269 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.034TyrAla: 2.034 ± 0.06
0.484TyrCys: 0.484 ± 0.027
2.453TyrAsp: 2.453 ± 0.066
2.951TyrGlu: 2.951 ± 0.08
1.805TyrPhe: 1.805 ± 0.063
2.461TyrGly: 2.461 ± 0.063
0.489TyrHis: 0.489 ± 0.029
3.823TyrIle: 3.823 ± 0.089
3.729TyrLys: 3.729 ± 0.09
3.326TyrLeu: 3.326 ± 0.071
1.046TyrMet: 1.046 ± 0.041
2.433TyrAsn: 2.433 ± 0.086
1.278TyrPro: 1.278 ± 0.048
0.878TyrGln: 0.878 ± 0.041
1.685TyrArg: 1.685 ± 0.06
2.714TyrSer: 2.714 ± 0.069
1.926TyrThr: 1.926 ± 0.057
2.404TyrVal: 2.404 ± 0.062
0.236TyrTrp: 0.236 ± 0.017
1.782TyrTyr: 1.782 ± 0.059
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1869 proteins (597693 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
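A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used for the table is not described here, so this is only an assumption-laden illustration: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


proteins = ["MKKLAE", "MKEIL", "MAAKK", "MLLKE"]  # toy sequences, not real data
point = dipeptide_per_mille(proteins, "KK")
error = bootstrap_error(proteins, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so neighbouring residues within a protein stay together in every replicate.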

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski