Amino acid dipeptide frequency for Petrimonas mucosa

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
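As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. It is not the code used to produce this page: the function name `dipeptide_per_mille`, the one-letter alphabet, and the choice to count overlapping adjacent pairs within each protein and normalize by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Pairs are counted with a sliding window of width 2 inside each
    protein, so no pair ever spans two different proteins (assumption).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 pairs, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input: two short made-up sequences, "B" is treated as Xaa.
freqs = dipeptide_per_mille(["MKLLA", "LLB"])
```

With real data the input would be the list of protein sequences of the proteome, for example read from a FASTA file.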

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
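Because the values are in per mille, the uniform expectation of 0.25% corresponds to 2.5‰. The short Python sketch below converts a table value to a fraction and compares it with that expectation. The helper names are hypothetical, and the plain greater-than/less-than comparison against 2.5‰ is an assumption; the page does not state the exact rule used for the red and blue marking.

```python
# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value from the table into a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Three values taken from the table on this page.
examples = {"LeuLeu": 9.841, "AlaAla": 5.406, "CysCys": 0.133}
labels = {name: classify(value) for name, value in examples.items()}
```

For instance, AlaAla at 5.406‰ corresponds to a fraction of about 0.0054, roughly twice the uniform expectation.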

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.406 ± 0.08
AlaCys: 0.761 ± 0.027
AlaAsp: 3.87 ± 0.055
AlaGlu: 4.463 ± 0.066
AlaPhe: 3.062 ± 0.057
AlaGly: 5.185 ± 0.081
AlaHis: 1.218 ± 0.036
AlaIle: 4.836 ± 0.086
AlaLys: 3.673 ± 0.057
AlaLeu: 6.352 ± 0.085
AlaMet: 1.714 ± 0.044
AlaAsn: 2.949 ± 0.055
AlaPro: 2.208 ± 0.045
AlaGln: 2.195 ± 0.046
AlaArg: 3.498 ± 0.055
AlaSer: 4.168 ± 0.063
AlaThr: 3.541 ± 0.055
AlaVal: 4.617 ± 0.084
AlaTrp: 0.804 ± 0.029
AlaTyr: 2.867 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.535 ± 0.022
CysCys: 0.133 ± 0.012
CysAsp: 0.537 ± 0.022
CysGlu: 0.575 ± 0.025
CysPhe: 0.514 ± 0.024
CysGly: 0.868 ± 0.035
CysHis: 0.243 ± 0.015
CysIle: 0.638 ± 0.026
CysLys: 0.53 ± 0.022
CysLeu: 0.754 ± 0.022
CysMet: 0.211 ± 0.013
CysAsn: 0.509 ± 0.022
CysPro: 0.365 ± 0.02
CysGln: 0.289 ± 0.018
CysArg: 0.563 ± 0.027
CysSer: 0.686 ± 0.028
CysThr: 0.462 ± 0.022
CysVal: 0.557 ± 0.024
CysTrp: 0.125 ± 0.01
CysTyr: 0.393 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.813 ± 0.068
AspCys: 0.48 ± 0.02
AspAsp: 2.67 ± 0.053
AspGlu: 3.988 ± 0.071
AspPhe: 3.107 ± 0.054
AspGly: 3.938 ± 0.068
AspHis: 1.021 ± 0.031
AspIle: 3.903 ± 0.064
AspLys: 3.412 ± 0.06
AspLeu: 4.866 ± 0.07
AspMet: 1.331 ± 0.036
AspAsn: 2.737 ± 0.056
AspPro: 2.327 ± 0.053
AspGln: 1.499 ± 0.039
AspArg: 2.927 ± 0.054
AspSer: 2.839 ± 0.058
AspThr: 2.556 ± 0.051
AspVal: 3.241 ± 0.056
AspTrp: 0.917 ± 0.032
AspTyr: 2.571 ± 0.048
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.49 ± 0.08
GluCys: 0.474 ± 0.021
GluAsp: 2.806 ± 0.055
GluGlu: 5.455 ± 0.087
GluPhe: 2.616 ± 0.049
GluGly: 4.194 ± 0.061
GluHis: 1.154 ± 0.034
GluIle: 5.348 ± 0.076
GluLys: 5.664 ± 0.079
GluLeu: 6.381 ± 0.091
GluMet: 2.141 ± 0.044
GluAsn: 4.007 ± 0.062
GluPro: 2.049 ± 0.044
GluGln: 2.579 ± 0.054
GluArg: 3.928 ± 0.058
GluSer: 3.796 ± 0.067
GluThr: 3.528 ± 0.054
GluVal: 4.418 ± 0.069
GluTrp: 0.913 ± 0.03
GluTyr: 2.536 ± 0.051
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.042 ± 0.054
PheCys: 0.533 ± 0.024
PheAsp: 2.992 ± 0.058
PheGlu: 2.814 ± 0.056
PhePhe: 2.395 ± 0.05
PheGly: 3.292 ± 0.059
PheHis: 0.897 ± 0.031
PheIle: 3.165 ± 0.06
PheLys: 2.204 ± 0.048
PheLeu: 4.212 ± 0.074
PheMet: 1.13 ± 0.033
PheAsn: 2.558 ± 0.055
PhePro: 1.91 ± 0.044
PheGln: 1.366 ± 0.04
PheArg: 2.501 ± 0.045
PheSer: 3.751 ± 0.062
PheThr: 2.895 ± 0.057
PheVal: 2.953 ± 0.06
PheTrp: 0.652 ± 0.026
PheTyr: 2.002 ± 0.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.42 ± 0.078
GlyCys: 0.764 ± 0.028
GlyAsp: 3.664 ± 0.068
GlyGlu: 4.666 ± 0.06
GlyPhe: 3.369 ± 0.065
GlyGly: 4.938 ± 0.09
GlyHis: 1.304 ± 0.035
GlyIle: 5.433 ± 0.081
GlyLys: 5.284 ± 0.073
GlyLeu: 5.906 ± 0.096
GlyMet: 2.13 ± 0.05
GlyAsn: 3.766 ± 0.071
GlyPro: 1.514 ± 0.038
GlyGln: 1.972 ± 0.048
GlyArg: 3.447 ± 0.059
GlySer: 4.385 ± 0.076
GlyThr: 3.922 ± 0.069
GlyVal: 4.982 ± 0.077
GlyTrp: 1.111 ± 0.038
GlyTyr: 3.337 ± 0.07
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.212 ± 0.033
HisCys: 0.256 ± 0.016
HisAsp: 0.991 ± 0.032
HisGlu: 1.025 ± 0.029
HisPhe: 1.084 ± 0.032
HisGly: 1.27 ± 0.033
HisHis: 0.509 ± 0.023
HisIle: 1.329 ± 0.031
HisLys: 0.899 ± 0.032
HisLeu: 1.993 ± 0.046
HisMet: 0.369 ± 0.018
HisAsn: 0.935 ± 0.031
HisPro: 1.136 ± 0.039
HisGln: 0.701 ± 0.024
HisArg: 1.023 ± 0.035
HisSer: 1.178 ± 0.036
HisThr: 0.988 ± 0.031
HisVal: 1.062 ± 0.032
HisTrp: 0.295 ± 0.018
HisTyr: 0.899 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.145 ± 0.078
IleCys: 0.753 ± 0.026
IleAsp: 4.582 ± 0.07
IleGlu: 4.777 ± 0.068
IlePhe: 3.046 ± 0.063
IleGly: 4.92 ± 0.081
IleHis: 1.458 ± 0.037
IleIle: 4.54 ± 0.078
IleLys: 3.57 ± 0.062
IleLeu: 5.979 ± 0.101
IleMet: 1.316 ± 0.039
IleAsn: 3.42 ± 0.063
IlePro: 3.328 ± 0.053
IleGln: 2.012 ± 0.046
IleArg: 4.137 ± 0.067
IleSer: 4.775 ± 0.07
IleThr: 4.141 ± 0.066
IleVal: 4.392 ± 0.063
IleTrp: 0.782 ± 0.028
IleTyr: 2.727 ± 0.06
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.997 ± 0.064
LysCys: 0.415 ± 0.027
LysAsp: 2.882 ± 0.058
LysGlu: 5.236 ± 0.084
LysPhe: 2.143 ± 0.049
LysGly: 4.395 ± 0.065
LysHis: 1.046 ± 0.034
LysIle: 4.375 ± 0.075
LysLys: 4.265 ± 0.087
LysLeu: 5.08 ± 0.075
LysMet: 1.77 ± 0.037
LysAsn: 3.161 ± 0.065
LysPro: 2.286 ± 0.051
LysGln: 2.213 ± 0.05
LysArg: 3.835 ± 0.069
LysSer: 3.775 ± 0.059
LysThr: 3.149 ± 0.057
LysVal: 4.056 ± 0.06
LysTrp: 0.735 ± 0.028
LysTyr: 2.448 ± 0.057
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.146 ± 0.085
LeuCys: 0.937 ± 0.032
LeuAsp: 4.627 ± 0.075
LeuGlu: 5.788 ± 0.087
LeuPhe: 5.111 ± 0.082
LeuGly: 5.629 ± 0.091
LeuHis: 1.842 ± 0.045
LeuIle: 5.976 ± 0.086
LeuLys: 6.215 ± 0.077
LeuLeu: 9.841 ± 0.151
LeuMet: 2.319 ± 0.047
LeuAsn: 4.705 ± 0.065
LeuPro: 4.218 ± 0.075
LeuGln: 3.336 ± 0.065
LeuArg: 4.521 ± 0.079
LeuSer: 6.799 ± 0.088
LeuThr: 5.123 ± 0.069
LeuVal: 5.24 ± 0.079
LeuTrp: 1.021 ± 0.033
LeuTyr: 3.521 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.95 ± 0.039
MetCys: 0.165 ± 0.012
MetAsp: 1.276 ± 0.036
MetGlu: 1.746 ± 0.041
MetPhe: 0.838 ± 0.028
MetGly: 1.811 ± 0.046
MetHis: 0.453 ± 0.02
MetIle: 1.694 ± 0.045
MetLys: 2.139 ± 0.044
MetLeu: 2.336 ± 0.054
MetMet: 0.733 ± 0.027
MetAsn: 1.369 ± 0.036
MetPro: 1.055 ± 0.029
MetGln: 0.933 ± 0.027
MetArg: 1.329 ± 0.037
MetSer: 1.424 ± 0.037
MetThr: 1.241 ± 0.036
MetVal: 1.611 ± 0.04
MetTrp: 0.242 ± 0.016
MetTyr: 0.712 ± 0.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.09 ± 0.051
AsnCys: 0.459 ± 0.023
AsnAsp: 2.606 ± 0.05
AsnGlu: 3.024 ± 0.053
AsnPhe: 2.34 ± 0.05
AsnGly: 4.008 ± 0.078
AsnHis: 1.026 ± 0.032
AsnIle: 3.763 ± 0.07
AsnLys: 2.644 ± 0.048
AsnLeu: 4.67 ± 0.071
AsnMet: 1.221 ± 0.035
AsnAsn: 2.656 ± 0.056
AsnPro: 2.7 ± 0.056
AsnGln: 1.546 ± 0.043
AsnArg: 3.38 ± 0.063
AsnSer: 3.105 ± 0.055
AsnThr: 2.359 ± 0.05
AsnVal: 2.898 ± 0.064
AsnTrp: 0.759 ± 0.029
AsnTyr: 2.348 ± 0.047
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.863 ± 0.059
ProCys: 0.253 ± 0.013
ProAsp: 2.907 ± 0.056
ProGlu: 3.488 ± 0.057
ProPhe: 1.924 ± 0.049
ProGly: 2.971 ± 0.059
ProHis: 0.762 ± 0.025
ProIle: 2.336 ± 0.052
ProLys: 1.978 ± 0.041
ProLeu: 3.556 ± 0.063
ProMet: 0.845 ± 0.027
ProAsn: 1.69 ± 0.039
ProPro: 1.105 ± 0.034
ProGln: 1.381 ± 0.037
ProArg: 1.685 ± 0.038
ProSer: 2.417 ± 0.051
ProThr: 1.924 ± 0.042
ProVal: 3.411 ± 0.062
ProTrp: 0.526 ± 0.023
ProTyr: 1.782 ± 0.04
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.155 ± 0.054
GlnCys: 0.203 ± 0.015
GlnAsp: 1.396 ± 0.036
GlnGlu: 2.166 ± 0.05
GlnPhe: 1.531 ± 0.039
GlnGly: 1.93 ± 0.047
GlnHis: 0.685 ± 0.024
GlnIle: 2.309 ± 0.047
GlnLys: 2.098 ± 0.05
GlnLeu: 3.624 ± 0.061
GlnMet: 0.913 ± 0.032
GlnAsn: 1.572 ± 0.04
GlnPro: 1.38 ± 0.036
GlnGln: 1.57 ± 0.045
GlnArg: 1.794 ± 0.048
GlnSer: 1.914 ± 0.046
GlnThr: 1.75 ± 0.038
GlnVal: 2.169 ± 0.049
GlnTrp: 0.408 ± 0.019
GlnTyr: 1.346 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.992 ± 0.057
ArgCys: 0.396 ± 0.02
ArgAsp: 2.7 ± 0.052
ArgGlu: 4.224 ± 0.06
ArgPhe: 2.927 ± 0.054
ArgGly: 3.213 ± 0.058
ArgHis: 1.057 ± 0.033
ArgIle: 4.052 ± 0.067
ArgLys: 3.878 ± 0.07
ArgLeu: 5.119 ± 0.083
ArgMet: 1.49 ± 0.035
ArgAsn: 2.931 ± 0.057
ArgPro: 1.838 ± 0.042
ArgGln: 2.103 ± 0.047
ArgArg: 2.871 ± 0.056
ArgSer: 3.341 ± 0.059
ArgThr: 2.486 ± 0.053
ArgVal: 3.311 ± 0.055
ArgTrp: 0.809 ± 0.03
ArgTyr: 2.588 ± 0.049
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.156 ± 0.075
SerCys: 0.712 ± 0.026
SerAsp: 3.596 ± 0.061
SerGlu: 4.008 ± 0.069
SerPhe: 3.175 ± 0.061
SerGly: 5.088 ± 0.079
SerHis: 1.257 ± 0.039
SerIle: 4.335 ± 0.067
SerLys: 3.231 ± 0.067
SerLeu: 6.328 ± 0.075
SerMet: 1.475 ± 0.039
SerAsn: 2.926 ± 0.059
SerPro: 2.679 ± 0.053
SerGln: 1.997 ± 0.041
SerArg: 3.554 ± 0.052
SerSer: 4.143 ± 0.083
SerThr: 3.185 ± 0.063
SerVal: 4.273 ± 0.057
SerTrp: 0.932 ± 0.033
SerTyr: 2.778 ± 0.051
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.822 ± 0.073
ThrCys: 0.44 ± 0.022
ThrAsp: 3.104 ± 0.057
ThrGlu: 3.1 ± 0.063
ThrPhe: 2.574 ± 0.051
ThrGly: 4.688 ± 0.065
ThrHis: 1.027 ± 0.03
ThrIle: 3.852 ± 0.063
ThrLys: 2.296 ± 0.053
ThrLeu: 5.128 ± 0.076
ThrMet: 1.072 ± 0.031
ThrAsn: 2.14 ± 0.046
ThrPro: 2.734 ± 0.046
ThrGln: 1.542 ± 0.037
ThrArg: 2.614 ± 0.05
ThrSer: 3.179 ± 0.057
ThrThr: 3.143 ± 0.063
ThrVal: 4.092 ± 0.063
ThrTrp: 0.681 ± 0.028
ThrTyr: 2.038 ± 0.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.742 ± 0.079
ValCys: 0.747 ± 0.028
ValAsp: 3.722 ± 0.063
ValGlu: 4.69 ± 0.072
ValPhe: 2.69 ± 0.055
ValGly: 3.976 ± 0.06
ValHis: 1.073 ± 0.034
ValIle: 4.599 ± 0.068
ValLys: 4.207 ± 0.069
ValLeu: 5.43 ± 0.069
ValMet: 1.616 ± 0.042
ValAsn: 3.436 ± 0.057
ValPro: 2.634 ± 0.048
ValGln: 1.835 ± 0.045
ValArg: 3.226 ± 0.055
ValSer: 4.441 ± 0.07
ValThr: 3.993 ± 0.065
ValVal: 4.694 ± 0.081
ValTrp: 0.74 ± 0.028
ValTyr: 2.438 ± 0.046
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.73 ± 0.024
TrpCys: 0.153 ± 0.011
TrpAsp: 0.826 ± 0.034
TrpGlu: 0.941 ± 0.034
TrpPhe: 0.614 ± 0.022
TrpGly: 0.959 ± 0.033
TrpHis: 0.284 ± 0.017
TrpIle: 0.888 ± 0.035
TrpLys: 0.834 ± 0.033
TrpLeu: 1.201 ± 0.034
TrpMet: 0.412 ± 0.018
TrpAsn: 0.791 ± 0.031
TrpPro: 0.349 ± 0.02
TrpGln: 0.523 ± 0.025
TrpArg: 0.691 ± 0.029
TrpSer: 0.826 ± 0.031
TrpThr: 0.646 ± 0.026
TrpVal: 0.819 ± 0.031
TrpTrp: 0.177 ± 0.013
TrpTyr: 0.575 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.68 ± 0.046
TyrCys: 0.451 ± 0.021
TyrAsp: 2.389 ± 0.051
TyrGlu: 2.303 ± 0.041
TyrPhe: 2.204 ± 0.052
TyrGly: 3.044 ± 0.064
TyrHis: 0.84 ± 0.028
TyrIle: 2.43 ± 0.052
TyrLys: 2.243 ± 0.055
TyrLeu: 4.157 ± 0.068
TyrMet: 0.852 ± 0.029
TyrAsn: 2.344 ± 0.06
TyrPro: 1.991 ± 0.041
TyrGln: 1.32 ± 0.034
TyrArg: 2.803 ± 0.053
TyrSer: 2.885 ± 0.052
TyrThr: 2.234 ± 0.047
TyrVal: 2.131 ± 0.04
TyrTrp: 0.608 ± 0.025
TyrTyr: 1.915 ± 0.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2896 proteins (1089703 amino acids)
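Each table entry above ends with a label of the form `AlaAla: 5.406 ± 0.08` (first residue, second residue, frequency in ‰, estimated error). If the entries need to be processed as text, a small parser along the following lines may help. The regular expression and the function name `parse_cell` are assumptions for illustration, not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
CELL = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_cell(line):
    """Return (first, second, value, error), or None if the line is not a cell.

    `search` is used instead of `match`, so any text preceding the
    label on the same line is ignored.
    """
    m = CELL.search(line)
    if m is None:
        return None
    first, second, value, error = m.groups()
    return first, second, float(value), float(error)


cell = parse_cell("LeuLeu: 9.841 ± 0.151")
row_label = parse_cell("Leu")  # row label lines are not cells
```

Lines that only carry a row label, such as `Leu`, yield `None` and can be skipped.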

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
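Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. A minimal Python sketch of that idea follows. It is an assumption about the procedure rather than the actual Proteome-pI code: the function names are hypothetical, and reporting the standard deviation of the resampled estimates as the error is a common convention that the page does not confirm.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    In each round, as many proteins as in the input are drawn with
    replacement and the frequency is recomputed; the spread of those
    estimates (sample standard deviation, an assumption) is returned.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input: four short made-up sequences in one-letter code.
error = bootstrap_error(["MKLLA", "LLAG", "GGLL", "AKLM"], "LL")
```

Resampling whole proteins keeps the residues of each protein together, so correlations within a protein are reflected in the error estimate.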

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski