Amino acid dipeptide frequency for Megasphaera paucivorans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
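As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of neighbouring residues in a list of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the one-letter sequence encoding, and the toy sequences are assumptions made for this example; the exact counting procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Hypothetical helper: sequences are assumed to be one-letter-code strings.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping window of width 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real Megasphaera paucivorans sequences).
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_permille(toy_proteome)
```

Here the toy input contains five dipeptides in total (MA, AA, AL, MA, AL), so each frequency is a multiple of 200‰.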

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
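The expected value under the uniform assumption, and the resulting over/under classification, can be sketched as follows. The threshold used here (exactly 1/400) and the function name `representation` are assumptions for illustration; the page does not state whether the colouring uses 400 or 441 dipeptides as the baseline or whether the error estimate is taken into account.

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # standard amino acids plus Xaa

# Uniform expectation in per mille: 1000 / 400 = 2.5
expected_permille = 1000.0 / N_STANDARD ** 2
# With Xaa included: 1000 / 441, roughly 2.27
expected_permille_xaa = 1000.0 / N_WITH_XAA ** 2


def representation(value_permille, expected=expected_permille):
    """Classify a dipeptide frequency against the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla and CysCys.
ala_ala = representation(9.464)
cys_cys = representation(0.278)
```

With the values from the table, AlaAla (9.464‰) falls above the 2.5‰ baseline and CysCys (0.278‰) falls below it.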

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions (for example, 9.464‰ = 0.009464).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.464 ± 0.166
AlaCys: 1.249 ± 0.047
AlaAsp: 4.524 ± 0.073
AlaGlu: 4.885 ± 0.087
AlaPhe: 3.125 ± 0.061
AlaGly: 6.597 ± 0.125
AlaHis: 1.494 ± 0.047
AlaIle: 6.151 ± 0.1
AlaLys: 4.632 ± 0.073
AlaLeu: 7.453 ± 0.084
AlaMet: 2.7 ± 0.06
AlaAsn: 2.861 ± 0.066
AlaPro: 2.302 ± 0.049
AlaGln: 2.747 ± 0.059
AlaArg: 2.95 ± 0.065
AlaSer: 4.196 ± 0.068
AlaThr: 3.407 ± 0.068
AlaVal: 7.354 ± 0.109
AlaTrp: 0.662 ± 0.03
AlaTyr: 2.889 ± 0.058
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.985 ± 0.031
CysCys: 0.278 ± 0.018
CysAsp: 0.737 ± 0.031
CysGlu: 0.714 ± 0.026
CysPhe: 0.601 ± 0.027
CysGly: 1.457 ± 0.048
CysHis: 0.394 ± 0.023
CysIle: 1.271 ± 0.035
CysLys: 0.735 ± 0.03
CysLeu: 1.229 ± 0.04
CysMet: 0.453 ± 0.027
CysAsn: 0.557 ± 0.026
CysPro: 0.671 ± 0.034
CysGln: 0.471 ± 0.026
CysArg: 0.879 ± 0.037
CysSer: 0.918 ± 0.032
CysThr: 0.783 ± 0.034
CysVal: 0.89 ± 0.029
CysTrp: 0.129 ± 0.013
CysTyr: 0.477 ± 0.028
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.01 ± 0.084
AspCys: 0.706 ± 0.031
AspAsp: 2.658 ± 0.073
AspGlu: 3.235 ± 0.062
AspPhe: 2.377 ± 0.057
AspGly: 3.529 ± 0.069
AspHis: 1.064 ± 0.035
AspIle: 4.965 ± 0.079
AspLys: 3.423 ± 0.074
AspLeu: 4.214 ± 0.075
AspMet: 1.956 ± 0.049
AspAsn: 2.153 ± 0.064
AspPro: 1.892 ± 0.049
AspGln: 1.418 ± 0.04
AspArg: 2.228 ± 0.049
AspSer: 2.718 ± 0.054
AspThr: 3.497 ± 0.072
AspVal: 3.924 ± 0.077
AspTrp: 0.617 ± 0.025
AspTyr: 2.105 ± 0.055
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.926 ± 0.089
GluCys: 0.631 ± 0.03
GluAsp: 2.915 ± 0.07
GluGlu: 4.736 ± 0.091
GluPhe: 2.032 ± 0.045
GluGly: 3.739 ± 0.075
GluHis: 1.375 ± 0.043
GluIle: 4.738 ± 0.081
GluLys: 5.446 ± 0.083
GluLeu: 5.491 ± 0.093
GluMet: 1.977 ± 0.05
GluAsn: 3.175 ± 0.065
GluPro: 1.684 ± 0.045
GluGln: 2.729 ± 0.068
GluArg: 2.842 ± 0.066
GluSer: 2.815 ± 0.059
GluThr: 3.226 ± 0.061
GluVal: 3.588 ± 0.069
GluTrp: 0.59 ± 0.029
GluTyr: 2.351 ± 0.054
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.012 ± 0.063
PheCys: 0.717 ± 0.033
PheAsp: 2.396 ± 0.054
PheGlu: 2.002 ± 0.047
PhePhe: 2.073 ± 0.067
PheGly: 3.098 ± 0.068
PheHis: 0.926 ± 0.032
PheIle: 3.299 ± 0.084
PheLys: 1.842 ± 0.048
PheLeu: 3.856 ± 0.086
PheMet: 1.268 ± 0.045
PheAsn: 1.619 ± 0.05
PhePro: 1.559 ± 0.043
PheGln: 1.224 ± 0.043
PheArg: 1.576 ± 0.047
PheSer: 3.013 ± 0.069
PheThr: 2.486 ± 0.054
PheVal: 2.82 ± 0.055
PheTrp: 0.45 ± 0.023
PheTyr: 1.497 ± 0.043
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.526 ± 0.096
GlyCys: 1.234 ± 0.048
GlyAsp: 3.284 ± 0.062
GlyGlu: 3.576 ± 0.063
GlyPhe: 3.102 ± 0.069
GlyGly: 5.237 ± 0.109
GlyHis: 1.621 ± 0.046
GlyIle: 6.792 ± 0.096
GlyLys: 5.01 ± 0.078
GlyLeu: 6.212 ± 0.099
GlyMet: 2.524 ± 0.054
GlyAsn: 3.156 ± 0.104
GlyPro: 1.83 ± 0.052
GlyGln: 2.364 ± 0.059
GlyArg: 3.15 ± 0.065
GlySer: 4.157 ± 0.09
GlyThr: 4.749 ± 0.107
GlyVal: 4.948 ± 0.075
GlyTrp: 0.742 ± 0.03
GlyTyr: 3.01 ± 0.06
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.552 ± 0.041
HisCys: 0.425 ± 0.021
HisAsp: 1.196 ± 0.034
HisGlu: 1.161 ± 0.038
HisPhe: 0.965 ± 0.037
HisGly: 1.577 ± 0.043
HisHis: 0.713 ± 0.031
HisIle: 2.107 ± 0.053
HisLys: 1.116 ± 0.041
HisLeu: 1.918 ± 0.05
HisMet: 0.736 ± 0.027
HisAsn: 0.903 ± 0.033
HisPro: 1.127 ± 0.038
HisGln: 0.737 ± 0.031
HisArg: 1.083 ± 0.034
HisSer: 1.226 ± 0.043
HisThr: 1.397 ± 0.041
HisVal: 1.623 ± 0.041
HisTrp: 0.251 ± 0.018
HisTyr: 0.835 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.834 ± 0.1
IleCys: 1.419 ± 0.049
IleAsp: 4.332 ± 0.075
IleGlu: 4.251 ± 0.071
IlePhe: 3.16 ± 0.066
IleGly: 5.958 ± 0.088
IleHis: 1.854 ± 0.048
IleIle: 6.224 ± 0.114
IleLys: 4.224 ± 0.071
IleLeu: 7.241 ± 0.103
IleMet: 2.238 ± 0.055
IleAsn: 3.148 ± 0.059
IlePro: 3.783 ± 0.062
IleGln: 2.736 ± 0.058
IleArg: 3.453 ± 0.076
IleSer: 5.177 ± 0.097
IleThr: 4.834 ± 0.083
IleVal: 5.581 ± 0.097
IleTrp: 0.665 ± 0.024
IleTyr: 2.695 ± 0.052
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.716 ± 0.082
LysCys: 0.531 ± 0.025
LysAsp: 3.288 ± 0.079
LysGlu: 5.03 ± 0.093
LysPhe: 1.813 ± 0.049
LysGly: 3.919 ± 0.073
LysHis: 1.22 ± 0.04
LysIle: 4.912 ± 0.082
LysLys: 5.657 ± 0.103
LysLeu: 4.994 ± 0.086
LysMet: 2.186 ± 0.055
LysAsn: 3.58 ± 0.067
LysPro: 2.026 ± 0.053
LysGln: 2.595 ± 0.063
LysArg: 2.767 ± 0.057
LysSer: 2.974 ± 0.065
LysThr: 3.616 ± 0.067
LysVal: 3.737 ± 0.081
LysTrp: 0.609 ± 0.025
LysTyr: 2.41 ± 0.058
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.52 ± 0.097
LeuCys: 1.451 ± 0.042
LeuAsp: 4.66 ± 0.078
LeuGlu: 5.104 ± 0.085
LeuPhe: 3.936 ± 0.078
LeuGly: 6.381 ± 0.109
LeuHis: 2.179 ± 0.052
LeuIle: 5.98 ± 0.1
LeuLys: 5.332 ± 0.084
LeuLeu: 8.365 ± 0.126
LeuMet: 2.495 ± 0.063
LeuAsn: 3.374 ± 0.073
LeuPro: 3.983 ± 0.071
LeuGln: 3.993 ± 0.07
LeuArg: 4.124 ± 0.077
LeuSer: 5.817 ± 0.091
LeuThr: 5.355 ± 0.08
LeuVal: 5.539 ± 0.089
LeuTrp: 0.788 ± 0.029
LeuTyr: 3.099 ± 0.067
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.856 ± 0.058
MetCys: 0.304 ± 0.019
MetAsp: 1.848 ± 0.043
MetGlu: 2.153 ± 0.05
MetPhe: 1.022 ± 0.032
MetGly: 2.362 ± 0.054
MetHis: 0.663 ± 0.027
MetIle: 2.21 ± 0.054
MetLys: 2.318 ± 0.051
MetLeu: 2.777 ± 0.067
MetMet: 1.018 ± 0.039
MetAsn: 1.529 ± 0.043
MetPro: 1.285 ± 0.048
MetGln: 1.229 ± 0.044
MetArg: 1.379 ± 0.042
MetSer: 1.737 ± 0.046
MetThr: 1.918 ± 0.051
MetVal: 1.962 ± 0.044
MetTrp: 0.176 ± 0.013
MetTyr: 0.983 ± 0.037
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.134 ± 0.07
AsnCys: 0.614 ± 0.029
AsnAsp: 2.117 ± 0.06
AsnGlu: 2.436 ± 0.055
AsnPhe: 1.573 ± 0.047
AsnGly: 3.171 ± 0.093
AsnHis: 0.98 ± 0.035
AsnIle: 3.551 ± 0.081
AsnLys: 2.676 ± 0.06
AsnLeu: 3.58 ± 0.073
AsnMet: 1.349 ± 0.041
AsnAsn: 1.91 ± 0.064
AsnPro: 2.028 ± 0.047
AsnGln: 1.423 ± 0.048
AsnArg: 1.904 ± 0.047
AsnSer: 2.335 ± 0.062
AsnThr: 2.518 ± 0.076
AsnVal: 3.033 ± 0.071
AsnTrp: 0.509 ± 0.026
AsnTyr: 1.574 ± 0.056
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.973 ± 0.063
ProCys: 0.486 ± 0.024
ProAsp: 2.294 ± 0.048
ProGlu: 2.946 ± 0.067
ProPhe: 1.765 ± 0.043
ProGly: 2.526 ± 0.056
ProHis: 0.874 ± 0.031
ProIle: 2.79 ± 0.057
ProLys: 1.943 ± 0.048
ProLeu: 3.366 ± 0.069
ProMet: 1.091 ± 0.036
ProAsn: 1.454 ± 0.048
ProPro: 1.051 ± 0.038
ProGln: 1.434 ± 0.045
ProArg: 1.188 ± 0.039
ProSer: 1.968 ± 0.047
ProThr: 1.85 ± 0.046
ProVal: 3.216 ± 0.068
ProTrp: 0.389 ± 0.021
ProTyr: 1.546 ± 0.044
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.125 ± 0.062
GlnCys: 0.504 ± 0.023
GlnAsp: 1.688 ± 0.052
GlnGlu: 2.613 ± 0.06
GlnPhe: 1.346 ± 0.033
GlnGly: 2.408 ± 0.053
GlnHis: 0.997 ± 0.04
GlnIle: 2.703 ± 0.057
GlnLys: 2.655 ± 0.059
GlnLeu: 3.248 ± 0.072
GlnMet: 1.196 ± 0.038
GlnAsn: 1.567 ± 0.045
GlnPro: 1.167 ± 0.041
GlnGln: 1.968 ± 0.07
GlnArg: 1.763 ± 0.044
GlnSer: 1.868 ± 0.048
GlnThr: 1.849 ± 0.049
GlnVal: 2.249 ± 0.051
GlnTrp: 0.437 ± 0.027
GlnTyr: 1.751 ± 0.053
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.756 ± 0.065
ArgCys: 0.577 ± 0.029
ArgAsp: 2.2 ± 0.051
ArgGlu: 2.956 ± 0.069
ArgPhe: 1.856 ± 0.05
ArgGly: 2.474 ± 0.05
ArgHis: 1.077 ± 0.036
ArgIle: 3.799 ± 0.069
ArgLys: 2.917 ± 0.061
ArgLeu: 3.824 ± 0.075
ArgMet: 1.459 ± 0.037
ArgAsn: 2.032 ± 0.048
ArgPro: 1.579 ± 0.046
ArgGln: 2.046 ± 0.055
ArgArg: 2.434 ± 0.058
ArgSer: 2.248 ± 0.051
ArgThr: 2.303 ± 0.051
ArgVal: 2.696 ± 0.055
ArgTrp: 0.519 ± 0.028
ArgTyr: 1.787 ± 0.052
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.345 ± 0.091
SerCys: 0.819 ± 0.034
SerAsp: 2.991 ± 0.066
SerGlu: 3.066 ± 0.068
SerPhe: 2.584 ± 0.063
SerGly: 4.695 ± 0.106
SerHis: 1.395 ± 0.045
SerIle: 4.518 ± 0.086
SerLys: 2.97 ± 0.068
SerLeu: 5.478 ± 0.09
SerMet: 1.785 ± 0.052
SerAsn: 2.224 ± 0.058
SerPro: 2.008 ± 0.055
SerGln: 2.048 ± 0.052
SerArg: 2.53 ± 0.055
SerSer: 3.265 ± 0.07
SerThr: 2.858 ± 0.068
SerVal: 3.988 ± 0.068
SerTrp: 0.669 ± 0.029
SerTyr: 2.3 ± 0.057
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.236 ± 0.091
ThrCys: 0.721 ± 0.028
ThrAsp: 2.98 ± 0.058
ThrGlu: 3.399 ± 0.066
ThrPhe: 2.152 ± 0.048
ThrGly: 4.76 ± 0.093
ThrHis: 1.069 ± 0.035
ThrIle: 4.533 ± 0.081
ThrLys: 3.011 ± 0.062
ThrLeu: 5.145 ± 0.087
ThrMet: 1.682 ± 0.046
ThrAsn: 2.22 ± 0.071
ThrPro: 2.486 ± 0.059
ThrGln: 1.679 ± 0.046
ThrArg: 2.028 ± 0.053
ThrSer: 3.011 ± 0.069
ThrThr: 3.084 ± 0.076
ThrVal: 4.652 ± 0.092
ThrTrp: 0.564 ± 0.025
ThrTyr: 2.117 ± 0.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.17 ± 0.096
ValCys: 1.179 ± 0.036
ValAsp: 3.742 ± 0.072
ValGlu: 3.974 ± 0.083
ValPhe: 3.158 ± 0.068
ValGly: 4.577 ± 0.081
ValHis: 1.517 ± 0.041
ValIle: 5.465 ± 0.093
ValLys: 3.954 ± 0.072
ValLeu: 6.766 ± 0.096
ValMet: 2.146 ± 0.045
ValAsn: 2.746 ± 0.071
ValPro: 2.965 ± 0.061
ValGln: 2.474 ± 0.062
ValArg: 3.068 ± 0.055
ValSer: 4.456 ± 0.083
ValThr: 4.183 ± 0.077
ValVal: 4.999 ± 0.08
ValTrp: 0.669 ± 0.03
ValTyr: 2.559 ± 0.057
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.66 ± 0.033
TrpCys: 0.143 ± 0.015
TrpAsp: 0.521 ± 0.029
TrpGlu: 0.569 ± 0.03
TrpPhe: 0.453 ± 0.024
TrpGly: 0.717 ± 0.033
TrpHis: 0.271 ± 0.017
TrpIle: 0.789 ± 0.03
TrpLys: 0.651 ± 0.026
TrpLeu: 0.998 ± 0.038
TrpMet: 0.28 ± 0.017
TrpAsn: 0.612 ± 0.027
TrpPro: 0.296 ± 0.019
TrpGln: 0.569 ± 0.029
TrpArg: 0.5 ± 0.023
TrpSer: 0.517 ± 0.028
TrpThr: 0.426 ± 0.02
TrpVal: 0.529 ± 0.029
TrpTrp: 0.135 ± 0.014
TrpTyr: 0.352 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.84 ± 0.063
TyrCys: 0.641 ± 0.027
TyrAsp: 2.317 ± 0.062
TyrGlu: 2.2 ± 0.055
TyrPhe: 1.64 ± 0.044
TyrGly: 3.042 ± 0.057
TyrHis: 0.966 ± 0.034
TyrIle: 3.01 ± 0.06
TyrLys: 2.069 ± 0.05
TyrLeu: 3.319 ± 0.065
TyrMet: 1.188 ± 0.038
TyrAsn: 1.595 ± 0.046
TyrPro: 1.43 ± 0.04
TyrGln: 1.219 ± 0.036
TyrArg: 1.693 ± 0.045
TyrSer: 2.119 ± 0.048
TyrThr: 2.191 ± 0.053
TyrVal: 2.439 ± 0.057
TyrTrp: 0.419 ± 0.025
TyrTyr: 1.468 ± 0.055
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2723 proteins (847563 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
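One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the frequency is recomputed on each resample. The sketch below follows that reading; it is an assumption, not the database's documented procedure. The function name `bootstrap_error`, the use of the sample standard deviation as the error, the fixed seed, and the toy sequences are all hypothetical choices for this example.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumed procedure: resample whole proteins with replacement n_rounds times
    and take the standard deviation of the recomputed frequencies.
    """
    rng = random.Random(seed)
    # Count dipeptides once per protein so that resampling is cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy inputs (not real proteome data).
varied_error = bootstrap_error(["MAAL", "MKKL", "AAAA", "MGL"], "AA")
uniform_error = bootstrap_error(["MAAL"] * 5, "AA")
```

When every protein is identical, each resample yields the same frequency, so the estimated error is zero; a heterogeneous set gives a positive spread.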

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski