Amino acid dipepetide frequency for Blautia sp. CAG:37

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.718AlaAla: 8.718 ± 0.16
1.248AlaCys: 1.248 ± 0.047
4.94AlaAsp: 4.94 ± 0.104
6.602AlaGlu: 6.602 ± 0.122
3.258AlaPhe: 3.258 ± 0.073
6.646AlaGly: 6.646 ± 0.118
1.309AlaHis: 1.309 ± 0.04
4.655AlaIle: 4.655 ± 0.093
5.204AlaLys: 5.204 ± 0.104
7.049AlaLeu: 7.049 ± 0.096
2.582AlaMet: 2.582 ± 0.066
2.392AlaAsn: 2.392 ± 0.054
2.251AlaPro: 2.251 ± 0.064
2.331AlaGln: 2.331 ± 0.065
3.154AlaArg: 3.154 ± 0.078
4.384AlaSer: 4.384 ± 0.089
3.226AlaThr: 3.226 ± 0.074
6.994AlaVal: 6.994 ± 0.106
0.825AlaTrp: 0.825 ± 0.033
2.803AlaTyr: 2.803 ± 0.051
0.004AlaXaa: 0.004 ± 0.002
Cys
1.215CysAla: 1.215 ± 0.051
0.382CysCys: 0.382 ± 0.03
0.866CysAsp: 0.866 ± 0.037
0.944CysGlu: 0.944 ± 0.039
0.753CysPhe: 0.753 ± 0.032
1.592CysGly: 1.592 ± 0.048
0.377CysHis: 0.377 ± 0.024
1.022CysIle: 1.022 ± 0.038
0.76CysLys: 0.76 ± 0.033
1.151CysLeu: 1.151 ± 0.038
0.489CysMet: 0.489 ± 0.028
0.582CysAsn: 0.582 ± 0.029
0.643CysPro: 0.643 ± 0.032
0.487CysGln: 0.487 ± 0.025
0.896CysArg: 0.896 ± 0.036
0.964CysSer: 0.964 ± 0.036
0.786CysThr: 0.786 ± 0.032
1.106CysVal: 1.106 ± 0.04
0.154CysTrp: 0.154 ± 0.013
0.593CysTyr: 0.593 ± 0.032
0.001CysXaa: 0.001 ± 0.001
Asp
4.698AspAla: 4.698 ± 0.082
0.786AspCys: 0.786 ± 0.034
2.779AspAsp: 2.779 ± 0.073
4.476AspGlu: 4.476 ± 0.088
2.502AspPhe: 2.502 ± 0.071
4.763AspGly: 4.763 ± 0.096
1.054AspHis: 1.054 ± 0.041
3.498AspIle: 3.498 ± 0.072
2.583AspLys: 2.583 ± 0.072
5.021AspLeu: 5.021 ± 0.075
1.55AspMet: 1.55 ± 0.043
1.75AspAsn: 1.75 ± 0.055
2.12AspPro: 2.12 ± 0.061
1.559AspGln: 1.559 ± 0.05
2.417AspArg: 2.417 ± 0.059
2.696AspSer: 2.696 ± 0.065
3.193AspThr: 3.193 ± 0.072
3.764AspVal: 3.764 ± 0.069
0.619AspTrp: 0.619 ± 0.031
2.476AspTyr: 2.476 ± 0.069
0.0AspXaa: 0.0 ± 0.0
Glu
6.05GluAla: 6.05 ± 0.115
0.892GluCys: 0.892 ± 0.038
4.142GluAsp: 4.142 ± 0.074
7.807GluGlu: 7.807 ± 0.13
2.464GluPhe: 2.464 ± 0.049
4.484GluGly: 4.484 ± 0.069
1.425GluHis: 1.425 ± 0.047
5.459GluIle: 5.459 ± 0.098
7.163GluLys: 7.163 ± 0.118
6.673GluLeu: 6.673 ± 0.107
2.55GluMet: 2.55 ± 0.058
4.186GluAsn: 4.186 ± 0.091
2.165GluPro: 2.165 ± 0.07
3.352GluGln: 3.352 ± 0.089
3.581GluArg: 3.581 ± 0.074
3.397GluSer: 3.397 ± 0.075
4.525GluThr: 4.525 ± 0.102
4.486GluVal: 4.486 ± 0.077
0.701GluTrp: 0.701 ± 0.032
2.853GluTyr: 2.853 ± 0.069
0.003GluXaa: 0.003 ± 0.002
Phe
3.276PheAla: 3.276 ± 0.072
0.972PheCys: 0.972 ± 0.042
2.424PheAsp: 2.424 ± 0.058
2.487PheGlu: 2.487 ± 0.059
2.038PhePhe: 2.038 ± 0.067
3.308PheGly: 3.308 ± 0.071
1.0PheHis: 1.0 ± 0.042
2.185PheIle: 2.185 ± 0.058
1.567PheLys: 1.567 ± 0.053
4.344PheLeu: 4.344 ± 0.101
1.062PheMet: 1.062 ± 0.039
1.274PheAsn: 1.274 ± 0.043
1.571PhePro: 1.571 ± 0.045
1.538PheGln: 1.538 ± 0.046
1.925PheArg: 1.925 ± 0.051
2.97PheSer: 2.97 ± 0.071
2.25PheThr: 2.25 ± 0.065
2.853PheVal: 2.853 ± 0.064
0.477PheTrp: 0.477 ± 0.027
1.708PheTyr: 1.708 ± 0.052
0.0PheXaa: 0.0 ± 0.0
Gly
5.443GlyAla: 5.443 ± 0.1
1.38GlyCys: 1.38 ± 0.049
3.589GlyAsp: 3.589 ± 0.077
4.991GlyGlu: 4.991 ± 0.096
3.237GlyPhe: 3.237 ± 0.069
5.02GlyGly: 5.02 ± 0.102
1.294GlyHis: 1.294 ± 0.045
6.127GlyIle: 6.127 ± 0.104
5.712GlyLys: 5.712 ± 0.088
5.598GlyLeu: 5.598 ± 0.109
2.681GlyMet: 2.681 ± 0.064
3.059GlyAsn: 3.059 ± 0.066
1.423GlyPro: 1.423 ± 0.049
2.114GlyGln: 2.114 ± 0.049
3.055GlyArg: 3.055 ± 0.065
4.045GlySer: 4.045 ± 0.087
4.817GlyThr: 4.817 ± 0.082
5.314GlyVal: 5.314 ± 0.09
0.817GlyTrp: 0.817 ± 0.044
3.23GlyTyr: 3.23 ± 0.069
0.005GlyXaa: 0.005 ± 0.003
His
1.378HisAla: 1.378 ± 0.046
0.33HisCys: 0.33 ± 0.02
0.944HisAsp: 0.944 ± 0.039
1.119HisGlu: 1.119 ± 0.042
0.908HisPhe: 0.908 ± 0.032
1.451HisGly: 1.451 ± 0.041
0.507HisHis: 0.507 ± 0.04
1.311HisIle: 1.311 ± 0.048
0.855HisLys: 0.855 ± 0.037
1.756HisLeu: 1.756 ± 0.059
0.597HisMet: 0.597 ± 0.03
0.716HisAsn: 0.716 ± 0.03
1.047HisPro: 1.047 ± 0.041
0.667HisGln: 0.667 ± 0.033
0.902HisArg: 0.902 ± 0.035
0.932HisSer: 0.932 ± 0.034
1.054HisThr: 1.054 ± 0.038
1.28HisVal: 1.28 ± 0.04
0.196HisTrp: 0.196 ± 0.017
0.804HisTyr: 0.804 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
5.611IleAla: 5.611 ± 0.102
1.326IleCys: 1.326 ± 0.048
3.579IleAsp: 3.579 ± 0.068
4.207IleGlu: 4.207 ± 0.078
2.729IlePhe: 2.729 ± 0.066
4.954IleGly: 4.954 ± 0.108
1.435IleHis: 1.435 ± 0.049
3.76IleIle: 3.76 ± 0.076
2.893IleLys: 2.893 ± 0.069
6.866IleLeu: 6.866 ± 0.117
1.716IleMet: 1.716 ± 0.045
2.295IleAsn: 2.295 ± 0.064
3.165IlePro: 3.165 ± 0.062
2.193IleGln: 2.193 ± 0.061
3.744IleArg: 3.744 ± 0.076
4.349IleSer: 4.349 ± 0.075
3.941IleThr: 3.941 ± 0.074
4.56IleVal: 4.56 ± 0.085
0.689IleTrp: 0.689 ± 0.031
2.343IleTyr: 2.343 ± 0.054
0.001IleXaa: 0.001 ± 0.001
Lys
5.078LysAla: 5.078 ± 0.106
0.636LysCys: 0.636 ± 0.033
3.447LysAsp: 3.447 ± 0.073
6.672LysGlu: 6.672 ± 0.117
1.807LysPhe: 1.807 ± 0.047
3.86LysGly: 3.86 ± 0.062
0.839LysHis: 0.839 ± 0.032
4.317LysIle: 4.317 ± 0.089
6.066LysLys: 6.066 ± 0.108
4.804LysLeu: 4.804 ± 0.079
2.369LysMet: 2.369 ± 0.056
3.294LysAsn: 3.294 ± 0.062
2.031LysPro: 2.031 ± 0.051
2.237LysGln: 2.237 ± 0.051
2.963LysArg: 2.963 ± 0.065
2.847LysSer: 2.847 ± 0.06
3.828LysThr: 3.828 ± 0.077
3.945LysVal: 3.945 ± 0.077
0.605LysTrp: 0.605 ± 0.03
2.497LysTyr: 2.497 ± 0.059
0.001LysXaa: 0.001 ± 0.001
Leu
7.551LeuAla: 7.551 ± 0.116
1.606LeuCys: 1.606 ± 0.055
4.83LeuAsp: 4.83 ± 0.088
6.277LeuGlu: 6.277 ± 0.118
4.006LeuPhe: 4.006 ± 0.094
6.012LeuGly: 6.012 ± 0.111
1.789LeuHis: 1.789 ± 0.055
5.541LeuIle: 5.541 ± 0.103
5.476LeuLys: 5.476 ± 0.076
8.918LeuLeu: 8.918 ± 0.143
2.712LeuMet: 2.712 ± 0.061
3.396LeuAsn: 3.396 ± 0.069
3.825LeuPro: 3.825 ± 0.076
3.013LeuGln: 3.013 ± 0.066
3.987LeuArg: 3.987 ± 0.07
6.127LeuSer: 6.127 ± 0.098
5.051LeuThr: 5.051 ± 0.084
5.854LeuVal: 5.854 ± 0.108
0.786LeuTrp: 0.786 ± 0.038
3.314LeuTyr: 3.314 ± 0.069
0.0LeuXaa: 0.0 ± 0.0
Met
2.641MetAla: 2.641 ± 0.069
0.354MetCys: 0.354 ± 0.022
1.774MetAsp: 1.774 ± 0.044
2.709MetGlu: 2.709 ± 0.066
0.994MetPhe: 0.994 ± 0.035
2.175MetGly: 2.175 ± 0.06
0.524MetHis: 0.524 ± 0.025
2.213MetIle: 2.213 ± 0.051
2.46MetLys: 2.46 ± 0.055
2.607MetLeu: 2.607 ± 0.059
1.108MetMet: 1.108 ± 0.04
1.498MetAsn: 1.498 ± 0.048
1.285MetPro: 1.285 ± 0.041
1.162MetGln: 1.162 ± 0.042
1.433MetArg: 1.433 ± 0.042
1.636MetSer: 1.636 ± 0.047
1.807MetThr: 1.807 ± 0.056
2.123MetVal: 2.123 ± 0.053
0.221MetTrp: 0.221 ± 0.018
0.904MetTyr: 0.904 ± 0.034
0.0MetXaa: 0.0 ± 0.0
Asn
3.071AsnAla: 3.071 ± 0.073
0.557AsnCys: 0.557 ± 0.03
1.923AsnAsp: 1.923 ± 0.055
2.522AsnGlu: 2.522 ± 0.056
1.587AsnPhe: 1.587 ± 0.047
3.65AsnGly: 3.65 ± 0.072
0.786AsnHis: 0.786 ± 0.034
2.731AsnIle: 2.731 ± 0.057
1.952AsnLys: 1.952 ± 0.063
3.587AsnLeu: 3.587 ± 0.067
1.22AsnMet: 1.22 ± 0.041
1.439AsnAsn: 1.439 ± 0.048
1.948AsnPro: 1.948 ± 0.051
1.379AsnGln: 1.379 ± 0.045
2.007AsnArg: 2.007 ± 0.053
2.037AsnSer: 2.037 ± 0.058
2.202AsnThr: 2.202 ± 0.05
2.783AsnVal: 2.783 ± 0.066
0.407AsnTrp: 0.407 ± 0.025
1.586AsnTyr: 1.586 ± 0.04
0.001AsnXaa: 0.001 ± 0.001
Pro
2.743ProAla: 2.743 ± 0.069
0.455ProCys: 0.455 ± 0.027
2.404ProAsp: 2.404 ± 0.061
4.018ProGlu: 4.018 ± 0.089
1.636ProPhe: 1.636 ± 0.048
2.657ProGly: 2.657 ± 0.068
0.605ProHis: 0.605 ± 0.028
2.061ProIle: 2.061 ± 0.049
1.985ProLys: 1.985 ± 0.054
2.987ProLeu: 2.987 ± 0.075
1.037ProMet: 1.037 ± 0.039
1.234ProAsn: 1.234 ± 0.048
0.712ProPro: 0.712 ± 0.032
1.119ProGln: 1.119 ± 0.04
1.124ProArg: 1.124 ± 0.042
2.027ProSer: 2.027 ± 0.056
1.745ProThr: 1.745 ± 0.07
3.148ProVal: 3.148 ± 0.069
0.415ProTrp: 0.415 ± 0.024
1.46ProTyr: 1.46 ± 0.048
0.0ProXaa: 0.0 ± 0.0
Gln
2.472GlnAla: 2.472 ± 0.067
0.408GlnCys: 0.408 ± 0.026
1.549GlnAsp: 1.549 ± 0.046
2.881GlnGlu: 2.881 ± 0.06
1.154GlnPhe: 1.154 ± 0.04
1.986GlnGly: 1.986 ± 0.053
0.55GlnHis: 0.55 ± 0.03
2.779GlnIle: 2.779 ± 0.062
3.119GlnLys: 3.119 ± 0.07
2.73GlnLeu: 2.73 ± 0.068
1.402GlnMet: 1.402 ± 0.042
1.703GlnAsn: 1.703 ± 0.047
1.014GlnPro: 1.014 ± 0.04
1.302GlnGln: 1.302 ± 0.049
1.449GlnArg: 1.449 ± 0.044
1.599GlnSer: 1.599 ± 0.049
1.827GlnThr: 1.827 ± 0.058
2.141GlnVal: 2.141 ± 0.064
0.305GlnTrp: 0.305 ± 0.018
1.439GlnTyr: 1.439 ± 0.044
0.0GlnXaa: 0.0 ± 0.0
Arg
3.023ArgAla: 3.023 ± 0.064
0.655ArgCys: 0.655 ± 0.036
2.273ArgAsp: 2.273 ± 0.067
4.023ArgGlu: 4.023 ± 0.077
1.893ArgPhe: 1.893 ± 0.054
2.62ArgGly: 2.62 ± 0.065
0.902ArgHis: 0.902 ± 0.032
3.465ArgIle: 3.465 ± 0.073
3.621ArgLys: 3.621 ± 0.08
3.819ArgLeu: 3.819 ± 0.078
1.696ArgMet: 1.696 ± 0.053
2.043ArgAsn: 2.043 ± 0.057
1.468ArgPro: 1.468 ± 0.047
1.797ArgGln: 1.797 ± 0.053
2.502ArgArg: 2.502 ± 0.067
2.255ArgSer: 2.255 ± 0.053
2.466ArgThr: 2.466 ± 0.064
2.901ArgVal: 2.901 ± 0.065
0.394ArgTrp: 0.394 ± 0.024
1.832ArgTyr: 1.832 ± 0.053
0.0ArgXaa: 0.0 ± 0.0
Ser
4.56SerAla: 4.56 ± 0.09
0.928SerCys: 0.928 ± 0.041
3.104SerAsp: 3.104 ± 0.077
3.843SerGlu: 3.843 ± 0.078
2.627SerPhe: 2.627 ± 0.063
4.917SerGly: 4.917 ± 0.074
1.016SerHis: 1.016 ± 0.035
3.613SerIle: 3.613 ± 0.073
2.83SerLys: 2.83 ± 0.063
5.056SerLeu: 5.056 ± 0.079
1.817SerMet: 1.817 ± 0.059
1.859SerAsn: 1.859 ± 0.044
1.734SerPro: 1.734 ± 0.046
1.695SerGln: 1.695 ± 0.047
2.656SerArg: 2.656 ± 0.055
3.569SerSer: 3.569 ± 0.087
2.77SerThr: 2.77 ± 0.068
4.248SerVal: 4.248 ± 0.079
0.696SerTrp: 0.696 ± 0.036
2.457SerTyr: 2.457 ± 0.056
0.001SerXaa: 0.001 ± 0.001
Thr
4.964ThrAla: 4.964 ± 0.083
0.739ThrCys: 0.739 ± 0.035
3.071ThrAsp: 3.071 ± 0.066
4.22ThrGlu: 4.22 ± 0.083
2.161ThrPhe: 2.161 ± 0.058
4.748ThrGly: 4.748 ± 0.084
0.927ThrHis: 0.927 ± 0.034
3.898ThrIle: 3.898 ± 0.075
3.038ThrLys: 3.038 ± 0.068
4.885ThrLeu: 4.885 ± 0.09
1.638ThrMet: 1.638 ± 0.043
1.904ThrAsn: 1.904 ± 0.053
2.486ThrPro: 2.486 ± 0.071
1.419ThrGln: 1.419 ± 0.037
2.123ThrArg: 2.123 ± 0.049
2.974ThrSer: 2.974 ± 0.074
2.957ThrThr: 2.957 ± 0.078
4.722ThrVal: 4.722 ± 0.091
0.585ThrTrp: 0.585 ± 0.029
2.133ThrTyr: 2.133 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
4.715ValAla: 4.715 ± 0.089
1.354ValCys: 1.354 ± 0.043
3.734ValAsp: 3.734 ± 0.076
4.807ValGlu: 4.807 ± 0.077
3.177ValPhe: 3.177 ± 0.072
4.34ValGly: 4.34 ± 0.074
1.285ValHis: 1.285 ± 0.043
4.852ValIle: 4.852 ± 0.104
4.255ValLys: 4.255 ± 0.068
7.326ValLeu: 7.326 ± 0.123
2.08ValMet: 2.08 ± 0.055
2.709ValAsn: 2.709 ± 0.061
2.869ValPro: 2.869 ± 0.065
2.316ValGln: 2.316 ± 0.057
3.146ValArg: 3.146 ± 0.066
4.579ValSer: 4.579 ± 0.079
4.443ValThr: 4.443 ± 0.087
4.948ValVal: 4.948 ± 0.095
0.68ValTrp: 0.68 ± 0.029
2.709ValTyr: 2.709 ± 0.059
0.0ValXaa: 0.0 ± 0.0
Trp
0.578TrpAla: 0.578 ± 0.029
0.196TrpCys: 0.196 ± 0.016
0.529TrpAsp: 0.529 ± 0.028
0.713TrpGlu: 0.713 ± 0.036
0.44TrpPhe: 0.44 ± 0.025
0.739TrpGly: 0.739 ± 0.038
0.198TrpHis: 0.198 ± 0.017
0.713TrpIle: 0.713 ± 0.038
0.855TrpLys: 0.855 ± 0.037
0.903TrpLeu: 0.903 ± 0.034
0.367TrpMet: 0.367 ± 0.022
0.575TrpAsn: 0.575 ± 0.029
0.272TrpPro: 0.272 ± 0.02
0.389TrpGln: 0.389 ± 0.023
0.387TrpArg: 0.387 ± 0.021
0.552TrpSer: 0.552 ± 0.029
0.485TrpThr: 0.485 ± 0.025
0.571TrpVal: 0.571 ± 0.028
0.123TrpTrp: 0.123 ± 0.015
0.476TrpTyr: 0.476 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.787TyrAla: 2.787 ± 0.066
0.558TyrCys: 0.558 ± 0.028
2.404TyrAsp: 2.404 ± 0.058
3.04TyrGlu: 3.04 ± 0.075
1.86TyrPhe: 1.86 ± 0.061
3.027TyrGly: 3.027 ± 0.069
0.931TyrHis: 0.931 ± 0.035
2.153TyrIle: 2.153 ± 0.051
1.753TyrLys: 1.753 ± 0.051
3.991TyrLeu: 3.991 ± 0.07
0.956TyrMet: 0.956 ± 0.039
1.466TyrAsn: 1.466 ± 0.045
1.539TyrPro: 1.539 ± 0.05
1.741TyrGln: 1.741 ± 0.045
2.192TyrArg: 2.192 ± 0.06
2.053TyrSer: 2.053 ± 0.059
2.217TyrThr: 2.217 ± 0.062
2.582TyrVal: 2.582 ± 0.062
0.366TyrTrp: 0.366 ± 0.022
1.821TyrTyr: 1.821 ± 0.075
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.001
0.0XaaPhe: 0.0 ± 0.0
0.004XaaGly: 0.004 ± 0.002
0.003XaaHis: 0.003 ± 0.002
0.001XaaIle: 0.001 ± 0.001
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.001XaaGln: 0.001 ± 0.001
0.001XaaArg: 0.001 ± 0.001
0.003XaaSer: 0.003 ± 0.002
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.001
0.021XaaXaa: 0.021 ± 0.007
Statistics based on 2353 proteins (754178 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski