Amino acid dipepetide frequency for Olavius algarvensis Delta 1 endosymbiont

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.321AlaAla: 9.321 ± 0.071
1.107AlaCys: 1.107 ± 0.023
5.173AlaAsp: 5.173 ± 0.053
5.613AlaGlu: 5.613 ± 0.054
3.423AlaPhe: 3.423 ± 0.038
7.583AlaGly: 7.583 ± 0.071
1.645AlaHis: 1.645 ± 0.024
5.352AlaIle: 5.352 ± 0.049
4.045AlaLys: 4.045 ± 0.048
8.488AlaLeu: 8.488 ± 0.067
2.422AlaMet: 2.422 ± 0.029
2.555AlaAsn: 2.555 ± 0.034
2.958AlaPro: 2.958 ± 0.036
2.781AlaGln: 2.781 ± 0.034
4.917AlaArg: 4.917 ± 0.046
4.42AlaSer: 4.42 ± 0.045
3.836AlaThr: 3.836 ± 0.038
6.661AlaVal: 6.661 ± 0.056
0.936AlaTrp: 0.936 ± 0.021
2.372AlaTyr: 2.372 ± 0.031
0.0AlaXaa: 0.0 ± 0.0
Cys
0.921CysAla: 0.921 ± 0.019
0.23CysCys: 0.23 ± 0.012
0.633CysAsp: 0.633 ± 0.015
0.626CysGlu: 0.626 ± 0.017
0.557CysPhe: 0.557 ± 0.015
1.269CysGly: 1.269 ± 0.027
0.403CysHis: 0.403 ± 0.015
0.68CysIle: 0.68 ± 0.014
0.445CysLys: 0.445 ± 0.013
1.142CysLeu: 1.142 ± 0.021
0.279CysMet: 0.279 ± 0.009
0.411CysAsn: 0.411 ± 0.014
0.714CysPro: 0.714 ± 0.019
0.478CysGln: 0.478 ± 0.014
1.184CysArg: 1.184 ± 0.024
0.823CysSer: 0.823 ± 0.022
0.512CysThr: 0.512 ± 0.017
0.675CysVal: 0.675 ± 0.014
0.176CysTrp: 0.176 ± 0.01
0.391CysTyr: 0.391 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
4.182AspAla: 4.182 ± 0.047
0.669AspCys: 0.669 ± 0.015
3.02AspAsp: 3.02 ± 0.044
3.56AspGlu: 3.56 ± 0.041
2.762AspPhe: 2.762 ± 0.035
3.929AspGly: 3.929 ± 0.057
1.196AspHis: 1.196 ± 0.019
4.105AspIle: 4.105 ± 0.037
2.637AspLys: 2.637 ± 0.035
5.931AspLeu: 5.931 ± 0.06
1.35AspMet: 1.35 ± 0.022
1.919AspAsn: 1.919 ± 0.041
3.122AspPro: 3.122 ± 0.038
2.204AspGln: 2.204 ± 0.032
3.637AspArg: 3.637 ± 0.039
2.927AspSer: 2.927 ± 0.036
2.462AspThr: 2.462 ± 0.035
3.396AspVal: 3.396 ± 0.036
0.807AspTrp: 0.807 ± 0.019
1.928AspTyr: 1.928 ± 0.026
0.0AspXaa: 0.0 ± 0.0
Glu
5.098GluAla: 5.098 ± 0.055
0.584GluCys: 0.584 ± 0.015
3.192GluAsp: 3.192 ± 0.041
3.468GluGlu: 3.468 ± 0.045
2.516GluPhe: 2.516 ± 0.03
3.455GluGly: 3.455 ± 0.04
1.239GluHis: 1.239 ± 0.023
4.774GluIle: 4.774 ± 0.045
4.217GluLys: 4.217 ± 0.052
5.8GluLeu: 5.8 ± 0.056
1.761GluMet: 1.761 ± 0.026
2.773GluAsn: 2.773 ± 0.031
2.338GluPro: 2.338 ± 0.027
2.248GluGln: 2.248 ± 0.034
3.211GluArg: 3.211 ± 0.035
3.261GluSer: 3.261 ± 0.037
3.224GluThr: 3.224 ± 0.037
3.854GluVal: 3.854 ± 0.044
0.652GluTrp: 0.652 ± 0.016
1.878GluTyr: 1.878 ± 0.029
0.0GluXaa: 0.0 ± 0.0
Phe
3.314PheAla: 3.314 ± 0.037
0.644PheCys: 0.644 ± 0.016
2.852PheAsp: 2.852 ± 0.034
2.657PheGlu: 2.657 ± 0.033
2.371PhePhe: 2.371 ± 0.038
3.432PheGly: 3.432 ± 0.035
0.926PheHis: 0.926 ± 0.021
2.868PheIle: 2.868 ± 0.037
2.406PheLys: 2.406 ± 0.035
4.162PheLeu: 4.162 ± 0.053
1.152PheMet: 1.152 ± 0.021
1.826PheAsn: 1.826 ± 0.031
1.739PhePro: 1.739 ± 0.024
1.554PheGln: 1.554 ± 0.023
2.416PheArg: 2.416 ± 0.03
3.119PheSer: 3.119 ± 0.039
2.222PheThr: 2.222 ± 0.03
2.802PheVal: 2.802 ± 0.03
0.649PheTrp: 0.649 ± 0.016
1.485PheTyr: 1.485 ± 0.024
0.0PheXaa: 0.0 ± 0.0
Gly
5.579GlyAla: 5.579 ± 0.06
1.171GlyCys: 1.171 ± 0.026
3.578GlyAsp: 3.578 ± 0.044
3.778GlyGlu: 3.778 ± 0.038
3.582GlyPhe: 3.582 ± 0.038
5.489GlyGly: 5.489 ± 0.056
1.57GlyHis: 1.57 ± 0.025
5.608GlyIle: 5.608 ± 0.048
3.973GlyLys: 3.973 ± 0.044
7.428GlyLeu: 7.428 ± 0.06
2.146GlyMet: 2.146 ± 0.029
2.694GlyAsn: 2.694 ± 0.041
2.517GlyPro: 2.517 ± 0.034
2.796GlyGln: 2.796 ± 0.033
4.504GlyArg: 4.504 ± 0.047
4.555GlySer: 4.555 ± 0.049
3.745GlyThr: 3.745 ± 0.052
4.914GlyVal: 4.914 ± 0.047
1.051GlyTrp: 1.051 ± 0.02
2.632GlyTyr: 2.632 ± 0.032
0.0GlyXaa: 0.0 ± 0.0
His
1.521HisAla: 1.521 ± 0.026
0.321HisCys: 0.321 ± 0.01
1.057HisAsp: 1.057 ± 0.018
1.096HisGlu: 1.096 ± 0.022
1.07HisPhe: 1.07 ± 0.021
1.45HisGly: 1.45 ± 0.024
0.58HisHis: 0.58 ± 0.017
1.252HisIle: 1.252 ± 0.02
1.023HisLys: 1.023 ± 0.018
2.231HisLeu: 2.231 ± 0.033
0.458HisMet: 0.458 ± 0.014
0.732HisAsn: 0.732 ± 0.016
1.297HisPro: 1.297 ± 0.021
0.878HisGln: 0.878 ± 0.017
1.367HisArg: 1.367 ± 0.024
1.265HisSer: 1.265 ± 0.024
0.908HisThr: 0.908 ± 0.018
1.118HisVal: 1.118 ± 0.021
0.29HisTrp: 0.29 ± 0.012
0.777HisTyr: 0.777 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
5.882IleAla: 5.882 ± 0.053
0.933IleCys: 0.933 ± 0.02
4.344IleAsp: 4.344 ± 0.04
4.719IleGlu: 4.719 ± 0.045
3.154IlePhe: 3.154 ± 0.041
5.042IleGly: 5.042 ± 0.046
1.412IleHis: 1.412 ± 0.023
4.422IleIle: 4.422 ± 0.045
3.715IleLys: 3.715 ± 0.043
6.688IleLeu: 6.688 ± 0.065
1.626IleMet: 1.626 ± 0.024
2.713IleAsn: 2.713 ± 0.03
3.418IlePro: 3.418 ± 0.036
2.35IleGln: 2.35 ± 0.033
4.013IleArg: 4.013 ± 0.034
4.551IleSer: 4.551 ± 0.045
3.423IleThr: 3.423 ± 0.05
4.414IleVal: 4.414 ± 0.042
0.752IleTrp: 0.752 ± 0.016
2.068IleTyr: 2.068 ± 0.021
0.0IleXaa: 0.0 ± 0.0
Lys
4.375LysAla: 4.375 ± 0.051
0.516LysCys: 0.516 ± 0.016
2.816LysAsp: 2.816 ± 0.033
3.245LysGlu: 3.245 ± 0.044
1.965LysPhe: 1.965 ± 0.026
3.324LysGly: 3.324 ± 0.038
0.964LysHis: 0.964 ± 0.021
4.486LysIle: 4.486 ± 0.053
4.155LysLys: 4.155 ± 0.064
5.155LysLeu: 5.155 ± 0.051
1.562LysMet: 1.562 ± 0.026
2.414LysAsn: 2.414 ± 0.034
2.418LysPro: 2.418 ± 0.032
1.821LysGln: 1.821 ± 0.029
2.991LysArg: 2.991 ± 0.039
3.412LysSer: 3.412 ± 0.041
3.073LysThr: 3.073 ± 0.034
3.369LysVal: 3.369 ± 0.038
0.624LysTrp: 0.624 ± 0.015
1.656LysTyr: 1.656 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
9.36LeuAla: 9.36 ± 0.067
1.142LeuCys: 1.142 ± 0.023
5.543LeuAsp: 5.543 ± 0.051
5.994LeuGlu: 5.994 ± 0.049
4.163LeuPhe: 4.163 ± 0.047
6.902LeuGly: 6.902 ± 0.054
1.731LeuHis: 1.731 ± 0.025
6.532LeuIle: 6.532 ± 0.063
6.227LeuLys: 6.227 ± 0.056
9.185LeuLeu: 9.185 ± 0.084
2.589LeuMet: 2.589 ± 0.032
4.143LeuAsn: 4.143 ± 0.043
4.619LeuPro: 4.619 ± 0.046
3.517LeuGln: 3.517 ± 0.037
5.009LeuArg: 5.009 ± 0.049
6.445LeuSer: 6.445 ± 0.046
5.238LeuThr: 5.238 ± 0.058
6.506LeuVal: 6.506 ± 0.057
1.109LeuTrp: 1.109 ± 0.021
2.726LeuTyr: 2.726 ± 0.034
0.0LeuXaa: 0.0 ± 0.0
Met
2.578MetAla: 2.578 ± 0.033
0.232MetCys: 0.232 ± 0.009
1.447MetAsp: 1.447 ± 0.024
1.477MetGlu: 1.477 ± 0.026
0.84MetPhe: 0.84 ± 0.016
1.951MetGly: 1.951 ± 0.03
0.445MetHis: 0.445 ± 0.011
1.807MetIle: 1.807 ± 0.028
1.631MetLys: 1.631 ± 0.027
2.446MetLeu: 2.446 ± 0.034
0.701MetMet: 0.701 ± 0.017
1.113MetAsn: 1.113 ± 0.018
1.27MetPro: 1.27 ± 0.022
0.989MetGln: 0.989 ± 0.018
1.505MetArg: 1.505 ± 0.023
1.583MetSer: 1.583 ± 0.024
1.571MetThr: 1.571 ± 0.029
1.834MetVal: 1.834 ± 0.023
0.214MetTrp: 0.214 ± 0.01
0.553MetTyr: 0.553 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
2.832AsnAla: 2.832 ± 0.038
0.649AsnCys: 0.649 ± 0.016
1.887AsnAsp: 1.887 ± 0.037
1.888AsnGlu: 1.888 ± 0.026
1.676AsnPhe: 1.676 ± 0.03
2.55AsnGly: 2.55 ± 0.045
0.828AsnHis: 0.828 ± 0.018
2.913AsnIle: 2.913 ± 0.032
1.874AsnLys: 1.874 ± 0.028
4.367AsnLeu: 4.367 ± 0.044
0.943AsnMet: 0.943 ± 0.017
1.348AsnAsn: 1.348 ± 0.026
2.432AsnPro: 2.432 ± 0.028
1.537AsnGln: 1.537 ± 0.022
2.56AsnArg: 2.56 ± 0.035
2.221AsnSer: 2.221 ± 0.036
1.759AsnThr: 1.759 ± 0.03
2.387AsnVal: 2.387 ± 0.033
0.521AsnTrp: 0.521 ± 0.014
1.285AsnTyr: 1.285 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
4.444ProAla: 4.444 ± 0.056
0.47ProCys: 0.47 ± 0.013
3.125ProAsp: 3.125 ± 0.036
3.285ProGlu: 3.285 ± 0.041
1.921ProPhe: 1.921 ± 0.029
3.555ProGly: 3.555 ± 0.037
0.968ProHis: 0.968 ± 0.019
2.573ProIle: 2.573 ± 0.031
1.993ProLys: 1.993 ± 0.031
4.239ProLeu: 4.239 ± 0.046
1.034ProMet: 1.034 ± 0.02
1.548ProAsn: 1.548 ± 0.023
2.053ProPro: 2.053 ± 0.035
1.746ProGln: 1.746 ± 0.026
2.112ProArg: 2.112 ± 0.029
2.426ProSer: 2.426 ± 0.031
2.059ProThr: 2.059 ± 0.03
3.631ProVal: 3.631 ± 0.047
0.605ProTrp: 0.605 ± 0.014
1.368ProTyr: 1.368 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
3.352GlnAla: 3.352 ± 0.038
0.368GlnCys: 0.368 ± 0.012
1.731GlnAsp: 1.731 ± 0.029
1.961GlnGlu: 1.961 ± 0.031
1.402GlnPhe: 1.402 ± 0.023
2.429GlnGly: 2.429 ± 0.033
0.727GlnHis: 0.727 ± 0.019
2.656GlnIle: 2.656 ± 0.033
2.414GlnLys: 2.414 ± 0.036
3.269GlnLeu: 3.269 ± 0.038
0.996GlnMet: 0.996 ± 0.02
1.763GlnAsn: 1.763 ± 0.029
1.626GlnPro: 1.626 ± 0.026
1.555GlnGln: 1.555 ± 0.026
2.344GlnArg: 2.344 ± 0.034
2.224GlnSer: 2.224 ± 0.028
2.036GlnThr: 2.036 ± 0.027
2.475GlnVal: 2.475 ± 0.035
0.507GlnTrp: 0.507 ± 0.012
1.103GlnTyr: 1.103 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
4.208ArgAla: 4.208 ± 0.039
0.776ArgCys: 0.776 ± 0.019
2.842ArgAsp: 2.842 ± 0.032
3.437ArgGlu: 3.437 ± 0.036
2.975ArgPhe: 2.975 ± 0.037
3.51ArgGly: 3.51 ± 0.042
1.442ArgHis: 1.442 ± 0.023
4.434ArgIle: 4.434 ± 0.044
3.278ArgLys: 3.278 ± 0.04
6.119ArgLeu: 6.119 ± 0.053
1.589ArgMet: 1.589 ± 0.024
2.268ArgAsn: 2.268 ± 0.03
2.511ArgPro: 2.511 ± 0.036
2.741ArgGln: 2.741 ± 0.03
3.956ArgArg: 3.956 ± 0.047
3.56ArgSer: 3.56 ± 0.039
2.599ArgThr: 2.599 ± 0.032
3.688ArgVal: 3.688 ± 0.038
0.743ArgTrp: 0.743 ± 0.016
2.017ArgTyr: 2.017 ± 0.03
0.0ArgXaa: 0.0 ± 0.0
Ser
4.955SerAla: 4.955 ± 0.044
0.739SerCys: 0.739 ± 0.02
3.326SerAsp: 3.326 ± 0.044
3.455SerGlu: 3.455 ± 0.041
2.795SerPhe: 2.795 ± 0.03
5.321SerGly: 5.321 ± 0.048
1.288SerHis: 1.288 ± 0.018
4.335SerIle: 4.335 ± 0.042
2.877SerLys: 2.877 ± 0.037
5.947SerLeu: 5.947 ± 0.053
1.637SerMet: 1.637 ± 0.025
2.218SerAsn: 2.218 ± 0.027
2.665SerPro: 2.665 ± 0.037
2.167SerGln: 2.167 ± 0.027
3.614SerArg: 3.614 ± 0.039
3.802SerSer: 3.802 ± 0.045
2.834SerThr: 2.834 ± 0.031
3.771SerVal: 3.771 ± 0.038
0.742SerTrp: 0.742 ± 0.015
1.801SerTyr: 1.801 ± 0.026
0.0SerXaa: 0.0 ± 0.0
Thr
4.691ThrAla: 4.691 ± 0.047
0.552ThrCys: 0.552 ± 0.013
2.78ThrAsp: 2.78 ± 0.043
2.802ThrGlu: 2.802 ± 0.033
2.138ThrPhe: 2.138 ± 0.028
4.44ThrGly: 4.44 ± 0.045
1.037ThrHis: 1.037 ± 0.018
3.559ThrIle: 3.559 ± 0.042
1.967ThrLys: 1.967 ± 0.029
4.944ThrLeu: 4.944 ± 0.056
1.151ThrMet: 1.151 ± 0.021
1.669ThrAsn: 1.669 ± 0.028
2.546ThrPro: 2.546 ± 0.04
1.454ThrGln: 1.454 ± 0.02
2.749ThrArg: 2.749 ± 0.032
2.728ThrSer: 2.728 ± 0.03
2.591ThrThr: 2.591 ± 0.038
3.758ThrVal: 3.758 ± 0.06
0.576ThrTrp: 0.576 ± 0.016
1.472ThrTyr: 1.472 ± 0.028
0.0ThrXaa: 0.0 ± 0.0
Val
5.888ValAla: 5.888 ± 0.062
0.861ValCys: 0.861 ± 0.02
3.906ValAsp: 3.906 ± 0.043
4.076ValGlu: 4.076 ± 0.041
3.117ValPhe: 3.117 ± 0.03
4.574ValGly: 4.574 ± 0.043
1.21ValHis: 1.21 ± 0.021
4.591ValIle: 4.591 ± 0.044
3.538ValLys: 3.538 ± 0.04
6.525ValLeu: 6.525 ± 0.058
1.76ValMet: 1.76 ± 0.028
2.708ValAsn: 2.708 ± 0.04
2.853ValPro: 2.853 ± 0.041
2.143ValGln: 2.143 ± 0.031
3.654ValArg: 3.654 ± 0.038
4.195ValSer: 4.195 ± 0.041
3.45ValThr: 3.45 ± 0.044
4.852ValVal: 4.852 ± 0.054
0.708ValTrp: 0.708 ± 0.018
1.978ValTyr: 1.978 ± 0.026
0.0ValXaa: 0.0 ± 0.0
Trp
0.87TrpAla: 0.87 ± 0.018
0.137TrpCys: 0.137 ± 0.007
0.67TrpAsp: 0.67 ± 0.017
0.716TrpGlu: 0.716 ± 0.015
0.524TrpPhe: 0.524 ± 0.014
0.901TrpGly: 0.901 ± 0.019
0.304TrpHis: 0.304 ± 0.011
0.856TrpIle: 0.856 ± 0.018
0.592TrpLys: 0.592 ± 0.015
1.261TrpLeu: 1.261 ± 0.022
0.341TrpMet: 0.341 ± 0.012
0.543TrpAsn: 0.543 ± 0.017
0.528TrpPro: 0.528 ± 0.013
0.698TrpGln: 0.698 ± 0.016
0.714TrpArg: 0.714 ± 0.016
0.741TrpSer: 0.741 ± 0.016
0.621TrpThr: 0.621 ± 0.014
0.799TrpVal: 0.799 ± 0.017
0.196TrpTrp: 0.196 ± 0.009
0.368TrpTyr: 0.368 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.238TyrAla: 2.238 ± 0.031
0.448TyrCys: 0.448 ± 0.013
1.82TyrAsp: 1.82 ± 0.038
1.693TyrGlu: 1.693 ± 0.027
1.645TyrPhe: 1.645 ± 0.028
2.205TyrGly: 2.205 ± 0.034
0.777TyrHis: 0.777 ± 0.017
1.835TyrIle: 1.835 ± 0.026
1.331TyrLys: 1.331 ± 0.026
3.353TyrLeu: 3.353 ± 0.036
0.691TyrMet: 0.691 ± 0.018
1.099TyrAsn: 1.099 ± 0.021
1.531TyrPro: 1.531 ± 0.027
1.28TyrGln: 1.28 ± 0.024
2.235TyrArg: 2.235 ± 0.031
2.031TyrSer: 2.031 ± 0.029
1.439TyrThr: 1.439 ± 0.028
1.686TyrVal: 1.686 ± 0.026
0.522TyrTrp: 0.522 ± 0.016
1.148TyrTyr: 1.148 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12909 proteins (2800634 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski