Amino acid dipepetide frequency for Marinobacterium sp. AK27

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.241AlaAla: 10.241 ± 0.114
1.226AlaCys: 1.226 ± 0.035
5.663AlaAsp: 5.663 ± 0.066
6.56AlaGlu: 6.56 ± 0.079
3.653AlaPhe: 3.653 ± 0.059
7.908AlaGly: 7.908 ± 0.098
2.068AlaHis: 2.068 ± 0.041
5.572AlaIle: 5.572 ± 0.067
3.323AlaLys: 3.323 ± 0.052
12.818AlaLeu: 12.818 ± 0.117
2.881AlaMet: 2.881 ± 0.049
2.675AlaAsn: 2.675 ± 0.044
3.746AlaPro: 3.746 ± 0.063
4.401AlaGln: 4.401 ± 0.062
6.153AlaArg: 6.153 ± 0.08
5.715AlaSer: 5.715 ± 0.074
4.361AlaThr: 4.361 ± 0.063
7.001AlaVal: 7.001 ± 0.088
1.166AlaTrp: 1.166 ± 0.03
2.412AlaTyr: 2.412 ± 0.041
0.0AlaXaa: 0.0 ± 0.0
Cys
1.059CysAla: 1.059 ± 0.033
0.172CysCys: 0.172 ± 0.012
0.667CysAsp: 0.667 ± 0.022
0.683CysGlu: 0.683 ± 0.026
0.427CysPhe: 0.427 ± 0.018
1.007CysGly: 1.007 ± 0.031
0.317CysHis: 0.317 ± 0.018
0.533CysIle: 0.533 ± 0.021
0.308CysLys: 0.308 ± 0.015
0.996CysLeu: 0.996 ± 0.029
0.238CysMet: 0.238 ± 0.014
0.324CysAsn: 0.324 ± 0.017
0.539CysPro: 0.539 ± 0.021
0.363CysGln: 0.363 ± 0.016
0.686CysArg: 0.686 ± 0.023
0.7CysSer: 0.7 ± 0.02
0.556CysThr: 0.556 ± 0.021
0.738CysVal: 0.738 ± 0.023
0.163CysTrp: 0.163 ± 0.012
0.293CysTyr: 0.293 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
5.572AspAla: 5.572 ± 0.071
0.509AspCys: 0.509 ± 0.021
3.184AspAsp: 3.184 ± 0.062
4.005AspGlu: 4.005 ± 0.059
1.944AspPhe: 1.944 ± 0.037
4.136AspGly: 4.136 ± 0.058
1.211AspHis: 1.211 ± 0.032
3.365AspIle: 3.365 ± 0.051
2.119AspLys: 2.119 ± 0.045
6.092AspLeu: 6.092 ± 0.082
1.45AspMet: 1.45 ± 0.033
1.829AspAsn: 1.829 ± 0.038
2.821AspPro: 2.821 ± 0.045
2.401AspGln: 2.401 ± 0.04
3.45AspArg: 3.45 ± 0.049
3.251AspSer: 3.251 ± 0.052
2.974AspThr: 2.974 ± 0.045
3.469AspVal: 3.469 ± 0.051
0.857AspTrp: 0.857 ± 0.027
1.737AspTyr: 1.737 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
6.517GluAla: 6.517 ± 0.08
0.542GluCys: 0.542 ± 0.02
2.745GluAsp: 2.745 ± 0.052
3.792GluGlu: 3.792 ± 0.066
2.037GluPhe: 2.037 ± 0.041
4.339GluGly: 4.339 ± 0.061
1.717GluHis: 1.717 ± 0.043
3.592GluIle: 3.592 ± 0.058
2.572GluLys: 2.572 ± 0.054
7.698GluLeu: 7.698 ± 0.088
1.767GluMet: 1.767 ± 0.039
1.804GluAsn: 1.804 ± 0.031
2.67GluPro: 2.67 ± 0.045
4.186GluGln: 4.186 ± 0.066
5.004GluArg: 5.004 ± 0.078
3.688GluSer: 3.688 ± 0.053
3.014GluThr: 3.014 ± 0.046
4.586GluVal: 4.586 ± 0.059
0.77GluTrp: 0.77 ± 0.028
1.555GluTyr: 1.555 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
3.445PheAla: 3.445 ± 0.06
0.477PheCys: 0.477 ± 0.019
2.607PheAsp: 2.607 ± 0.047
2.41PheGlu: 2.41 ± 0.042
1.428PhePhe: 1.428 ± 0.04
3.261PheGly: 3.261 ± 0.059
0.759PheHis: 0.759 ± 0.021
2.066PheIle: 2.066 ± 0.043
1.282PheLys: 1.282 ± 0.031
3.165PheLeu: 3.165 ± 0.061
0.914PheMet: 0.914 ± 0.024
1.437PheAsn: 1.437 ± 0.032
1.418PhePro: 1.418 ± 0.032
1.12PheGln: 1.12 ± 0.03
1.89PheArg: 1.89 ± 0.037
2.721PheSer: 2.721 ± 0.046
1.947PheThr: 1.947 ± 0.038
2.52PheVal: 2.52 ± 0.05
0.516PheTrp: 0.516 ± 0.024
1.111PheTyr: 1.111 ± 0.036
0.0PheXaa: 0.0 ± 0.0
Gly
6.825GlyAla: 6.825 ± 0.081
1.049GlyCys: 1.049 ± 0.031
4.139GlyAsp: 4.139 ± 0.073
4.944GlyGlu: 4.944 ± 0.063
3.331GlyPhe: 3.331 ± 0.059
5.838GlyGly: 5.838 ± 0.08
1.761GlyHis: 1.761 ± 0.042
4.733GlyIle: 4.733 ± 0.068
3.095GlyLys: 3.095 ± 0.056
8.335GlyLeu: 8.335 ± 0.105
2.321GlyMet: 2.321 ± 0.044
2.249GlyAsn: 2.249 ± 0.049
2.327GlyPro: 2.327 ± 0.041
3.08GlyGln: 3.08 ± 0.047
4.669GlyArg: 4.669 ± 0.058
4.39GlySer: 4.39 ± 0.066
3.649GlyThr: 3.649 ± 0.054
6.076GlyVal: 6.076 ± 0.075
1.173GlyTrp: 1.173 ± 0.035
2.636GlyTyr: 2.636 ± 0.051
0.0GlyXaa: 0.0 ± 0.0
His
1.98HisAla: 1.98 ± 0.037
0.334HisCys: 0.334 ± 0.016
1.186HisAsp: 1.186 ± 0.029
1.334HisGlu: 1.334 ± 0.032
0.979HisPhe: 0.979 ± 0.023
1.602HisGly: 1.602 ± 0.036
0.655HisHis: 0.655 ± 0.023
1.192HisIle: 1.192 ± 0.028
0.708HisLys: 0.708 ± 0.02
2.535HisLeu: 2.535 ± 0.045
0.563HisMet: 0.563 ± 0.02
0.704HisAsn: 0.704 ± 0.021
1.375HisPro: 1.375 ± 0.036
1.068HisGln: 1.068 ± 0.029
1.48HisArg: 1.48 ± 0.038
1.264HisSer: 1.264 ± 0.033
1.081HisThr: 1.081 ± 0.027
1.217HisVal: 1.217 ± 0.029
0.406HisTrp: 0.406 ± 0.019
0.822HisTyr: 0.822 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
6.175IleAla: 6.175 ± 0.068
0.698IleCys: 0.698 ± 0.024
4.056IleAsp: 4.056 ± 0.055
4.363IleGlu: 4.363 ± 0.058
1.658IlePhe: 1.658 ± 0.039
4.701IleGly: 4.701 ± 0.062
1.184IleHis: 1.184 ± 0.028
2.569IleIle: 2.569 ± 0.044
2.084IleLys: 2.084 ± 0.04
4.903IleLeu: 4.903 ± 0.064
1.115IleMet: 1.115 ± 0.029
2.153IleAsn: 2.153 ± 0.04
2.62IlePro: 2.62 ± 0.04
1.803IleGln: 1.803 ± 0.034
3.395IleArg: 3.395 ± 0.052
3.617IleSer: 3.617 ± 0.063
2.928IleThr: 2.928 ± 0.047
3.563IleVal: 3.563 ± 0.052
0.599IleTrp: 0.599 ± 0.026
1.39IleTyr: 1.39 ± 0.035
0.0IleXaa: 0.0 ± 0.0
Lys
3.944LysAla: 3.944 ± 0.06
0.221LysCys: 0.221 ± 0.015
1.679LysAsp: 1.679 ± 0.034
2.177LysGlu: 2.177 ± 0.043
0.875LysPhe: 0.875 ± 0.029
2.774LysGly: 2.774 ± 0.047
0.797LysHis: 0.797 ± 0.026
1.749LysIle: 1.749 ± 0.038
1.452LysLys: 1.452 ± 0.045
3.92LysLeu: 3.92 ± 0.06
0.887LysMet: 0.887 ± 0.028
1.021LysAsn: 1.021 ± 0.03
2.116LysPro: 2.116 ± 0.048
1.74LysGln: 1.74 ± 0.04
2.694LysArg: 2.694 ± 0.048
2.102LysSer: 2.102 ± 0.04
1.853LysThr: 1.853 ± 0.039
2.691LysVal: 2.691 ± 0.047
0.361LysTrp: 0.361 ± 0.015
0.773LysTyr: 0.773 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
11.795LeuAla: 11.795 ± 0.112
1.235LeuCys: 1.235 ± 0.028
6.363LeuAsp: 6.363 ± 0.077
7.087LeuGlu: 7.087 ± 0.089
4.289LeuPhe: 4.289 ± 0.075
8.284LeuGly: 8.284 ± 0.097
2.358LeuHis: 2.358 ± 0.05
6.542LeuIle: 6.542 ± 0.088
4.635LeuLys: 4.635 ± 0.061
12.48LeuLeu: 12.48 ± 0.133
3.073LeuMet: 3.073 ± 0.051
3.955LeuAsn: 3.955 ± 0.058
5.585LeuPro: 5.585 ± 0.068
3.932LeuGln: 3.932 ± 0.068
6.343LeuArg: 6.343 ± 0.077
8.102LeuSer: 8.102 ± 0.082
5.744LeuThr: 5.744 ± 0.065
7.624LeuVal: 7.624 ± 0.09
1.264LeuTrp: 1.264 ± 0.032
2.72LeuTyr: 2.72 ± 0.052
0.0LeuXaa: 0.0 ± 0.0
Met
2.808MetAla: 2.808 ± 0.049
0.185MetCys: 0.185 ± 0.012
1.269MetAsp: 1.269 ± 0.03
1.314MetGlu: 1.314 ± 0.031
0.833MetPhe: 0.833 ± 0.03
1.864MetGly: 1.864 ± 0.047
0.553MetHis: 0.553 ± 0.022
1.45MetIle: 1.45 ± 0.039
1.183MetLys: 1.183 ± 0.032
3.103MetLeu: 3.103 ± 0.052
0.761MetMet: 0.761 ± 0.027
0.966MetAsn: 0.966 ± 0.029
1.431MetPro: 1.431 ± 0.031
1.104MetGln: 1.104 ± 0.027
1.593MetArg: 1.593 ± 0.038
1.867MetSer: 1.867 ± 0.039
1.524MetThr: 1.524 ± 0.037
1.74MetVal: 1.74 ± 0.04
0.183MetTrp: 0.183 ± 0.012
0.402MetTyr: 0.402 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.208AsnAla: 3.208 ± 0.052
0.368AsnCys: 0.368 ± 0.016
1.724AsnAsp: 1.724 ± 0.039
1.74AsnGlu: 1.74 ± 0.034
1.013AsnPhe: 1.013 ± 0.031
2.552AsnGly: 2.552 ± 0.043
0.634AsnHis: 0.634 ± 0.021
1.775AsnIle: 1.775 ± 0.037
1.079AsnLys: 1.079 ± 0.029
3.407AsnLeu: 3.407 ± 0.053
0.733AsnMet: 0.733 ± 0.024
0.979AsnAsn: 0.979 ± 0.03
1.943AsnPro: 1.943 ± 0.038
1.233AsnGln: 1.233 ± 0.033
2.205AsnArg: 2.205 ± 0.047
1.769AsnSer: 1.769 ± 0.038
1.685AsnThr: 1.685 ± 0.037
1.889AsnVal: 1.889 ± 0.042
0.473AsnTrp: 0.473 ± 0.019
0.855AsnTyr: 0.855 ± 0.029
0.0AsnXaa: 0.0 ± 0.0
Pro
4.369ProAla: 4.369 ± 0.068
0.363ProCys: 0.363 ± 0.016
3.176ProAsp: 3.176 ± 0.056
3.875ProGlu: 3.875 ± 0.046
1.741ProPhe: 1.741 ± 0.034
3.588ProGly: 3.588 ± 0.055
0.96ProHis: 0.96 ± 0.026
2.258ProIle: 2.258 ± 0.05
1.528ProLys: 1.528 ± 0.031
4.884ProLeu: 4.884 ± 0.069
1.167ProMet: 1.167 ± 0.03
1.362ProAsn: 1.362 ± 0.035
1.688ProPro: 1.688 ± 0.041
1.725ProGln: 1.725 ± 0.04
2.13ProArg: 2.13 ± 0.043
2.607ProSer: 2.607 ± 0.043
2.149ProThr: 2.149 ± 0.039
3.82ProVal: 3.82 ± 0.055
0.647ProTrp: 0.647 ± 0.02
1.23ProTyr: 1.23 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
4.564GlnAla: 4.564 ± 0.073
0.385GlnCys: 0.385 ± 0.019
1.654GlnAsp: 1.654 ± 0.039
2.145GlnGlu: 2.145 ± 0.038
1.386GlnPhe: 1.386 ± 0.031
3.099GlnGly: 3.099 ± 0.048
1.004GlnHis: 1.004 ± 0.026
2.309GlnIle: 2.309 ± 0.042
1.337GlnLys: 1.337 ± 0.035
5.255GlnLeu: 5.255 ± 0.08
1.168GlnMet: 1.168 ± 0.031
1.148GlnAsn: 1.148 ± 0.029
1.928GlnPro: 1.928 ± 0.041
2.495GlnGln: 2.495 ± 0.055
3.096GlnArg: 3.096 ± 0.055
2.536GlnSer: 2.536 ± 0.048
1.87GlnThr: 1.87 ± 0.038
3.073GlnVal: 3.073 ± 0.054
0.624GlnTrp: 0.624 ± 0.023
1.089GlnTyr: 1.089 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
5.667ArgAla: 5.667 ± 0.064
0.643ArgCys: 0.643 ± 0.026
3.445ArgAsp: 3.445 ± 0.054
4.413ArgGlu: 4.413 ± 0.077
2.83ArgPhe: 2.83 ± 0.044
3.719ArgGly: 3.719 ± 0.047
1.631ArgHis: 1.631 ± 0.038
3.898ArgIle: 3.898 ± 0.058
2.272ArgLys: 2.272 ± 0.044
7.455ArgLeu: 7.455 ± 0.089
1.704ArgMet: 1.704 ± 0.035
1.995ArgAsn: 1.995 ± 0.038
2.388ArgPro: 2.388 ± 0.043
2.919ArgGln: 2.919 ± 0.048
4.128ArgArg: 4.128 ± 0.065
3.497ArgSer: 3.497 ± 0.057
2.807ArgThr: 2.807 ± 0.056
4.452ArgVal: 4.452 ± 0.058
0.96ArgTrp: 0.96 ± 0.027
2.207ArgTyr: 2.207 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
6.35SerAla: 6.35 ± 0.068
0.624SerCys: 0.624 ± 0.023
3.597SerAsp: 3.597 ± 0.058
3.814SerGlu: 3.814 ± 0.066
2.289SerPhe: 2.289 ± 0.044
5.557SerGly: 5.557 ± 0.071
1.306SerHis: 1.306 ± 0.033
3.32SerIle: 3.32 ± 0.057
1.877SerLys: 1.877 ± 0.039
6.764SerLeu: 6.764 ± 0.068
1.584SerMet: 1.584 ± 0.038
1.838SerAsn: 1.838 ± 0.043
2.711SerPro: 2.711 ± 0.055
2.301SerGln: 2.301 ± 0.044
3.924SerArg: 3.924 ± 0.061
3.709SerSer: 3.709 ± 0.06
2.982SerThr: 2.982 ± 0.047
4.45SerVal: 4.45 ± 0.062
0.822SerTrp: 0.822 ± 0.024
1.535SerTyr: 1.535 ± 0.037
0.0SerXaa: 0.0 ± 0.0
Thr
4.757ThrAla: 4.757 ± 0.065
0.487ThrCys: 0.487 ± 0.021
2.809ThrAsp: 2.809 ± 0.043
3.03ThrGlu: 3.03 ± 0.047
1.768ThrPhe: 1.768 ± 0.038
4.347ThrGly: 4.347 ± 0.059
1.149ThrHis: 1.149 ± 0.032
2.387ThrIle: 2.387 ± 0.047
1.186ThrLys: 1.186 ± 0.031
7.058ThrLeu: 7.058 ± 0.075
0.882ThrMet: 0.882 ± 0.026
1.279ThrAsn: 1.279 ± 0.034
3.014ThrPro: 3.014 ± 0.049
1.902ThrGln: 1.902 ± 0.044
2.988ThrArg: 2.988 ± 0.062
2.685ThrSer: 2.685 ± 0.048
2.495ThrThr: 2.495 ± 0.05
3.296ThrVal: 3.296 ± 0.057
0.559ThrTrp: 0.559 ± 0.022
1.148ThrTyr: 1.148 ± 0.033
0.0ThrXaa: 0.0 ± 0.0
Val
6.987ValAla: 6.987 ± 0.078
0.769ValCys: 0.769 ± 0.022
4.268ValAsp: 4.268 ± 0.054
4.856ValGlu: 4.856 ± 0.059
2.411ValPhe: 2.411 ± 0.044
5.06ValGly: 5.06 ± 0.072
1.423ValHis: 1.423 ± 0.037
4.216ValIle: 4.216 ± 0.056
2.502ValLys: 2.502 ± 0.043
7.547ValLeu: 7.547 ± 0.097
1.966ValMet: 1.966 ± 0.044
2.304ValAsn: 2.304 ± 0.036
3.019ValPro: 3.019 ± 0.049
2.396ValGln: 2.396 ± 0.046
4.203ValArg: 4.203 ± 0.059
4.679ValSer: 4.679 ± 0.058
3.648ValThr: 3.648 ± 0.053
5.375ValVal: 5.375 ± 0.074
0.788ValTrp: 0.788 ± 0.024
1.703ValTyr: 1.703 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.966TrpAla: 0.966 ± 0.031
0.158TrpCys: 0.158 ± 0.011
0.619TrpAsp: 0.619 ± 0.021
0.542TrpGlu: 0.542 ± 0.019
0.543TrpPhe: 0.543 ± 0.022
0.859TrpGly: 0.859 ± 0.029
0.38TrpHis: 0.38 ± 0.017
0.763TrpIle: 0.763 ± 0.026
0.435TrpLys: 0.435 ± 0.019
1.853TrpLeu: 1.853 ± 0.054
0.395TrpMet: 0.395 ± 0.018
0.432TrpAsn: 0.432 ± 0.018
0.567TrpPro: 0.567 ± 0.025
0.757TrpGln: 0.757 ± 0.029
0.862TrpArg: 0.862 ± 0.026
0.793TrpSer: 0.793 ± 0.026
0.576TrpThr: 0.576 ± 0.025
0.932TrpVal: 0.932 ± 0.025
0.183TrpTrp: 0.183 ± 0.012
0.325TrpTyr: 0.325 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.314TyrAla: 2.314 ± 0.04
0.325TyrCys: 0.325 ± 0.014
1.527TyrAsp: 1.527 ± 0.037
1.509TyrGlu: 1.509 ± 0.039
1.059TyrPhe: 1.059 ± 0.031
2.125TyrGly: 2.125 ± 0.044
0.616TyrHis: 0.616 ± 0.022
1.281TyrIle: 1.281 ± 0.027
0.851TyrLys: 0.851 ± 0.032
3.143TyrLeu: 3.143 ± 0.046
0.563TyrMet: 0.563 ± 0.019
0.809TyrAsn: 0.809 ± 0.026
1.365TyrPro: 1.365 ± 0.031
1.241TyrGln: 1.241 ± 0.032
2.136TyrArg: 2.136 ± 0.048
1.664TyrSer: 1.664 ± 0.036
1.385TyrThr: 1.385 ± 0.036
1.609TyrVal: 1.609 ± 0.035
0.424TyrTrp: 0.424 ± 0.019
0.865TyrTyr: 0.865 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4141 proteins (1346198 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski