Amino acid dipeptide frequency for Methylobrevis pamukkalensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
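As an illustration, dipeptide frequencies could be computed along the following lines. This is a minimal sketch, not the database's own pipeline: it assumes dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries), that sequences use one-letter codes, and that any non-standard residue is mapped to X (Xaa). The function name `dipeptide_per_mille` is hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: two short made-up sequences, the second with a non-standard residue (U).
freqs = dipeptide_per_mille(["MAAL", "MAUK"])
```

With real data, the input would be every protein sequence of the proteome, and the one-letter pairs would be translated to the three-letter labels used in the table.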

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random with equal probability, each one would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
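The uniform baseline can be checked with a few lines of arithmetic. The sketch below compares a handful of values copied from the table with the 2.5‰ expectation; the plain greater-than/less-than rule is an assumption, since the exact threshold behind the red and blue marking is not stated here.

```python
# Uniform expectation: each of the 400 dipeptides equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

# A few observed values copied from the table (per mille).
observed = {"AlaAla": 21.176, "LeuAla": 15.34, "CysCys": 0.18, "TrpTrp": 0.242}

def classify(value, expected=EXPECTED_PER_MILLE):
    """Assumed rule: simple comparison against the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

labels = {pair: classify(v) for pair, v in observed.items()}
# Fold difference relative to the uniform expectation.
fold = {pair: v / EXPECTED_PER_MILLE for pair, v in observed.items()}
```

Under this baseline AlaAla is about 8.5 times more frequent than expected, while CysCys is more than ten times rarer.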

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
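For example, the unit conversion for a single table entry could look like this (a small sketch; the helper names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla from the table: 21.176 per mille.
ala_ala_fraction = per_mille_to_fraction(21.176)
ala_ala_percent = per_mille_to_percent(21.176)
```

So the AlaAla value of 21.176‰ corresponds to a fraction of about 0.0212, or roughly 2.12% of all dipeptides.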

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.176 ± 0.203
AlaCys: 1.238 ± 0.033
AlaAsp: 8.297 ± 0.08
AlaGlu: 8.223 ± 0.094
AlaPhe: 4.649 ± 0.068
AlaGly: 13.291 ± 0.125
AlaHis: 2.268 ± 0.052
AlaIle: 6.465 ± 0.077
AlaLys: 3.011 ± 0.063
AlaLeu: 13.502 ± 0.132
AlaMet: 3.689 ± 0.058
AlaAsn: 2.422 ± 0.045
AlaPro: 6.695 ± 0.099
AlaGln: 2.998 ± 0.054
AlaArg: 11.046 ± 0.124
AlaSer: 6.92 ± 0.084
AlaThr: 6.816 ± 0.07
AlaVal: 9.994 ± 0.097
AlaTrp: 1.457 ± 0.034
AlaTyr: 2.285 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.041 ± 0.027
CysCys: 0.18 ± 0.014
CysAsp: 0.522 ± 0.023
CysGlu: 0.391 ± 0.019
CysPhe: 0.278 ± 0.016
CysGly: 0.933 ± 0.027
CysHis: 0.229 ± 0.013
CysIle: 0.285 ± 0.014
CysLys: 0.153 ± 0.012
CysLeu: 0.773 ± 0.029
CysMet: 0.185 ± 0.014
CysAsn: 0.173 ± 0.012
CysPro: 0.522 ± 0.02
CysGln: 0.18 ± 0.01
CysArg: 0.989 ± 0.032
CysSer: 0.578 ± 0.023
CysThr: 0.395 ± 0.018
CysVal: 0.566 ± 0.021
CysTrp: 0.166 ± 0.012
CysTyr: 0.166 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.668 ± 0.09
AspCys: 0.491 ± 0.019
AspAsp: 3.751 ± 0.074
AspGlu: 3.542 ± 0.058
AspPhe: 2.237 ± 0.045
AspGly: 6.074 ± 0.083
AspHis: 1.407 ± 0.036
AspIle: 2.945 ± 0.057
AspLys: 1.324 ± 0.036
AspLeu: 7.081 ± 0.088
AspMet: 1.325 ± 0.035
AspAsn: 1.106 ± 0.034
AspPro: 4.024 ± 0.059
AspGln: 1.373 ± 0.038
AspArg: 5.031 ± 0.075
AspSer: 1.939 ± 0.041
AspThr: 2.796 ± 0.06
AspVal: 4.639 ± 0.063
AspTrp: 0.894 ± 0.029
AspTyr: 1.349 ± 0.033
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.354 ± 0.105
GluCys: 0.298 ± 0.015
GluAsp: 2.951 ± 0.061
GluGlu: 3.009 ± 0.057
GluPhe: 1.566 ± 0.037
GluGly: 4.594 ± 0.066
GluHis: 1.024 ± 0.034
GluIle: 3.681 ± 0.062
GluLys: 1.752 ± 0.045
GluLeu: 4.562 ± 0.073
GluMet: 1.466 ± 0.034
GluAsn: 1.171 ± 0.033
GluPro: 2.645 ± 0.045
GluGln: 1.573 ± 0.045
GluArg: 4.758 ± 0.077
GluSer: 2.073 ± 0.037
GluThr: 3.903 ± 0.071
GluVal: 4.189 ± 0.067
GluTrp: 0.554 ± 0.024
GluTyr: 0.735 ± 0.027
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.698 ± 0.069
PheCys: 0.352 ± 0.017
PheAsp: 2.661 ± 0.055
PheGlu: 2.037 ± 0.037
PhePhe: 1.268 ± 0.037
PheGly: 3.635 ± 0.063
PheHis: 0.746 ± 0.027
PheIle: 1.32 ± 0.036
PheLys: 0.749 ± 0.026
PheLeu: 3.291 ± 0.063
PheMet: 0.717 ± 0.025
PheAsn: 0.84 ± 0.027
PhePro: 1.546 ± 0.039
PheGln: 0.867 ± 0.029
PheArg: 2.502 ± 0.049
PheSer: 2.06 ± 0.045
PheThr: 1.831 ± 0.037
PheVal: 2.941 ± 0.051
PheTrp: 0.475 ± 0.019
PheTyr: 0.764 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.449 ± 0.103
GlyCys: 0.87 ± 0.028
GlyAsp: 5.247 ± 0.082
GlyGlu: 5.28 ± 0.074
GlyPhe: 3.595 ± 0.059
GlyGly: 8.308 ± 0.118
GlyHis: 2.067 ± 0.051
GlyIle: 4.483 ± 0.063
GlyLys: 2.633 ± 0.054
GlyLeu: 9.641 ± 0.102
GlyMet: 2.234 ± 0.043
GlyAsn: 2.149 ± 0.061
GlyPro: 4.071 ± 0.062
GlyGln: 2.379 ± 0.052
GlyArg: 7.823 ± 0.098
GlySer: 4.764 ± 0.066
GlyThr: 5.091 ± 0.071
GlyVal: 6.457 ± 0.078
GlyTrp: 1.413 ± 0.037
GlyTyr: 2.131 ± 0.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.407 ± 0.047
HisCys: 0.194 ± 0.012
HisAsp: 1.383 ± 0.033
HisGlu: 0.988 ± 0.031
HisPhe: 0.754 ± 0.027
HisGly: 2.158 ± 0.052
HisHis: 0.605 ± 0.026
HisIle: 0.759 ± 0.024
HisLys: 0.358 ± 0.019
HisLeu: 2.139 ± 0.041
HisMet: 0.457 ± 0.02
HisAsn: 0.343 ± 0.016
HisPro: 1.416 ± 0.034
HisGln: 0.484 ± 0.022
HisArg: 1.546 ± 0.042
HisSer: 0.741 ± 0.025
HisThr: 0.735 ± 0.028
HisVal: 1.674 ± 0.039
HisTrp: 0.258 ± 0.014
HisTyr: 0.448 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.391 ± 0.088
IleCys: 0.445 ± 0.02
IleAsp: 3.77 ± 0.057
IleGlu: 3.416 ± 0.052
IlePhe: 1.529 ± 0.039
IleGly: 4.967 ± 0.074
IleHis: 0.882 ± 0.027
IleIle: 1.669 ± 0.051
IleLys: 0.95 ± 0.031
IleLeu: 4.209 ± 0.066
IleMet: 0.807 ± 0.026
IleAsn: 1.055 ± 0.033
IlePro: 2.113 ± 0.049
IleGln: 0.936 ± 0.028
IleArg: 3.4 ± 0.056
IleSer: 2.465 ± 0.047
IleThr: 2.368 ± 0.043
IleVal: 4.205 ± 0.059
IleTrp: 0.561 ± 0.021
IleTyr: 0.91 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.683 ± 0.064
LysCys: 0.128 ± 0.012
LysAsp: 1.388 ± 0.042
LysGlu: 1.076 ± 0.031
LysPhe: 0.698 ± 0.02
LysGly: 2.167 ± 0.052
LysHis: 0.395 ± 0.019
LysIle: 1.413 ± 0.035
LysLys: 0.829 ± 0.03
LysLeu: 2.414 ± 0.048
LysMet: 0.597 ± 0.023
LysAsn: 0.55 ± 0.023
LysPro: 1.556 ± 0.036
LysGln: 0.563 ± 0.023
LysArg: 1.847 ± 0.041
LysSer: 1.337 ± 0.037
LysThr: 1.622 ± 0.039
LysVal: 2.23 ± 0.044
LysTrp: 0.283 ± 0.015
LysTyr: 0.414 ± 0.02
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.34 ± 0.151
LeuCys: 0.828 ± 0.025
LeuAsp: 6.491 ± 0.087
LeuGlu: 4.798 ± 0.068
LeuPhe: 3.398 ± 0.063
LeuGly: 8.673 ± 0.093
LeuHis: 1.828 ± 0.045
LeuIle: 4.164 ± 0.066
LeuLys: 2.893 ± 0.049
LeuLeu: 9.303 ± 0.114
LeuMet: 2.187 ± 0.047
LeuAsn: 1.901 ± 0.043
LeuPro: 5.788 ± 0.086
LeuGln: 2.124 ± 0.04
LeuArg: 6.993 ± 0.08
LeuSer: 5.69 ± 0.075
LeuThr: 5.37 ± 0.069
LeuVal: 8.384 ± 0.086
LeuTrp: 1.059 ± 0.03
LeuTyr: 1.829 ± 0.045
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.18 ± 0.046
MetCys: 0.126 ± 0.009
MetAsp: 1.132 ± 0.027
MetGlu: 1.045 ± 0.03
MetPhe: 0.669 ± 0.027
MetGly: 1.582 ± 0.039
MetHis: 0.36 ± 0.017
MetIle: 1.401 ± 0.033
MetLys: 0.82 ± 0.029
MetLeu: 2.408 ± 0.048
MetMet: 0.675 ± 0.023
MetAsn: 0.647 ± 0.024
MetPro: 1.661 ± 0.043
MetGln: 0.667 ± 0.021
MetArg: 1.957 ± 0.039
MetSer: 1.623 ± 0.037
MetThr: 2.095 ± 0.04
MetVal: 1.718 ± 0.037
MetTrp: 0.207 ± 0.015
MetTyr: 0.251 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.603 ± 0.047
AsnCys: 0.189 ± 0.014
AsnAsp: 1.269 ± 0.045
AsnGlu: 0.924 ± 0.029
AsnPhe: 0.741 ± 0.025
AsnGly: 1.967 ± 0.051
AsnHis: 0.434 ± 0.02
AsnIle: 1.03 ± 0.029
AsnLys: 0.443 ± 0.019
AsnLeu: 1.997 ± 0.048
AsnMet: 0.468 ± 0.021
AsnAsn: 0.472 ± 0.019
AsnPro: 1.493 ± 0.037
AsnGln: 0.493 ± 0.022
AsnArg: 1.583 ± 0.035
AsnSer: 0.941 ± 0.032
AsnThr: 1.026 ± 0.032
AsnVal: 1.705 ± 0.04
AsnTrp: 0.353 ± 0.017
AsnTyr: 0.474 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.157 ± 0.107
ProCys: 0.425 ± 0.022
ProAsp: 4.014 ± 0.059
ProGlu: 3.637 ± 0.057
ProPhe: 1.981 ± 0.042
ProGly: 5.187 ± 0.078
ProHis: 1.097 ± 0.032
ProIle: 2.068 ± 0.044
ProLys: 1.435 ± 0.034
ProLeu: 4.808 ± 0.069
ProMet: 1.358 ± 0.033
ProAsn: 1.041 ± 0.033
ProPro: 3.222 ± 0.078
ProGln: 1.388 ± 0.033
ProArg: 3.768 ± 0.066
ProSer: 2.968 ± 0.056
ProThr: 2.688 ± 0.049
ProVal: 4.762 ± 0.07
ProTrp: 0.695 ± 0.023
ProTyr: 1.041 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.185 ± 0.059
GlnCys: 0.14 ± 0.011
GlnAsp: 1.194 ± 0.034
GlnGlu: 1.148 ± 0.037
GlnPhe: 0.822 ± 0.023
GlnGly: 1.831 ± 0.044
GlnHis: 0.505 ± 0.023
GlnIle: 1.563 ± 0.038
GlnLys: 0.755 ± 0.026
GlnLeu: 2.126 ± 0.045
GlnMet: 0.689 ± 0.026
GlnAsn: 0.538 ± 0.023
GlnPro: 1.443 ± 0.039
GlnGln: 0.81 ± 0.031
GlnArg: 2.11 ± 0.05
GlnSer: 1.29 ± 0.04
GlnThr: 1.401 ± 0.037
GlnVal: 1.883 ± 0.044
GlnTrp: 0.25 ± 0.015
GlnTyr: 0.403 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.306 ± 0.112
ArgCys: 0.796 ± 0.029
ArgAsp: 4.35 ± 0.064
ArgGlu: 4.119 ± 0.05
ArgPhe: 2.977 ± 0.057
ArgGly: 5.594 ± 0.084
ArgHis: 2.006 ± 0.042
ArgIle: 4.463 ± 0.065
ArgLys: 1.957 ± 0.044
ArgLeu: 9.124 ± 0.125
ArgMet: 2.122 ± 0.046
ArgAsn: 1.642 ± 0.037
ArgPro: 5.024 ± 0.082
ArgGln: 2.357 ± 0.051
ArgArg: 8.265 ± 0.117
ArgSer: 4.339 ± 0.075
ArgThr: 3.83 ± 0.056
ArgVal: 4.989 ± 0.071
ArgTrp: 1.157 ± 0.033
ArgTyr: 1.469 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.738 ± 0.088
SerCys: 0.483 ± 0.021
SerAsp: 2.79 ± 0.054
SerGlu: 2.422 ± 0.045
SerPhe: 2.056 ± 0.046
SerGly: 5.699 ± 0.089
SerHis: 0.95 ± 0.031
SerIle: 2.353 ± 0.047
SerLys: 1.176 ± 0.03
SerLeu: 5.065 ± 0.066
SerMet: 1.27 ± 0.032
SerAsn: 1.019 ± 0.032
SerPro: 3.127 ± 0.068
SerGln: 1.237 ± 0.031
SerArg: 4.167 ± 0.069
SerSer: 3.1 ± 0.068
SerThr: 2.743 ± 0.054
SerVal: 3.735 ± 0.06
SerTrp: 0.653 ± 0.023
SerTyr: 1.054 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.042 ± 0.085
ThrCys: 0.502 ± 0.022
ThrAsp: 3.088 ± 0.052
ThrGlu: 2.613 ± 0.043
ThrPhe: 1.969 ± 0.042
ThrGly: 5.597 ± 0.07
ThrHis: 0.91 ± 0.025
ThrIle: 2.697 ± 0.052
ThrLys: 1.151 ± 0.032
ThrLeu: 5.81 ± 0.077
ThrMet: 1.276 ± 0.037
ThrAsn: 1.053 ± 0.034
ThrPro: 3.428 ± 0.054
ThrGln: 1.078 ± 0.032
ThrArg: 3.874 ± 0.058
ThrSer: 3.063 ± 0.058
ThrThr: 3.169 ± 0.058
ThrVal: 4.565 ± 0.064
ThrTrp: 0.696 ± 0.028
ThrTyr: 1.073 ± 0.029
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.62 ± 0.113
ValCys: 0.653 ± 0.024
ValAsp: 4.798 ± 0.06
ValGlu: 4.714 ± 0.073
ValPhe: 2.884 ± 0.056
ValGly: 6.002 ± 0.074
ValHis: 1.443 ± 0.035
ValIle: 3.972 ± 0.065
ValLys: 1.971 ± 0.042
ValLeu: 7.424 ± 0.088
ValMet: 1.941 ± 0.039
ValAsn: 1.706 ± 0.045
ValPro: 4.217 ± 0.067
ValGln: 1.619 ± 0.041
ValArg: 5.362 ± 0.07
ValSer: 4.187 ± 0.055
ValThr: 4.975 ± 0.063
ValVal: 6.936 ± 0.092
ValTrp: 0.954 ± 0.03
ValTyr: 1.49 ± 0.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.125 ± 0.034
TrpCys: 0.182 ± 0.014
TrpAsp: 0.568 ± 0.024
TrpGlu: 0.448 ± 0.021
TrpPhe: 0.483 ± 0.024
TrpGly: 0.802 ± 0.025
TrpHis: 0.295 ± 0.015
TrpIle: 0.602 ± 0.026
TrpLys: 0.379 ± 0.017
TrpLeu: 1.43 ± 0.035
TrpMet: 0.372 ± 0.018
TrpAsn: 0.377 ± 0.018
TrpPro: 0.783 ± 0.029
TrpGln: 0.47 ± 0.02
TrpArg: 1.32 ± 0.035
TrpSer: 0.863 ± 0.033
TrpThr: 0.858 ± 0.031
TrpVal: 0.727 ± 0.023
TrpTrp: 0.242 ± 0.014
TrpTyr: 0.266 ± 0.014
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.281 ± 0.044
TyrCys: 0.189 ± 0.013
TyrAsp: 1.331 ± 0.029
TyrGlu: 1.024 ± 0.027
TyrPhe: 0.747 ± 0.022
TyrGly: 2.025 ± 0.044
TyrHis: 0.395 ± 0.018
TyrIle: 0.715 ± 0.025
TyrLys: 0.453 ± 0.022
TyrLeu: 1.936 ± 0.044
TyrMet: 0.388 ± 0.018
TyrAsn: 0.4 ± 0.019
TyrPro: 0.967 ± 0.027
TyrGln: 0.491 ± 0.021
TyrArg: 1.546 ± 0.036
TyrSer: 0.893 ± 0.027
TyrThr: 0.943 ± 0.027
TyrVal: 1.568 ± 0.038
TyrTrp: 0.261 ± 0.014
TyrTyr: 0.437 ± 0.019
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4503 proteins (1213789 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
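A protein-level bootstrap of this kind could be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input: four short made-up sequences.
mean, err = bootstrap_error(["MAALAA", "MKLAAV", "MGGSAA", "MAAAAK"], "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.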

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski