Amino acid dipepetide frequency for Methylophaga thiooxydans DMS010

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.517AlaAla: 8.517 ± 0.115
0.925AlaCys: 0.925 ± 0.038
5.759AlaAsp: 5.759 ± 0.102
6.604AlaGlu: 6.604 ± 0.11
3.421AlaPhe: 3.421 ± 0.066
6.81AlaGly: 6.81 ± 0.106
1.782AlaHis: 1.782 ± 0.053
6.03AlaIle: 6.03 ± 0.092
4.225AlaLys: 4.225 ± 0.087
9.889AlaLeu: 9.889 ± 0.132
2.848AlaMet: 2.848 ± 0.063
3.279AlaAsn: 3.279 ± 0.068
2.826AlaPro: 2.826 ± 0.069
3.608AlaGln: 3.608 ± 0.085
4.073AlaArg: 4.073 ± 0.084
5.156AlaSer: 5.156 ± 0.087
4.699AlaThr: 4.699 ± 0.084
6.378AlaVal: 6.378 ± 0.107
1.1AlaTrp: 1.1 ± 0.039
2.314AlaTyr: 2.314 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.653CysAla: 0.653 ± 0.028
0.165CysCys: 0.165 ± 0.015
0.557CysAsp: 0.557 ± 0.026
0.538CysGlu: 0.538 ± 0.026
0.412CysPhe: 0.412 ± 0.024
0.818CysGly: 0.818 ± 0.036
0.339CysHis: 0.339 ± 0.022
0.499CysIle: 0.499 ± 0.024
0.311CysLys: 0.311 ± 0.021
0.915CysLeu: 0.915 ± 0.036
0.202CysMet: 0.202 ± 0.016
0.292CysAsn: 0.292 ± 0.018
0.466CysPro: 0.466 ± 0.028
0.457CysGln: 0.457 ± 0.025
0.466CysArg: 0.466 ± 0.025
0.507CysSer: 0.507 ± 0.027
0.36CysThr: 0.36 ± 0.02
0.57CysVal: 0.57 ± 0.028
0.093CysTrp: 0.093 ± 0.011
0.288CysTyr: 0.288 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
5.222AspAla: 5.222 ± 0.092
0.544AspCys: 0.544 ± 0.026
4.024AspAsp: 4.024 ± 0.125
4.16AspGlu: 4.16 ± 0.09
2.619AspPhe: 2.619 ± 0.066
4.059AspGly: 4.059 ± 0.146
1.235AspHis: 1.235 ± 0.036
4.525AspIle: 4.525 ± 0.071
3.27AspLys: 3.27 ± 0.072
5.644AspLeu: 5.644 ± 0.11
1.701AspMet: 1.701 ± 0.057
2.74AspAsn: 2.74 ± 0.069
2.304AspPro: 2.304 ± 0.058
1.996AspGln: 1.996 ± 0.052
2.717AspArg: 2.717 ± 0.063
3.412AspSer: 3.412 ± 0.086
3.031AspThr: 3.031 ± 0.083
4.177AspVal: 4.177 ± 0.081
1.012AspTrp: 1.012 ± 0.032
2.07AspTyr: 2.07 ± 0.054
0.0AspXaa: 0.0 ± 0.0
Glu
5.83GluAla: 5.83 ± 0.092
0.426GluCys: 0.426 ± 0.023
3.112GluAsp: 3.112 ± 0.074
3.465GluGlu: 3.465 ± 0.086
2.321GluPhe: 2.321 ± 0.062
3.698GluGly: 3.698 ± 0.078
1.635GluHis: 1.635 ± 0.053
3.921GluIle: 3.921 ± 0.079
3.597GluLys: 3.597 ± 0.07
6.464GluLeu: 6.464 ± 0.104
1.83GluMet: 1.83 ± 0.05
2.597GluAsn: 2.597 ± 0.06
2.301GluPro: 2.301 ± 0.082
4.335GluGln: 4.335 ± 0.091
3.503GluArg: 3.503 ± 0.066
3.487GluSer: 3.487 ± 0.073
3.889GluThr: 3.889 ± 0.092
4.26GluVal: 4.26 ± 0.072
0.781GluTrp: 0.781 ± 0.034
1.509GluTyr: 1.509 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
3.316PheAla: 3.316 ± 0.067
0.422PheCys: 0.422 ± 0.021
2.862PheAsp: 2.862 ± 0.063
2.442PheGlu: 2.442 ± 0.055
1.725PhePhe: 1.725 ± 0.051
2.894PheGly: 2.894 ± 0.064
0.844PheHis: 0.844 ± 0.035
2.651PheIle: 2.651 ± 0.061
1.732PheLys: 1.732 ± 0.048
3.488PheLeu: 3.488 ± 0.078
0.993PheMet: 0.993 ± 0.036
1.898PheAsn: 1.898 ± 0.053
1.439PhePro: 1.439 ± 0.043
1.243PheGln: 1.243 ± 0.044
1.655PheArg: 1.655 ± 0.046
3.021PheSer: 3.021 ± 0.064
2.24PheThr: 2.24 ± 0.059
2.552PheVal: 2.552 ± 0.063
0.521PheTrp: 0.521 ± 0.028
1.297PheTyr: 1.297 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
5.206GlyAla: 5.206 ± 0.108
0.748GlyCys: 0.748 ± 0.033
4.035GlyAsp: 4.035 ± 0.112
4.489GlyGlu: 4.489 ± 0.084
3.08GlyPhe: 3.08 ± 0.058
4.991GlyGly: 4.991 ± 0.121
1.741GlyHis: 1.741 ± 0.055
4.575GlyIle: 4.575 ± 0.081
3.749GlyLys: 3.749 ± 0.084
7.365GlyLeu: 7.365 ± 0.115
2.084GlyMet: 2.084 ± 0.056
2.597GlyAsn: 2.597 ± 0.08
1.821GlyPro: 1.821 ± 0.055
3.008GlyGln: 3.008 ± 0.058
3.457GlyArg: 3.457 ± 0.073
3.857GlySer: 3.857 ± 0.117
3.428GlyThr: 3.428 ± 0.078
4.938GlyVal: 4.938 ± 0.096
1.043GlyTrp: 1.043 ± 0.032
2.374GlyTyr: 2.374 ± 0.052
0.001GlyXaa: 0.001 ± 0.001
His
2.022HisAla: 2.022 ± 0.062
0.326HisCys: 0.326 ± 0.021
1.417HisAsp: 1.417 ± 0.046
1.274HisGlu: 1.274 ± 0.038
1.019HisPhe: 1.019 ± 0.043
1.672HisGly: 1.672 ± 0.048
0.826HisHis: 0.826 ± 0.036
1.498HisIle: 1.498 ± 0.047
1.051HisLys: 1.051 ± 0.037
2.317HisLeu: 2.317 ± 0.062
0.567HisMet: 0.567 ± 0.027
0.942HisAsn: 0.942 ± 0.039
1.29HisPro: 1.29 ± 0.045
1.268HisGln: 1.268 ± 0.044
1.238HisArg: 1.238 ± 0.045
1.404HisSer: 1.404 ± 0.046
1.202HisThr: 1.202 ± 0.038
1.542HisVal: 1.542 ± 0.049
0.399HisTrp: 0.399 ± 0.023
0.945HisTyr: 0.945 ± 0.038
0.0HisXaa: 0.0 ± 0.0
Ile
6.144IleAla: 6.144 ± 0.095
0.591IleCys: 0.591 ± 0.029
4.448IleAsp: 4.448 ± 0.089
4.602IleGlu: 4.602 ± 0.082
1.964IlePhe: 1.964 ± 0.056
4.495IleGly: 4.495 ± 0.091
1.452IleHis: 1.452 ± 0.047
3.813IleIle: 3.813 ± 0.074
3.191IleLys: 3.191 ± 0.068
5.124IleLeu: 5.124 ± 0.099
1.247IleMet: 1.247 ± 0.041
3.095IleAsn: 3.095 ± 0.068
2.619IlePro: 2.619 ± 0.059
2.359IleGln: 2.359 ± 0.057
3.261IleArg: 3.261 ± 0.067
4.32IleSer: 4.32 ± 0.083
3.768IleThr: 3.768 ± 0.069
4.024IleVal: 4.024 ± 0.077
0.664IleTrp: 0.664 ± 0.031
1.54IleTyr: 1.54 ± 0.043
0.0IleXaa: 0.0 ± 0.0
Lys
4.7LysAla: 4.7 ± 0.085
0.229LysCys: 0.229 ± 0.021
2.369LysAsp: 2.369 ± 0.069
2.527LysGlu: 2.527 ± 0.056
1.331LysPhe: 1.331 ± 0.038
2.917LysGly: 2.917 ± 0.064
1.247LysHis: 1.247 ± 0.045
2.537LysIle: 2.537 ± 0.066
2.642LysLys: 2.642 ± 0.074
5.156LysLeu: 5.156 ± 0.094
1.263LysMet: 1.263 ± 0.04
1.882LysAsn: 1.882 ± 0.053
2.353LysPro: 2.353 ± 0.056
3.355LysGln: 3.355 ± 0.062
2.636LysArg: 2.636 ± 0.066
2.541LysSer: 2.541 ± 0.063
3.027LysThr: 3.027 ± 0.072
3.268LysVal: 3.268 ± 0.074
0.547LysTrp: 0.547 ± 0.03
1.05LysTyr: 1.05 ± 0.047
0.0LysXaa: 0.0 ± 0.0
Leu
10.112LeuAla: 10.112 ± 0.149
0.892LeuCys: 0.892 ± 0.033
6.22LeuAsp: 6.22 ± 0.099
6.124LeuGlu: 6.124 ± 0.118
4.285LeuPhe: 4.285 ± 0.092
6.819LeuGly: 6.819 ± 0.119
2.322LeuHis: 2.322 ± 0.057
6.488LeuIle: 6.488 ± 0.113
5.192LeuLys: 5.192 ± 0.086
11.059LeuLeu: 11.059 ± 0.169
2.665LeuMet: 2.665 ± 0.062
4.398LeuAsn: 4.398 ± 0.081
4.94LeuPro: 4.94 ± 0.094
4.294LeuGln: 4.294 ± 0.093
5.052LeuArg: 5.052 ± 0.104
7.844LeuSer: 7.844 ± 0.116
6.157LeuThr: 6.157 ± 0.092
6.625LeuVal: 6.625 ± 0.109
1.085LeuTrp: 1.085 ± 0.04
2.47LeuTyr: 2.47 ± 0.063
0.0LeuXaa: 0.0 ± 0.0
Met
2.783MetAla: 2.783 ± 0.062
0.173MetCys: 0.173 ± 0.015
1.381MetAsp: 1.381 ± 0.046
1.303MetGlu: 1.303 ± 0.045
0.847MetPhe: 0.847 ± 0.032
1.819MetGly: 1.819 ± 0.051
0.586MetHis: 0.586 ± 0.026
1.518MetIle: 1.518 ± 0.043
1.311MetLys: 1.311 ± 0.039
2.924MetLeu: 2.924 ± 0.065
0.842MetMet: 0.842 ± 0.039
1.08MetAsn: 1.08 ± 0.037
1.33MetPro: 1.33 ± 0.042
1.373MetGln: 1.373 ± 0.039
1.362MetArg: 1.362 ± 0.043
1.853MetSer: 1.853 ± 0.051
1.818MetThr: 1.818 ± 0.054
1.774MetVal: 1.774 ± 0.051
0.246MetTrp: 0.246 ± 0.018
0.485MetTyr: 0.485 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
3.59AsnAla: 3.59 ± 0.069
0.288AsnCys: 0.288 ± 0.018
2.401AsnAsp: 2.401 ± 0.064
2.259AsnGlu: 2.259 ± 0.065
1.252AsnPhe: 1.252 ± 0.046
2.732AsnGly: 2.732 ± 0.084
0.959AsnHis: 0.959 ± 0.038
2.618AsnIle: 2.618 ± 0.052
2.084AsnLys: 2.084 ± 0.053
3.82AsnLeu: 3.82 ± 0.072
1.018AsnMet: 1.018 ± 0.036
1.782AsnAsn: 1.782 ± 0.069
2.007AsnPro: 2.007 ± 0.057
1.938AsnGln: 1.938 ± 0.054
1.987AsnArg: 1.987 ± 0.057
2.191AsnSer: 2.191 ± 0.055
2.23AsnThr: 2.23 ± 0.065
2.627AsnVal: 2.627 ± 0.067
0.644AsnTrp: 0.644 ± 0.028
1.165AsnTyr: 1.165 ± 0.043
0.0AsnXaa: 0.0 ± 0.0
Pro
3.634ProAla: 3.634 ± 0.069
0.334ProCys: 0.334 ± 0.02
2.945ProAsp: 2.945 ± 0.067
3.47ProGlu: 3.47 ± 0.1
1.649ProPhe: 1.649 ± 0.05
2.756ProGly: 2.756 ± 0.068
0.979ProHis: 0.979 ± 0.038
2.34ProIle: 2.34 ± 0.062
1.668ProLys: 1.668 ± 0.051
4.052ProLeu: 4.052 ± 0.078
1.041ProMet: 1.041 ± 0.039
1.441ProAsn: 1.441 ± 0.045
1.26ProPro: 1.26 ± 0.049
1.72ProGln: 1.72 ± 0.053
1.556ProArg: 1.556 ± 0.052
2.248ProSer: 2.248 ± 0.053
2.042ProThr: 2.042 ± 0.05
3.468ProVal: 3.468 ± 0.07
0.461ProTrp: 0.461 ± 0.024
1.244ProTyr: 1.244 ± 0.042
0.0ProXaa: 0.0 ± 0.0
Gln
4.932GlnAla: 4.932 ± 0.083
0.387GlnCys: 0.387 ± 0.024
2.223GlnAsp: 2.223 ± 0.056
2.349GlnGlu: 2.349 ± 0.058
1.806GlnPhe: 1.806 ± 0.046
2.866GlnGly: 2.866 ± 0.062
1.503GlnHis: 1.503 ± 0.052
2.692GlnIle: 2.692 ± 0.058
2.108GlnLys: 2.108 ± 0.062
5.613GlnLeu: 5.613 ± 0.089
1.121GlnMet: 1.121 ± 0.04
1.622GlnAsn: 1.622 ± 0.055
1.912GlnPro: 1.912 ± 0.053
3.907GlnGln: 3.907 ± 0.093
2.862GlnArg: 2.862 ± 0.073
2.801GlnSer: 2.801 ± 0.056
2.592GlnThr: 2.592 ± 0.07
3.084GlnVal: 3.084 ± 0.067
0.632GlnTrp: 0.632 ± 0.03
1.133GlnTyr: 1.133 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
3.745ArgAla: 3.745 ± 0.064
0.448ArgCys: 0.448 ± 0.028
3.067ArgAsp: 3.067 ± 0.059
3.378ArgGlu: 3.378 ± 0.068
2.317ArgPhe: 2.317 ± 0.057
2.881ArgGly: 2.881 ± 0.064
1.426ArgHis: 1.426 ± 0.048
3.064ArgIle: 3.064 ± 0.074
2.156ArgLys: 2.156 ± 0.059
5.612ArgLeu: 5.612 ± 0.102
1.376ArgMet: 1.376 ± 0.044
1.789ArgAsn: 1.789 ± 0.044
1.811ArgPro: 1.811 ± 0.05
2.801ArgGln: 2.801 ± 0.068
2.877ArgArg: 2.877 ± 0.077
2.665ArgSer: 2.665 ± 0.069
2.157ArgThr: 2.157 ± 0.052
3.383ArgVal: 3.383 ± 0.057
0.753ArgTrp: 0.753 ± 0.031
1.988ArgTyr: 1.988 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
5.235SerAla: 5.235 ± 0.084
0.522SerCys: 0.522 ± 0.027
3.66SerAsp: 3.66 ± 0.084
3.743SerGlu: 3.743 ± 0.083
2.536SerPhe: 2.536 ± 0.063
4.835SerGly: 4.835 ± 0.099
1.61SerHis: 1.61 ± 0.042
3.508SerIle: 3.508 ± 0.063
2.597SerLys: 2.597 ± 0.073
6.962SerLeu: 6.962 ± 0.105
1.768SerMet: 1.768 ± 0.044
2.254SerAsn: 2.254 ± 0.055
2.451SerPro: 2.451 ± 0.064
2.926SerGln: 2.926 ± 0.066
3.16SerArg: 3.16 ± 0.068
3.713SerSer: 3.713 ± 0.091
2.975SerThr: 2.975 ± 0.069
4.233SerVal: 4.233 ± 0.093
0.87SerTrp: 0.87 ± 0.032
1.847SerTyr: 1.847 ± 0.056
0.0SerXaa: 0.0 ± 0.0
Thr
5.143ThrAla: 5.143 ± 0.098
0.383ThrCys: 0.383 ± 0.023
3.382ThrAsp: 3.382 ± 0.084
3.466ThrGlu: 3.466 ± 0.068
1.987ThrPhe: 1.987 ± 0.05
4.324ThrGly: 4.324 ± 0.092
1.297ThrHis: 1.297 ± 0.039
3.377ThrIle: 3.377 ± 0.071
1.929ThrLys: 1.929 ± 0.058
6.675ThrLeu: 6.675 ± 0.118
1.316ThrMet: 1.316 ± 0.043
1.812ThrAsn: 1.812 ± 0.053
2.729ThrPro: 2.729 ± 0.061
2.4ThrGln: 2.4 ± 0.06
2.328ThrArg: 2.328 ± 0.058
3.164ThrSer: 3.164 ± 0.073
2.994ThrThr: 2.994 ± 0.074
4.105ThrVal: 4.105 ± 0.098
0.557ThrTrp: 0.557 ± 0.03
1.29ThrTyr: 1.29 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
6.303ValAla: 6.303 ± 0.109
0.669ValCys: 0.669 ± 0.03
4.274ValAsp: 4.274 ± 0.074
4.6ValGlu: 4.6 ± 0.079
2.886ValPhe: 2.886 ± 0.078
4.407ValGly: 4.407 ± 0.094
1.322ValHis: 1.322 ± 0.043
4.901ValIle: 4.901 ± 0.088
3.122ValLys: 3.122 ± 0.063
6.913ValLeu: 6.913 ± 0.109
2.057ValMet: 2.057 ± 0.051
2.758ValAsn: 2.758 ± 0.065
2.609ValPro: 2.609 ± 0.064
2.172ValGln: 2.172 ± 0.049
3.059ValArg: 3.059 ± 0.07
4.754ValSer: 4.754 ± 0.081
4.142ValThr: 4.142 ± 0.088
5.103ValVal: 5.103 ± 0.101
0.829ValTrp: 0.829 ± 0.036
1.755ValTyr: 1.755 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.865TrpAla: 0.865 ± 0.039
0.132TrpCys: 0.132 ± 0.012
0.613TrpAsp: 0.613 ± 0.029
0.594TrpGlu: 0.594 ± 0.026
0.598TrpPhe: 0.598 ± 0.031
0.724TrpGly: 0.724 ± 0.032
0.375TrpHis: 0.375 ± 0.021
0.646TrpIle: 0.646 ± 0.031
0.492TrpLys: 0.492 ± 0.025
1.898TrpLeu: 1.898 ± 0.065
0.326TrpMet: 0.326 ± 0.021
0.453TrpAsn: 0.453 ± 0.024
0.559TrpPro: 0.559 ± 0.029
1.152TrpGln: 1.152 ± 0.042
0.764TrpArg: 0.764 ± 0.035
0.764TrpSer: 0.764 ± 0.034
0.516TrpThr: 0.516 ± 0.027
0.828TrpVal: 0.828 ± 0.036
0.179TrpTrp: 0.179 ± 0.016
0.358TrpTyr: 0.358 ± 0.033
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.29TyrAla: 2.29 ± 0.05
0.302TyrCys: 0.302 ± 0.02
1.715TyrAsp: 1.715 ± 0.057
1.576TyrGlu: 1.576 ± 0.048
1.181TyrPhe: 1.181 ± 0.042
2.071TyrGly: 2.071 ± 0.06
0.742TyrHis: 0.742 ± 0.032
1.521TyrIle: 1.521 ± 0.045
1.149TyrLys: 1.149 ± 0.037
3.154TyrLeu: 3.154 ± 0.069
0.541TyrMet: 0.541 ± 0.024
0.876TyrAsn: 0.876 ± 0.035
1.28TyrPro: 1.28 ± 0.041
1.83TyrGln: 1.83 ± 0.055
1.678TyrArg: 1.678 ± 0.044
1.704TyrSer: 1.704 ± 0.052
1.393TyrThr: 1.393 ± 0.052
1.697TyrVal: 1.697 ± 0.045
0.421TyrTrp: 0.421 ± 0.024
0.835TyrTyr: 0.835 ± 0.028
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.008XaaXaa: 0.008 ± 0.008
Statistics based on 2492 proteins (781252 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski