Amino acid dipeptide frequency for Formosa sp. Hel3_A1_48

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
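The uniform expectation and the red/blue marking can be illustrated with a short sketch. It assumes that a value is simply compared against 1/400 (or 1/441 with Xaa); the function names and the strict comparison are illustrative assumptions, since the page does not state the exact rule it uses.

```python
def uniform_expectation_permille(n_letters=20):
    """Expected frequency (in per mille) of one dipeptide if all
    n_letters**2 ordered pairs were equally likely."""
    return 1000.0 / (n_letters ** 2)


def mark(value_permille, expected_permille):
    """Hypothetical marking rule: 'red' above the expectation,
    'blue' below it, 'none' if exactly equal."""
    if value_permille > expected_permille:
        return "red"
    if value_permille < expected_permille:
        return "blue"
    return "none"


expected = uniform_expectation_permille()   # 20 standard amino acids
leu_leu = mark(8.232, expected)             # LeuLeu value from the table
cys_cys = mark(0.113, expected)             # CysCys value from the table
```

With 20 letters the expectation is 2.5‰, so LeuLeu (8.232‰) would be marked as overrepresented and CysCys (0.113‰) as underrepresented under this assumed rule.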

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
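A minimal sketch of how such per mille values could be computed from a set of protein sequences. It assumes overlapping adjacent pairs counted within each protein (never across protein boundaries), one-letter sequence codes, and any non-standard letter mapped to X for Xaa. The function name and these counting rules are illustrative assumptions, not the database's documented procedure.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs inside this protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


freqs = dipeptide_permille(["AAC"])  # pairs: AA, AC
```

To convert back to a fraction, multiply by 10⁻³: a table value of 4.657‰ corresponds to 0.004657.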

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.657AlaAla: 4.657 ± 0.123
0.558AlaCys: 0.558 ± 0.031
3.259AlaAsp: 3.259 ± 0.084
3.869AlaGlu: 3.869 ± 0.083
3.486AlaPhe: 3.486 ± 0.083
3.888AlaGly: 3.888 ± 0.092
1.42AlaHis: 1.42 ± 0.054
5.035AlaIle: 5.035 ± 0.097
4.874AlaLys: 4.874 ± 0.115
7.243AlaLeu: 7.243 ± 0.122
1.476AlaMet: 1.476 ± 0.053
3.521AlaAsn: 3.521 ± 0.075
2.091AlaPro: 2.091 ± 0.063
3.002AlaGln: 3.002 ± 0.071
2.11AlaArg: 2.11 ± 0.061
4.343AlaSer: 4.343 ± 0.093
3.564AlaThr: 3.564 ± 0.096
4.661AlaVal: 4.661 ± 0.114
0.596AlaTrp: 0.596 ± 0.032
2.633AlaTyr: 2.633 ± 0.071
0.0AlaXaa: 0.0 ± 0.0
Cys
0.542CysAla: 0.542 ± 0.03
0.113CysCys: 0.113 ± 0.016
0.442CysAsp: 0.442 ± 0.033
0.428CysGlu: 0.428 ± 0.028
0.482CysPhe: 0.482 ± 0.028
0.676CysGly: 0.676 ± 0.041
0.176CysHis: 0.176 ± 0.021
0.577CysIle: 0.577 ± 0.027
0.449CysLys: 0.449 ± 0.027
0.681CysLeu: 0.681 ± 0.035
0.144CysMet: 0.144 ± 0.015
0.412CysAsn: 0.412 ± 0.03
0.353CysPro: 0.353 ± 0.031
0.264CysGln: 0.264 ± 0.021
0.211CysArg: 0.211 ± 0.019
0.61CysSer: 0.61 ± 0.044
0.503CysThr: 0.503 ± 0.029
0.55CysVal: 0.55 ± 0.032
0.061CysTrp: 0.061 ± 0.009
0.331CysTyr: 0.331 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.867AspAla: 3.867 ± 0.083
0.468AspCys: 0.468 ± 0.027
3.131AspAsp: 3.131 ± 0.094
3.783AspGlu: 3.783 ± 0.083
3.738AspPhe: 3.738 ± 0.078
3.65AspGly: 3.65 ± 0.098
1.096AspHis: 1.096 ± 0.046
4.22AspIle: 4.22 ± 0.088
3.503AspLys: 3.503 ± 0.085
5.69AspLeu: 5.69 ± 0.098
1.123AspMet: 1.123 ± 0.043
2.971AspAsn: 2.971 ± 0.083
1.976AspPro: 1.976 ± 0.075
2.117AspGln: 2.117 ± 0.063
1.917AspArg: 1.917 ± 0.05
3.281AspSer: 3.281 ± 0.083
2.609AspThr: 2.609 ± 0.069
3.778AspVal: 3.778 ± 0.079
0.7AspTrp: 0.7 ± 0.035
2.805AspTyr: 2.805 ± 0.058
0.0AspXaa: 0.0 ± 0.0
Glu
4.503GluAla: 4.503 ± 0.1
0.361GluCys: 0.361 ± 0.025
3.53GluAsp: 3.53 ± 0.079
4.075GluGlu: 4.075 ± 0.106
3.029GluPhe: 3.029 ± 0.066
3.647GluGly: 3.647 ± 0.081
1.262GluHis: 1.262 ± 0.044
4.984GluIle: 4.984 ± 0.087
4.519GluLys: 4.519 ± 0.103
6.214GluLeu: 6.214 ± 0.104
1.369GluMet: 1.369 ± 0.048
4.094GluAsn: 4.094 ± 0.083
1.481GluPro: 1.481 ± 0.045
2.575GluGln: 2.575 ± 0.07
2.423GluArg: 2.423 ± 0.071
3.457GluSer: 3.457 ± 0.086
3.417GluThr: 3.417 ± 0.074
4.176GluVal: 4.176 ± 0.091
0.577GluTrp: 0.577 ± 0.032
2.002GluTyr: 2.002 ± 0.049
0.0GluXaa: 0.0 ± 0.0
Phe
2.912PheAla: 2.912 ± 0.07
0.511PheCys: 0.511 ± 0.031
3.679PheAsp: 3.679 ± 0.091
3.775PheGlu: 3.775 ± 0.078
2.859PhePhe: 2.859 ± 0.088
3.816PheGly: 3.816 ± 0.089
0.775PheHis: 0.775 ± 0.037
3.955PheIle: 3.955 ± 0.079
3.815PheLys: 3.815 ± 0.099
4.703PheLeu: 4.703 ± 0.103
1.173PheMet: 1.173 ± 0.035
3.316PheAsn: 3.316 ± 0.071
1.641PhePro: 1.641 ± 0.044
1.559PheGln: 1.559 ± 0.051
1.677PheArg: 1.677 ± 0.056
4.423PheSer: 4.423 ± 0.095
2.887PheThr: 2.887 ± 0.08
3.12PheVal: 3.12 ± 0.072
0.58PheTrp: 0.58 ± 0.035
2.083PheTyr: 2.083 ± 0.05
0.0PheXaa: 0.0 ± 0.0
Gly
4.569GlyAla: 4.569 ± 0.108
0.66GlyCys: 0.66 ± 0.049
3.458GlyAsp: 3.458 ± 0.087
3.289GlyGlu: 3.289 ± 0.078
3.775GlyPhe: 3.775 ± 0.086
4.618GlyGly: 4.618 ± 0.113
1.209GlyHis: 1.209 ± 0.044
5.113GlyIle: 5.113 ± 0.096
4.302GlyLys: 4.302 ± 0.084
6.115GlyLeu: 6.115 ± 0.101
1.546GlyMet: 1.546 ± 0.059
3.102GlyAsn: 3.102 ± 0.088
1.554GlyPro: 1.554 ± 0.051
2.331GlyGln: 2.331 ± 0.063
2.22GlyArg: 2.22 ± 0.068
4.133GlySer: 4.133 ± 0.097
3.899GlyThr: 3.899 ± 0.097
4.673GlyVal: 4.673 ± 0.097
0.759GlyTrp: 0.759 ± 0.04
2.695GlyTyr: 2.695 ± 0.075
0.0GlyXaa: 0.0 ± 0.0
His
1.026HisAla: 1.026 ± 0.046
0.193HisCys: 0.193 ± 0.02
0.855HisAsp: 0.855 ± 0.038
0.92HisGlu: 0.92 ± 0.035
1.16HisPhe: 1.16 ± 0.043
1.145HisGly: 1.145 ± 0.046
0.5HisHis: 0.5 ± 0.029
1.524HisIle: 1.524 ± 0.048
1.42HisLys: 1.42 ± 0.054
1.834HisLeu: 1.834 ± 0.053
0.383HisMet: 0.383 ± 0.026
1.088HisAsn: 1.088 ± 0.043
0.957HisPro: 0.957 ± 0.04
0.859HisGln: 0.859 ± 0.039
0.661HisArg: 0.661 ± 0.031
1.126HisSer: 1.126 ± 0.045
0.931HisThr: 0.931 ± 0.039
1.029HisVal: 1.029 ± 0.044
0.225HisTrp: 0.225 ± 0.021
0.875HisTyr: 0.875 ± 0.035
0.0HisXaa: 0.0 ± 0.0
Ile
5.233IleAla: 5.233 ± 0.097
0.685IleCys: 0.685 ± 0.037
5.141IleAsp: 5.141 ± 0.082
5.396IleGlu: 5.396 ± 0.097
3.712IlePhe: 3.712 ± 0.085
5.099IleGly: 5.099 ± 0.108
1.367IleHis: 1.367 ± 0.05
5.521IleIle: 5.521 ± 0.126
5.714IleLys: 5.714 ± 0.108
6.845IleLeu: 6.845 ± 0.115
1.265IleMet: 1.265 ± 0.043
4.733IleAsn: 4.733 ± 0.091
3.142IlePro: 3.142 ± 0.071
2.73IleGln: 2.73 ± 0.064
2.573IleArg: 2.573 ± 0.071
5.922IleSer: 5.922 ± 0.106
4.377IleThr: 4.377 ± 0.103
4.744IleVal: 4.744 ± 0.1
0.669IleTrp: 0.669 ± 0.04
2.647IleTyr: 2.647 ± 0.071
0.0IleXaa: 0.0 ± 0.0
Lys
5.174LysAla: 5.174 ± 0.113
0.356LysCys: 0.356 ± 0.025
3.944LysAsp: 3.944 ± 0.085
4.979LysGlu: 4.979 ± 0.114
2.947LysPhe: 2.947 ± 0.077
4.292LysGly: 4.292 ± 0.083
1.49LysHis: 1.49 ± 0.053
6.002LysIle: 6.002 ± 0.097
6.388LysLys: 6.388 ± 0.13
6.665LysLeu: 6.665 ± 0.135
1.717LysMet: 1.717 ± 0.056
5.101LysAsn: 5.101 ± 0.117
2.236LysPro: 2.236 ± 0.069
2.8LysGln: 2.8 ± 0.074
3.219LysArg: 3.219 ± 0.093
5.075LysSer: 5.075 ± 0.097
4.842LysThr: 4.842 ± 0.118
4.032LysVal: 4.032 ± 0.102
0.696LysTrp: 0.696 ± 0.032
2.665LysTyr: 2.665 ± 0.074
0.0LysXaa: 0.0 ± 0.0
Leu
5.85LeuAla: 5.85 ± 0.097
0.776LeuCys: 0.776 ± 0.042
5.441LeuAsp: 5.441 ± 0.089
6.34LeuGlu: 6.34 ± 0.11
4.998LeuPhe: 4.998 ± 0.119
6.04LeuGly: 6.04 ± 0.098
1.597LeuHis: 1.597 ± 0.051
7.225LeuIle: 7.225 ± 0.134
8.225LeuLys: 8.225 ± 0.142
8.232LeuLeu: 8.232 ± 0.138
2.187LeuMet: 2.187 ± 0.064
6.142LeuAsn: 6.142 ± 0.107
3.383LeuPro: 3.383 ± 0.072
3.035LeuGln: 3.035 ± 0.074
3.329LeuArg: 3.329 ± 0.087
7.388LeuSer: 7.388 ± 0.115
4.804LeuThr: 4.804 ± 0.108
5.529LeuVal: 5.529 ± 0.099
0.773LeuTrp: 0.773 ± 0.034
3.066LeuTyr: 3.066 ± 0.065
0.0LeuXaa: 0.0 ± 0.0
Met
1.661MetAla: 1.661 ± 0.045
0.137MetCys: 0.137 ± 0.015
1.153MetAsp: 1.153 ± 0.041
1.235MetGlu: 1.235 ± 0.045
0.864MetPhe: 0.864 ± 0.043
1.457MetGly: 1.457 ± 0.053
0.45MetHis: 0.45 ± 0.025
1.516MetIle: 1.516 ± 0.059
1.899MetLys: 1.899 ± 0.057
1.837MetLeu: 1.837 ± 0.052
0.511MetMet: 0.511 ± 0.032
1.26MetAsn: 1.26 ± 0.042
0.834MetPro: 0.834 ± 0.041
0.756MetGln: 0.756 ± 0.031
0.864MetArg: 0.864 ± 0.04
1.519MetSer: 1.519 ± 0.051
1.174MetThr: 1.174 ± 0.044
1.236MetVal: 1.236 ± 0.046
0.149MetTrp: 0.149 ± 0.016
0.692MetTyr: 0.692 ± 0.038
0.0MetXaa: 0.0 ± 0.0
Asn
3.883AsnAla: 3.883 ± 0.077
0.495AsnCys: 0.495 ± 0.032
3.284AsnAsp: 3.284 ± 0.074
3.58AsnGlu: 3.58 ± 0.082
3.209AsnPhe: 3.209 ± 0.088
3.82AsnGly: 3.82 ± 0.089
1.069AsnHis: 1.069 ± 0.043
4.623AsnIle: 4.623 ± 0.079
4.192AsnLys: 4.192 ± 0.091
5.334AsnLeu: 5.334 ± 0.105
1.265AsnMet: 1.265 ± 0.046
3.829AsnAsn: 3.829 ± 0.101
2.917AsnPro: 2.917 ± 0.074
2.369AsnGln: 2.369 ± 0.07
2.101AsnArg: 2.101 ± 0.065
4.256AsnSer: 4.256 ± 0.108
3.941AsnThr: 3.941 ± 0.088
3.343AsnVal: 3.343 ± 0.075
0.653AsnTrp: 0.653 ± 0.035
2.712AsnTyr: 2.712 ± 0.08
0.0AsnXaa: 0.0 ± 0.0
Pro
1.826ProAla: 1.826 ± 0.059
0.211ProCys: 0.211 ± 0.021
1.89ProAsp: 1.89 ± 0.063
2.542ProGlu: 2.542 ± 0.063
1.939ProPhe: 1.939 ± 0.058
1.885ProGly: 1.885 ± 0.061
0.657ProHis: 0.657 ± 0.032
2.837ProIle: 2.837 ± 0.082
2.871ProLys: 2.871 ± 0.066
3.115ProLeu: 3.115 ± 0.074
0.741ProMet: 0.741 ± 0.035
2.585ProAsn: 2.585 ± 0.068
0.832ProPro: 0.832 ± 0.038
1.219ProGln: 1.219 ± 0.049
0.871ProArg: 0.871 ± 0.039
2.361ProSer: 2.361 ± 0.07
1.984ProThr: 1.984 ± 0.071
2.244ProVal: 2.244 ± 0.06
0.377ProTrp: 0.377 ± 0.025
1.412ProTyr: 1.412 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
2.201GlnAla: 2.201 ± 0.059
0.222GlnCys: 0.222 ± 0.021
1.736GlnAsp: 1.736 ± 0.051
2.024GlnGlu: 2.024 ± 0.067
1.818GlnPhe: 1.818 ± 0.052
1.939GlnGly: 1.939 ± 0.062
0.788GlnHis: 0.788 ± 0.031
2.965GlnIle: 2.965 ± 0.067
3.056GlnLys: 3.056 ± 0.085
3.851GlnLeu: 3.851 ± 0.083
0.93GlnMet: 0.93 ± 0.04
2.588GlnAsn: 2.588 ± 0.071
1.117GlnPro: 1.117 ± 0.041
1.628GlnGln: 1.628 ± 0.071
1.455GlnArg: 1.455 ± 0.057
2.358GlnSer: 2.358 ± 0.069
2.125GlnThr: 2.125 ± 0.063
1.923GlnVal: 1.923 ± 0.061
0.466GlnTrp: 0.466 ± 0.031
1.415GlnTyr: 1.415 ± 0.049
0.0GlnXaa: 0.0 ± 0.0
Arg
2.422ArgAla: 2.422 ± 0.065
0.244ArgCys: 0.244 ± 0.02
1.727ArgAsp: 1.727 ± 0.053
1.864ArgGlu: 1.864 ± 0.065
2.003ArgPhe: 2.003 ± 0.053
1.987ArgGly: 1.987 ± 0.061
0.655ArgHis: 0.655 ± 0.032
2.903ArgIle: 2.903 ± 0.075
2.566ArgLys: 2.566 ± 0.08
3.527ArgLeu: 3.527 ± 0.085
0.845ArgMet: 0.845 ± 0.037
1.938ArgAsn: 1.938 ± 0.057
1.214ArgPro: 1.214 ± 0.045
1.286ArgGln: 1.286 ± 0.043
1.401ArgArg: 1.401 ± 0.053
2.145ArgSer: 2.145 ± 0.062
1.831ArgThr: 1.831 ± 0.056
2.179ArgVal: 2.179 ± 0.07
0.391ArgTrp: 0.391 ± 0.021
1.618ArgTyr: 1.618 ± 0.053
0.0ArgXaa: 0.0 ± 0.0
Ser
4.387SerAla: 4.387 ± 0.088
0.669SerCys: 0.669 ± 0.038
3.899SerAsp: 3.899 ± 0.083
4.123SerGlu: 4.123 ± 0.082
3.864SerPhe: 3.864 ± 0.086
5.01SerGly: 5.01 ± 0.101
0.981SerHis: 0.981 ± 0.038
5.888SerIle: 5.888 ± 0.107
5.209SerLys: 5.209 ± 0.107
6.133SerLeu: 6.133 ± 0.103
1.369SerMet: 1.369 ± 0.051
4.155SerAsn: 4.155 ± 0.099
2.264SerPro: 2.264 ± 0.062
2.037SerGln: 2.037 ± 0.062
2.128SerArg: 2.128 ± 0.058
5.174SerSer: 5.174 ± 0.117
3.903SerThr: 3.903 ± 0.079
4.435SerVal: 4.435 ± 0.082
0.677SerTrp: 0.677 ± 0.036
2.832SerTyr: 2.832 ± 0.082
0.0SerXaa: 0.0 ± 0.0
Thr
4.243ThrAla: 4.243 ± 0.084
0.273ThrCys: 0.273 ± 0.025
3.011ThrAsp: 3.011 ± 0.083
2.994ThrGlu: 2.994 ± 0.075
2.858ThrPhe: 2.858 ± 0.075
3.636ThrGly: 3.636 ± 0.086
1.176ThrHis: 1.176 ± 0.041
4.604ThrIle: 4.604 ± 0.107
3.966ThrLys: 3.966 ± 0.094
5.492ThrLeu: 5.492 ± 0.104
0.871ThrMet: 0.871 ± 0.041
3.351ThrAsn: 3.351 ± 0.085
2.588ThrPro: 2.588 ± 0.083
2.109ThrGln: 2.109 ± 0.06
1.508ThrArg: 1.508 ± 0.065
3.856ThrSer: 3.856 ± 0.103
3.692ThrThr: 3.692 ± 0.112
3.585ThrVal: 3.585 ± 0.108
0.473ThrTrp: 0.473 ± 0.032
2.227ThrTyr: 2.227 ± 0.068
0.0ThrXaa: 0.0 ± 0.0
Val
4.129ValAla: 4.129 ± 0.094
0.604ValCys: 0.604 ± 0.031
3.856ValAsp: 3.856 ± 0.084
3.773ValGlu: 3.773 ± 0.083
3.604ValPhe: 3.604 ± 0.082
4.066ValGly: 4.066 ± 0.097
1.056ValHis: 1.056 ± 0.045
4.593ValIle: 4.593 ± 0.102
3.997ValLys: 3.997 ± 0.096
6.514ValLeu: 6.514 ± 0.108
1.265ValMet: 1.265 ± 0.047
3.342ValAsn: 3.342 ± 0.089
2.117ValPro: 2.117 ± 0.058
2.126ValGln: 2.126 ± 0.052
2.196ValArg: 2.196 ± 0.072
4.482ValSer: 4.482 ± 0.082
3.113ValThr: 3.113 ± 0.081
4.561ValVal: 4.561 ± 0.114
0.589ValTrp: 0.589 ± 0.031
2.444ValTyr: 2.444 ± 0.066
0.0ValXaa: 0.0 ± 0.0
Trp
0.655TrpAla: 0.655 ± 0.035
0.093TrpCys: 0.093 ± 0.011
0.602TrpAsp: 0.602 ± 0.033
0.58TrpGlu: 0.58 ± 0.032
0.524TrpPhe: 0.524 ± 0.033
0.663TrpGly: 0.663 ± 0.033
0.24TrpHis: 0.24 ± 0.023
0.714TrpIle: 0.714 ± 0.032
0.693TrpLys: 0.693 ± 0.032
0.898TrpLeu: 0.898 ± 0.036
0.313TrpMet: 0.313 ± 0.025
0.631TrpAsn: 0.631 ± 0.031
0.238TrpPro: 0.238 ± 0.019
0.403TrpGln: 0.403 ± 0.028
0.401TrpArg: 0.401 ± 0.026
0.623TrpSer: 0.623 ± 0.036
0.564TrpThr: 0.564 ± 0.035
0.559TrpVal: 0.559 ± 0.033
0.118TrpTrp: 0.118 ± 0.017
0.446TrpTyr: 0.446 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.545TyrAla: 2.545 ± 0.066
0.372TyrCys: 0.372 ± 0.024
2.383TyrAsp: 2.383 ± 0.066
2.126TyrGlu: 2.126 ± 0.069
2.39TyrPhe: 2.39 ± 0.071
2.687TyrGly: 2.687 ± 0.071
0.749TyrHis: 0.749 ± 0.034
2.719TyrIle: 2.719 ± 0.074
2.963TyrLys: 2.963 ± 0.07
3.439TyrLeu: 3.439 ± 0.079
0.711TyrMet: 0.711 ± 0.034
2.716TyrAsn: 2.716 ± 0.076
1.463TyrPro: 1.463 ± 0.05
1.403TyrGln: 1.403 ± 0.054
1.479TyrArg: 1.479 ± 0.059
2.602TyrSer: 2.602 ± 0.066
2.353TyrThr: 2.353 ± 0.074
2.072TyrVal: 2.072 ± 0.055
0.427TyrTrp: 0.427 ± 0.028
1.637TyrTyr: 1.637 ± 0.06
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1866 proteins (625999 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
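Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. This sketch assumes the reported ± value is the standard deviation over the replicates, which the page does not state; the function names, the fixed seed, and the counting rule are illustrative.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over `rounds` resamples.

    Proteins (not residues) are the resampling unit.
    Requires a non-empty protein list and rounds >= 2.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw len(proteins) proteins with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


err = bootstrap_error(["AAAA", "CCCC", "AC"], "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.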

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski