Amino acid dipeptide frequency for Methylocystis sp. (strain SC2)

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.
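As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is an assumption that the database counts overlapping pairs in this way; the function name and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences, for illustration only.
toy_proteins = ["MAAL", "MALA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```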

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
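The comparison against the uniform expectation can be sketched as follows, using four values taken from the table. The exact rule the database uses for colouring (for example, whether a tolerance band is applied) is not stated, so the plain greater-than or less-than test here is an assumption, and the `classify` helper is hypothetical.

```python
# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

# A few observed values (per mille) copied from the table.
observed = {
    "AlaAla": 18.738,
    "AlaLeu": 15.116,
    "CysCys": 0.163,
    "TrpCys": 0.127,
}


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a frequency as over- (red) or underrepresented (blue)."""
    if value > expected:
        return "red"
    if value < expected:
        return "blue"
    return "neutral"


labels = {pair: classify(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(pair, observed[pair], label)
```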

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
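A small worked conversion may help here. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


ala_ala = 18.738  # AlaAla, per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# The uniform expectation of 0.25 % expressed in per mille.
uniform_per_mille = 1000.0 / 400

print(f"AlaAla: {fraction:.6f} as a fraction, {percent:.4f} %")
print(f"uniform expectation: {uniform_per_mille} per mille")
```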

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.738 ± 0.199
AlaCys: 1.239 ± 0.039
AlaAsp: 6.673 ± 0.078
AlaGlu: 7.802 ± 0.087
AlaPhe: 5.033 ± 0.067
AlaGly: 9.755 ± 0.1
AlaHis: 2.669 ± 0.05
AlaIle: 6.627 ± 0.077
AlaLys: 4.685 ± 0.064
AlaLeu: 15.116 ± 0.138
AlaMet: 3.199 ± 0.048
AlaAsn: 3.127 ± 0.061
AlaPro: 7.837 ± 0.103
AlaGln: 4.723 ± 0.067
AlaArg: 11.278 ± 0.134
AlaSer: 7.028 ± 0.09
AlaThr: 6.26 ± 0.07
AlaVal: 7.873 ± 0.096
AlaTrp: 1.489 ± 0.038
AlaTyr: 2.771 ± 0.057
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.476 ± 0.039
CysCys: 0.163 ± 0.012
CysAsp: 0.595 ± 0.021
CysGlu: 0.573 ± 0.019
CysPhe: 0.36 ± 0.017
CysGly: 1.037 ± 0.031
CysHis: 0.27 ± 0.02
CysIle: 0.359 ± 0.018
CysLys: 0.239 ± 0.015
CysLeu: 0.781 ± 0.026
CysMet: 0.177 ± 0.012
CysAsn: 0.232 ± 0.015
CysPro: 0.54 ± 0.023
CysGln: 0.231 ± 0.013
CysArg: 0.7 ± 0.026
CysSer: 0.514 ± 0.024
CysThr: 0.307 ± 0.017
CysVal: 0.819 ± 0.023
CysTrp: 0.126 ± 0.01
CysTyr: 0.214 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.072 ± 0.097
AspCys: 0.581 ± 0.023
AspAsp: 3.254 ± 0.054
AspGlu: 3.584 ± 0.059
AspPhe: 2.369 ± 0.049
AspGly: 4.833 ± 0.082
AspHis: 1.167 ± 0.034
AspIle: 2.808 ± 0.051
AspLys: 1.828 ± 0.042
AspLeu: 5.669 ± 0.073
AspMet: 1.201 ± 0.03
AspAsn: 1.204 ± 0.032
AspPro: 3.329 ± 0.056
AspGln: 1.503 ± 0.037
AspArg: 4.142 ± 0.061
AspSer: 2.396 ± 0.05
AspThr: 1.88 ± 0.045
AspVal: 4.265 ± 0.065
AspTrp: 0.974 ± 0.027
AspTyr: 1.541 ± 0.033
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.15 ± 0.114
GluCys: 0.443 ± 0.022
GluAsp: 2.759 ± 0.052
GluGlu: 3.412 ± 0.067
GluPhe: 1.997 ± 0.041
GluGly: 4.243 ± 0.063
GluHis: 1.225 ± 0.033
GluIle: 3.632 ± 0.055
GluLys: 2.59 ± 0.05
GluLeu: 5.457 ± 0.071
GluMet: 1.422 ± 0.034
GluAsn: 1.706 ± 0.04
GluPro: 2.665 ± 0.054
GluGln: 2.014 ± 0.041
GluArg: 5.766 ± 0.085
GluSer: 2.991 ± 0.055
GluThr: 3.685 ± 0.058
GluVal: 3.203 ± 0.062
GluTrp: 0.742 ± 0.028
GluTyr: 0.958 ± 0.028
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.223 ± 0.075
PheCys: 0.488 ± 0.021
PheAsp: 2.872 ± 0.054
PheGlu: 2.424 ± 0.051
PhePhe: 1.634 ± 0.042
PheGly: 3.65 ± 0.054
PheHis: 0.799 ± 0.023
PheIle: 1.79 ± 0.04
PheLys: 1.121 ± 0.031
PheLeu: 3.678 ± 0.062
PheMet: 0.756 ± 0.027
PheAsn: 1.069 ± 0.032
PhePro: 1.644 ± 0.038
PheGln: 0.973 ± 0.023
PheArg: 2.432 ± 0.043
PheSer: 2.463 ± 0.053
PheThr: 1.879 ± 0.039
PheVal: 3.036 ± 0.047
PheTrp: 0.577 ± 0.024
PheTyr: 0.988 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.427 ± 0.12
GlyCys: 0.884 ± 0.029
GlyAsp: 4.574 ± 0.073
GlyGlu: 4.886 ± 0.075
GlyPhe: 3.681 ± 0.059
GlyGly: 7.122 ± 0.144
GlyHis: 1.668 ± 0.044
GlyIle: 2.726 ± 0.052
GlyLys: 3.184 ± 0.061
GlyLeu: 8.08 ± 0.086
GlyMet: 1.826 ± 0.046
GlyAsn: 1.628 ± 0.048
GlyPro: 3.396 ± 0.059
GlyGln: 2.33 ± 0.044
GlyArg: 6.235 ± 0.07
GlySer: 3.918 ± 0.075
GlyThr: 2.774 ± 0.059
GlyVal: 7.276 ± 0.091
GlyTrp: 1.292 ± 0.046
GlyTyr: 2.115 ± 0.048
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.443 ± 0.044
HisCys: 0.266 ± 0.016
HisAsp: 1.204 ± 0.033
HisGlu: 1.154 ± 0.03
HisPhe: 0.878 ± 0.026
HisGly: 1.898 ± 0.048
HisHis: 0.544 ± 0.028
HisIle: 0.978 ± 0.031
HisLys: 0.525 ± 0.02
HisLeu: 1.917 ± 0.04
HisMet: 0.474 ± 0.019
HisAsn: 0.478 ± 0.019
HisPro: 1.289 ± 0.034
HisGln: 0.506 ± 0.021
HisArg: 1.436 ± 0.036
HisSer: 1.018 ± 0.036
HisThr: 0.711 ± 0.024
HisVal: 1.524 ± 0.038
HisTrp: 0.327 ± 0.016
HisTyr: 0.568 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.356 ± 0.073
IleCys: 0.535 ± 0.019
IleAsp: 3.733 ± 0.054
IleGlu: 3.8 ± 0.053
IlePhe: 1.868 ± 0.047
IleGly: 4.673 ± 0.067
IleHis: 0.935 ± 0.029
IleIle: 2.072 ± 0.048
IleLys: 1.372 ± 0.035
IleLeu: 3.956 ± 0.053
IleMet: 0.88 ± 0.025
IleAsn: 1.238 ± 0.034
IlePro: 2.071 ± 0.044
IleGln: 1.095 ± 0.034
IleArg: 2.899 ± 0.052
IleSer: 2.747 ± 0.054
IleThr: 1.969 ± 0.048
IleVal: 4.456 ± 0.071
IleTrp: 0.536 ± 0.02
IleTyr: 1.133 ± 0.029
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.149 ± 0.076
LysCys: 0.238 ± 0.015
LysAsp: 1.802 ± 0.038
LysGlu: 1.853 ± 0.044
LysPhe: 1.129 ± 0.033
LysGly: 2.731 ± 0.047
LysHis: 0.616 ± 0.023
LysIle: 2.067 ± 0.046
LysLys: 1.535 ± 0.043
LysLeu: 3.391 ± 0.052
LysMet: 0.817 ± 0.025
LysAsn: 1.039 ± 0.032
LysPro: 2.118 ± 0.049
LysGln: 1.016 ± 0.029
LysArg: 2.784 ± 0.048
LysSer: 2.09 ± 0.045
LysThr: 2.092 ± 0.049
LysVal: 2.069 ± 0.043
LysTrp: 0.379 ± 0.018
LysTyr: 0.631 ± 0.028
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.464 ± 0.147
LeuCys: 0.952 ± 0.031
LeuAsp: 5.987 ± 0.067
LeuGlu: 5.447 ± 0.068
LeuPhe: 3.845 ± 0.067
LeuGly: 7.66 ± 0.079
LeuHis: 1.852 ± 0.044
LeuIle: 4.941 ± 0.064
LeuLys: 3.374 ± 0.051
LeuLeu: 9.448 ± 0.107
LeuMet: 2.063 ± 0.042
LeuAsn: 2.491 ± 0.055
LeuPro: 5.203 ± 0.073
LeuGln: 2.551 ± 0.048
LeuArg: 7.707 ± 0.092
LeuSer: 6.459 ± 0.07
LeuThr: 5.296 ± 0.07
LeuVal: 6.796 ± 0.086
LeuTrp: 1.188 ± 0.037
LeuTyr: 2.133 ± 0.045
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.64 ± 0.054
MetCys: 0.177 ± 0.013
MetAsp: 0.931 ± 0.027
MetGlu: 1.082 ± 0.03
MetPhe: 0.709 ± 0.025
MetGly: 1.506 ± 0.037
MetHis: 0.414 ± 0.019
MetIle: 1.181 ± 0.033
MetLys: 0.912 ± 0.03
MetLeu: 2.166 ± 0.046
MetMet: 0.523 ± 0.026
MetAsn: 0.731 ± 0.024
MetPro: 1.154 ± 0.027
MetGln: 0.725 ± 0.025
MetArg: 2.057 ± 0.04
MetSer: 1.698 ± 0.036
MetThr: 1.634 ± 0.034
MetVal: 1.162 ± 0.035
MetTrp: 0.196 ± 0.014
MetTyr: 0.275 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.348 ± 0.064
AsnCys: 0.285 ± 0.019
AsnAsp: 1.388 ± 0.033
AsnGlu: 1.377 ± 0.038
AsnPhe: 1.105 ± 0.031
AsnGly: 2.212 ± 0.074
AsnHis: 0.531 ± 0.024
AsnIle: 1.303 ± 0.036
AsnLys: 0.742 ± 0.028
AsnLeu: 2.439 ± 0.047
AsnMet: 0.547 ± 0.021
AsnAsn: 0.65 ± 0.032
AsnPro: 1.604 ± 0.042
AsnGln: 0.662 ± 0.023
AsnArg: 1.746 ± 0.037
AsnSer: 1.237 ± 0.038
AsnThr: 0.95 ± 0.03
AsnVal: 2.041 ± 0.043
AsnTrp: 0.452 ± 0.024
AsnTyr: 0.692 ± 0.024
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.408 ± 0.089
ProCys: 0.432 ± 0.017
ProAsp: 3.32 ± 0.062
ProGlu: 3.429 ± 0.053
ProPhe: 2.085 ± 0.045
ProGly: 4.234 ± 0.068
ProHis: 1.154 ± 0.03
ProIle: 2.589 ± 0.05
ProLys: 1.897 ± 0.038
ProLeu: 5.026 ± 0.074
ProMet: 1.123 ± 0.027
ProAsn: 1.416 ± 0.031
ProPro: 3.338 ± 0.074
ProGln: 1.873 ± 0.043
ProArg: 3.57 ± 0.062
ProSer: 3.036 ± 0.055
ProThr: 2.627 ± 0.052
ProVal: 3.472 ± 0.058
ProTrp: 0.697 ± 0.026
ProTyr: 1.176 ± 0.034
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.915 ± 0.062
GlnCys: 0.249 ± 0.017
GlnAsp: 1.343 ± 0.035
GlnGlu: 1.658 ± 0.037
GlnPhe: 1.081 ± 0.03
GlnGly: 2.226 ± 0.045
GlnHis: 0.599 ± 0.023
GlnIle: 1.763 ± 0.036
GlnLys: 1.117 ± 0.03
GlnLeu: 2.737 ± 0.05
GlnMet: 0.729 ± 0.024
GlnAsn: 0.888 ± 0.034
GlnPro: 1.614 ± 0.037
GlnGln: 1.028 ± 0.035
GlnArg: 2.507 ± 0.049
GlnSer: 1.829 ± 0.046
GlnThr: 1.604 ± 0.043
GlnVal: 1.68 ± 0.043
GlnTrp: 0.408 ± 0.018
GlnTyr: 0.571 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.077 ± 0.129
ArgCys: 0.655 ± 0.024
ArgAsp: 4.545 ± 0.067
ArgGlu: 4.889 ± 0.071
ArgPhe: 3.305 ± 0.059
ArgGly: 5.429 ± 0.069
ArgHis: 1.673 ± 0.036
ArgIle: 4.03 ± 0.054
ArgLys: 2.561 ± 0.048
ArgLeu: 8.793 ± 0.114
ArgMet: 1.753 ± 0.037
ArgAsn: 1.994 ± 0.035
ArgPro: 3.886 ± 0.06
ArgGln: 2.437 ± 0.048
ArgArg: 7.321 ± 0.091
ArgSer: 4.159 ± 0.062
ArgThr: 3.003 ± 0.05
ArgVal: 5.06 ± 0.068
ArgTrp: 1.11 ± 0.032
ArgTyr: 1.851 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.098 ± 0.082
SerCys: 0.616 ± 0.023
SerAsp: 3.006 ± 0.047
SerGlu: 3.056 ± 0.057
SerPhe: 2.44 ± 0.048
SerGly: 5.247 ± 0.085
SerHis: 1.092 ± 0.033
SerIle: 2.684 ± 0.054
SerLys: 1.777 ± 0.047
SerLeu: 5.586 ± 0.072
SerMet: 1.178 ± 0.034
SerAsn: 1.364 ± 0.033
SerPro: 3.087 ± 0.053
SerGln: 1.623 ± 0.039
SerArg: 4.201 ± 0.061
SerSer: 3.023 ± 0.061
SerThr: 2.492 ± 0.056
SerVal: 3.901 ± 0.058
SerTrp: 0.742 ± 0.023
SerTyr: 1.284 ± 0.033
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.267 ± 0.065
ThrCys: 0.409 ± 0.018
ThrAsp: 2.19 ± 0.051
ThrGlu: 2.076 ± 0.044
ThrPhe: 1.734 ± 0.038
ThrGly: 4.031 ± 0.066
ThrHis: 0.991 ± 0.03
ThrIle: 2.712 ± 0.053
ThrLys: 1.542 ± 0.038
ThrLeu: 5.61 ± 0.072
ThrMet: 0.955 ± 0.031
ThrAsn: 1.178 ± 0.036
ThrPro: 3.569 ± 0.067
ThrGln: 1.502 ± 0.036
ThrArg: 3.551 ± 0.055
ThrSer: 2.627 ± 0.062
ThrThr: 2.527 ± 0.082
ThrVal: 3.193 ± 0.069
ThrTrp: 0.522 ± 0.02
ThrTyr: 1.009 ± 0.033
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.74 ± 0.116
ValCys: 0.698 ± 0.024
ValAsp: 4.362 ± 0.055
ValGlu: 4.722 ± 0.08
ValPhe: 2.707 ± 0.052
ValGly: 5.497 ± 0.085
ValHis: 1.246 ± 0.035
ValIle: 3.569 ± 0.054
ValLys: 2.461 ± 0.054
ValLeu: 6.169 ± 0.068
ValMet: 1.504 ± 0.036
ValAsn: 1.925 ± 0.039
ValPro: 2.783 ± 0.059
ValGln: 1.718 ± 0.038
ValArg: 4.636 ± 0.065
ValSer: 4.222 ± 0.063
ValThr: 3.871 ± 0.064
ValVal: 5.657 ± 0.084
ValTrp: 0.897 ± 0.029
ValTyr: 1.506 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.233 ± 0.032
TrpCys: 0.127 ± 0.01
TrpAsp: 0.595 ± 0.024
TrpGlu: 0.558 ± 0.024
TrpPhe: 0.523 ± 0.02
TrpGly: 0.927 ± 0.036
TrpHis: 0.262 ± 0.015
TrpIle: 0.662 ± 0.026
TrpLys: 0.453 ± 0.022
TrpLeu: 1.59 ± 0.042
TrpMet: 0.352 ± 0.02
TrpAsn: 0.398 ± 0.021
TrpPro: 0.739 ± 0.026
TrpGln: 0.379 ± 0.017
TrpArg: 1.715 ± 0.038
TrpSer: 0.884 ± 0.026
TrpThr: 0.776 ± 0.022
TrpVal: 0.633 ± 0.025
TrpTrp: 0.224 ± 0.016
TrpTyr: 0.248 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.698 ± 0.041
TyrCys: 0.276 ± 0.014
TyrAsp: 1.469 ± 0.037
TyrGlu: 1.277 ± 0.035
TyrPhe: 1.014 ± 0.03
TyrGly: 2.112 ± 0.044
TyrHis: 0.428 ± 0.02
TyrIle: 0.83 ± 0.029
TyrLys: 0.633 ± 0.023
TyrLeu: 2.244 ± 0.042
TyrMet: 0.391 ± 0.019
TyrAsn: 0.555 ± 0.024
TyrPro: 1.039 ± 0.029
TyrGln: 0.657 ± 0.025
TyrArg: 1.847 ± 0.04
TyrSer: 1.16 ± 0.031
TyrThr: 0.855 ± 0.028
TyrVal: 1.776 ± 0.036
TyrTrp: 0.402 ± 0.019
TyrTyr: 0.623 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4040 proteins (1212217 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
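One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. That the ± values are standard deviations is an assumption, as are the function names, the fixed seed, and the toy sequences (one-letter codes).

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a set of proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up sequences, for illustration only.
toy_proteins = ["MAAL", "MALA", "AAAA", "MKLV"]
point = pair_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.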

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski