Amino acid dipeptide frequency for Salinisphaera hydrothermalis (strain C41B8)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency: how often each ordered pair of adjacent amino acids occurs in the proteome.
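As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping adjacent pairs in a list of sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter codes, and the toy sequences are illustrative assumptions; the page itself does not describe its counting procedure in detail.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences.

    Overlapping adjacent pairs are counted within each sequence only,
    so no pair spans two different proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code, not taken from the proteome.
freqs = dipeptide_frequencies(["MAAL", "AAG"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1))
```

The order of the two residues matters here: "AL" and "LA" are counted as different dipeptides, which matches the table, where AlaLeu and LeuAla have separate entries.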

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
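The expected value under this uniform assumption, and the resulting over/under labelling, can be sketched as follows. The function `classify` and the simple greater-than or less-than threshold are assumptions made for illustration; the three observed values are copied from the table.

```python
# Expected frequency, in per mille, if all dipeptides were equally likely.
expected_20 = 1000.0 / 20 ** 2   # 400 dipeptides -> 2.5 per mille
expected_21 = 1000.0 / 21 ** 2   # 441 dipeptides (with Xaa) -> about 2.27 per mille


def classify(observed, expected=expected_20):
    """Label a per-mille frequency relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


# Observed values (per mille) taken from the table.
observed = {"AlaAla": 17.507, "ProPro": 2.292, "CysCys": 0.117}
labels = {pair: classify(value) for pair, value in observed.items()}
print(labels)
```

Under this threshold AlaAla is overrepresented, while ProPro and CysCys fall below the 2.5‰ expectation.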

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
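For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The helper names below are hypothetical, and the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 17.507  # AlaAla, per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(fraction, percent)
```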

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.507 ± 0.18
AlaCys: 1.198 ± 0.036
AlaAsp: 8.187 ± 0.085
AlaGlu: 7.583 ± 0.094
AlaPhe: 4.002 ± 0.068
AlaGly: 11.093 ± 0.115
AlaHis: 3.023 ± 0.048
AlaIle: 6.297 ± 0.075
AlaLys: 3.15 ± 0.063
AlaLeu: 13.577 ± 0.134
AlaMet: 3.271 ± 0.052
AlaAsn: 3.073 ± 0.055
AlaPro: 5.444 ± 0.08
AlaGln: 4.065 ± 0.057
AlaArg: 9.736 ± 0.11
AlaSer: 6.546 ± 0.082
AlaThr: 6.374 ± 0.081
AlaVal: 8.644 ± 0.099
AlaTrp: 1.792 ± 0.038
AlaTyr: 2.788 ± 0.055
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.987 ± 0.03
CysCys: 0.117 ± 0.011
CysAsp: 0.608 ± 0.023
CysGlu: 0.446 ± 0.02
CysPhe: 0.336 ± 0.016
CysGly: 0.889 ± 0.027
CysHis: 0.27 ± 0.015
CysIle: 0.388 ± 0.017
CysLys: 0.174 ± 0.014
CysLeu: 0.845 ± 0.027
CysMet: 0.175 ± 0.014
CysAsn: 0.214 ± 0.014
CysPro: 0.474 ± 0.021
CysGln: 0.251 ± 0.014
CysArg: 0.723 ± 0.025
CysSer: 0.433 ± 0.02
CysThr: 0.381 ± 0.019
CysVal: 0.749 ± 0.024
CysTrp: 0.126 ± 0.012
CysTyr: 0.251 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.899 ± 0.089
AspCys: 0.546 ± 0.023
AspAsp: 4.376 ± 0.061
AspGlu: 3.844 ± 0.063
AspPhe: 2.158 ± 0.042
AspGly: 5.041 ± 0.075
AspHis: 1.603 ± 0.038
AspIle: 3.117 ± 0.051
AspLys: 1.856 ± 0.045
AspLeu: 5.55 ± 0.069
AspMet: 1.483 ± 0.038
AspAsn: 1.939 ± 0.037
AspPro: 3.154 ± 0.047
AspGln: 2.029 ± 0.045
AspArg: 4.648 ± 0.071
AspSer: 2.841 ± 0.059
AspThr: 3.5 ± 0.058
AspVal: 4.4 ± 0.068
AspTrp: 1.154 ± 0.029
AspTyr: 1.952 ± 0.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.18 ± 0.094
GluCys: 0.399 ± 0.02
GluAsp: 2.541 ± 0.051
GluGlu: 2.074 ± 0.053
GluPhe: 1.644 ± 0.033
GluGly: 3.147 ± 0.057
GluHis: 1.696 ± 0.036
GluIle: 3.051 ± 0.062
GluLys: 1.542 ± 0.043
GluLeu: 5.496 ± 0.077
GluMet: 1.228 ± 0.029
GluAsn: 1.446 ± 0.039
GluPro: 2.593 ± 0.05
GluGln: 2.385 ± 0.049
GluArg: 5.191 ± 0.069
GluSer: 2.799 ± 0.05
GluThr: 3.178 ± 0.057
GluVal: 3.227 ± 0.066
GluTrp: 0.646 ± 0.024
GluTyr: 1.292 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.204 ± 0.059
PheCys: 0.372 ± 0.02
PheAsp: 2.77 ± 0.053
PheGlu: 2.022 ± 0.044
PhePhe: 1.337 ± 0.042
PheGly: 3.34 ± 0.06
PheHis: 0.776 ± 0.027
PheIle: 1.656 ± 0.043
PheLys: 0.949 ± 0.029
PheLeu: 2.732 ± 0.052
PheMet: 0.804 ± 0.024
PheAsn: 1.123 ± 0.032
PhePro: 1.262 ± 0.035
PheGln: 0.929 ± 0.029
PheArg: 1.835 ± 0.035
PheSer: 1.987 ± 0.04
PheThr: 1.823 ± 0.045
PheVal: 2.949 ± 0.052
PheTrp: 0.532 ± 0.022
PheTyr: 0.978 ± 0.026
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.93 ± 0.108
GlyCys: 0.875 ± 0.026
GlyAsp: 4.873 ± 0.066
GlyGlu: 4.422 ± 0.066
GlyPhe: 3.253 ± 0.049
GlyGly: 6.811 ± 0.095
GlyHis: 2.366 ± 0.048
GlyIle: 4.179 ± 0.062
GlyLys: 2.45 ± 0.053
GlyLeu: 8.803 ± 0.107
GlyMet: 2.335 ± 0.051
GlyAsn: 2.198 ± 0.049
GlyPro: 3.321 ± 0.052
GlyGln: 3.181 ± 0.056
GlyArg: 6.013 ± 0.08
GlySer: 4.311 ± 0.07
GlyThr: 4.054 ± 0.065
GlyVal: 6.488 ± 0.08
GlyTrp: 1.48 ± 0.036
GlyTyr: 2.554 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.993 ± 0.047
HisCys: 0.313 ± 0.015
HisAsp: 1.694 ± 0.038
HisGlu: 1.293 ± 0.033
HisPhe: 0.911 ± 0.029
HisGly: 2.421 ± 0.045
HisHis: 0.77 ± 0.026
HisIle: 1.245 ± 0.038
HisLys: 0.616 ± 0.026
HisLeu: 2.337 ± 0.048
HisMet: 0.58 ± 0.02
HisAsn: 0.643 ± 0.023
HisPro: 1.523 ± 0.04
HisGln: 0.768 ± 0.024
HisArg: 1.89 ± 0.044
HisSer: 1.075 ± 0.028
HisThr: 1.114 ± 0.031
HisVal: 1.862 ± 0.042
HisTrp: 0.469 ± 0.02
HisTyr: 0.808 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.119 ± 0.078
IleCys: 0.486 ± 0.022
IleAsp: 4.232 ± 0.059
IleGlu: 3.636 ± 0.06
IlePhe: 1.475 ± 0.037
IleGly: 5.035 ± 0.067
IleHis: 1.101 ± 0.033
IleIle: 1.82 ± 0.044
IleLys: 1.448 ± 0.034
IleLeu: 3.532 ± 0.057
IleMet: 0.915 ± 0.027
IleAsn: 1.556 ± 0.037
IlePro: 2.064 ± 0.041
IleGln: 1.433 ± 0.033
IleArg: 3.102 ± 0.053
IleSer: 2.526 ± 0.044
IleThr: 2.618 ± 0.056
IleVal: 4.099 ± 0.064
IleTrp: 0.601 ± 0.023
IleTyr: 1.191 ± 0.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.29 ± 0.071
LysCys: 0.143 ± 0.01
LysAsp: 1.219 ± 0.039
LysGlu: 0.98 ± 0.034
LysPhe: 0.632 ± 0.024
LysGly: 1.821 ± 0.051
LysHis: 0.692 ± 0.025
LysIle: 1.409 ± 0.034
LysLys: 1.229 ± 0.046
LysLeu: 2.728 ± 0.062
LysMet: 0.529 ± 0.021
LysAsn: 0.903 ± 0.031
LysPro: 1.798 ± 0.049
LysGln: 1.341 ± 0.043
LysArg: 2.322 ± 0.047
LysSer: 1.651 ± 0.043
LysThr: 1.75 ± 0.039
LysVal: 1.741 ± 0.038
LysTrp: 0.284 ± 0.015
LysTyr: 0.651 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.497 ± 0.163
LeuCys: 0.894 ± 0.027
LeuAsp: 6.655 ± 0.073
LeuGlu: 4.927 ± 0.067
LeuPhe: 3.596 ± 0.061
LeuGly: 8.238 ± 0.103
LeuHis: 2.172 ± 0.041
LeuIle: 5.247 ± 0.073
LeuLys: 2.829 ± 0.048
LeuLeu: 9.27 ± 0.137
LeuMet: 2.333 ± 0.048
LeuAsn: 2.586 ± 0.048
LeuPro: 5.119 ± 0.067
LeuGln: 2.573 ± 0.051
LeuArg: 6.461 ± 0.079
LeuSer: 6.084 ± 0.071
LeuThr: 5.575 ± 0.074
LeuVal: 7.373 ± 0.085
LeuTrp: 1.3 ± 0.034
LeuTyr: 2.365 ± 0.05
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.217 ± 0.059
MetCys: 0.165 ± 0.012
MetAsp: 1.133 ± 0.031
MetGlu: 0.861 ± 0.028
MetPhe: 0.726 ± 0.025
MetGly: 1.708 ± 0.039
MetHis: 0.601 ± 0.023
MetIle: 1.281 ± 0.033
MetLys: 0.737 ± 0.026
MetLeu: 2.43 ± 0.049
MetMet: 0.551 ± 0.026
MetAsn: 0.774 ± 0.024
MetPro: 1.413 ± 0.032
MetGln: 0.885 ± 0.028
MetArg: 1.797 ± 0.04
MetSer: 1.746 ± 0.037
MetThr: 1.555 ± 0.032
MetVal: 1.511 ± 0.038
MetTrp: 0.174 ± 0.012
MetTyr: 0.372 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.556 ± 0.058
AsnCys: 0.22 ± 0.014
AsnAsp: 1.674 ± 0.04
AsnGlu: 1.326 ± 0.037
AsnPhe: 0.876 ± 0.03
AsnGly: 2.553 ± 0.057
AsnHis: 0.614 ± 0.026
AsnIle: 1.292 ± 0.032
AsnLys: 0.805 ± 0.025
AsnLeu: 2.592 ± 0.05
AsnMet: 0.568 ± 0.022
AsnAsn: 0.966 ± 0.032
AsnPro: 1.722 ± 0.037
AsnGln: 0.96 ± 0.033
AsnArg: 2.016 ± 0.041
AsnSer: 1.256 ± 0.041
AsnThr: 1.492 ± 0.035
AsnVal: 2.016 ± 0.044
AsnTrp: 0.409 ± 0.021
AsnTyr: 0.717 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.27 ± 0.086
ProCys: 0.311 ± 0.016
ProAsp: 3.701 ± 0.064
ProGlu: 3.203 ± 0.052
ProPhe: 1.57 ± 0.039
ProGly: 4.46 ± 0.063
ProHis: 1.119 ± 0.035
ProIle: 2.296 ± 0.043
ProLys: 1.427 ± 0.042
ProLeu: 4.349 ± 0.058
ProMet: 1.165 ± 0.027
ProAsn: 1.412 ± 0.039
ProPro: 2.292 ± 0.058
ProGln: 1.467 ± 0.042
ProArg: 3.075 ± 0.054
ProSer: 2.693 ± 0.049
ProThr: 2.525 ± 0.051
ProVal: 3.893 ± 0.05
ProTrp: 0.761 ± 0.032
ProTyr: 1.224 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.616 ± 0.072
GlnCys: 0.257 ± 0.013
GlnAsp: 1.399 ± 0.04
GlnGlu: 1.047 ± 0.029
GlnPhe: 1.124 ± 0.034
GlnGly: 2.305 ± 0.039
GlnHis: 0.975 ± 0.027
GlnIle: 1.934 ± 0.043
GlnLys: 0.982 ± 0.034
GlnLeu: 3.542 ± 0.065
GlnMet: 0.797 ± 0.024
GlnAsn: 0.907 ± 0.031
GlnPro: 2.021 ± 0.042
GlnGln: 1.619 ± 0.048
GlnArg: 3.102 ± 0.062
GlnSer: 1.997 ± 0.045
GlnThr: 1.935 ± 0.041
GlnVal: 2.271 ± 0.047
GlnTrp: 0.491 ± 0.021
GlnTyr: 0.868 ± 0.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.459 ± 0.101
ArgCys: 0.641 ± 0.026
ArgAsp: 4.558 ± 0.065
ArgGlu: 4.36 ± 0.071
ArgPhe: 2.841 ± 0.05
ArgGly: 4.978 ± 0.064
ArgHis: 2.163 ± 0.046
ArgIle: 3.923 ± 0.056
ArgLys: 1.689 ± 0.04
ArgLeu: 8.541 ± 0.092
ArgMet: 1.845 ± 0.041
ArgAsn: 1.776 ± 0.035
ArgPro: 3.401 ± 0.055
ArgGln: 2.851 ± 0.056
ArgArg: 6.643 ± 0.089
ArgSer: 3.457 ± 0.052
ArgThr: 3.028 ± 0.053
ArgVal: 5.514 ± 0.065
ArgTrp: 1.271 ± 0.032
ArgTyr: 2.357 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.533 ± 0.077
SerCys: 0.392 ± 0.019
SerAsp: 3.254 ± 0.058
SerGlu: 2.637 ± 0.055
SerPhe: 1.846 ± 0.045
SerGly: 5.195 ± 0.073
SerHis: 1.2 ± 0.035
SerIle: 2.627 ± 0.048
SerLys: 1.425 ± 0.037
SerLeu: 5.243 ± 0.069
SerMet: 1.422 ± 0.029
SerAsn: 1.626 ± 0.039
SerPro: 2.646 ± 0.056
SerGln: 1.971 ± 0.042
SerArg: 3.746 ± 0.061
SerSer: 3.191 ± 0.056
SerThr: 2.715 ± 0.052
SerVal: 3.752 ± 0.059
SerTrp: 0.702 ± 0.024
SerTyr: 1.256 ± 0.038
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.275 ± 0.078
ThrCys: 0.399 ± 0.019
ThrAsp: 3.096 ± 0.058
ThrGlu: 2.572 ± 0.047
ThrPhe: 1.639 ± 0.038
ThrGly: 5.003 ± 0.074
ThrHis: 1.384 ± 0.033
ThrIle: 2.535 ± 0.048
ThrLys: 1.071 ± 0.031
ThrLeu: 6.243 ± 0.076
ThrMet: 1.052 ± 0.028
ThrAsn: 1.253 ± 0.033
ThrPro: 3.345 ± 0.055
ThrGln: 1.766 ± 0.04
ThrArg: 3.752 ± 0.056
ThrSer: 2.514 ± 0.05
ThrThr: 2.824 ± 0.053
ThrVal: 4.047 ± 0.064
ThrTrp: 0.614 ± 0.025
ThrTyr: 1.155 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.538 ± 0.098
ValCys: 0.69 ± 0.025
ValAsp: 4.831 ± 0.069
ValGlu: 3.881 ± 0.078
ValPhe: 2.844 ± 0.049
ValGly: 5.607 ± 0.066
ValHis: 1.666 ± 0.043
ValIle: 3.907 ± 0.07
ValLys: 1.738 ± 0.043
ValLeu: 7.537 ± 0.087
ValMet: 1.716 ± 0.038
ValAsn: 2.108 ± 0.052
ValPro: 3.611 ± 0.057
ValGln: 1.922 ± 0.047
ValArg: 4.61 ± 0.063
ValSer: 4.151 ± 0.065
ValThr: 4.153 ± 0.059
ValVal: 6.011 ± 0.087
ValTrp: 0.944 ± 0.027
ValTyr: 1.768 ± 0.038
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.472 ± 0.035
TrpCys: 0.14 ± 0.011
TrpAsp: 0.567 ± 0.02
TrpGlu: 0.398 ± 0.016
TrpPhe: 0.563 ± 0.025
TrpGly: 0.902 ± 0.029
TrpHis: 0.516 ± 0.021
TrpIle: 0.758 ± 0.028
TrpLys: 0.309 ± 0.018
TrpLeu: 2.18 ± 0.052
TrpMet: 0.398 ± 0.019
TrpAsn: 0.393 ± 0.017
TrpPro: 0.779 ± 0.027
TrpGln: 0.795 ± 0.026
TrpArg: 1.344 ± 0.032
TrpSer: 0.803 ± 0.027
TrpThr: 0.636 ± 0.028
TrpVal: 0.906 ± 0.03
TrpTrp: 0.256 ± 0.016
TrpTyr: 0.306 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.809 ± 0.049
TyrCys: 0.279 ± 0.017
TyrAsp: 1.719 ± 0.036
TyrGlu: 1.245 ± 0.035
TyrPhe: 1.006 ± 0.037
TyrGly: 2.349 ± 0.045
TyrHis: 0.619 ± 0.023
TyrIle: 1.025 ± 0.027
TyrLys: 0.69 ± 0.025
TyrLeu: 2.569 ± 0.051
TyrMet: 0.47 ± 0.021
TyrAsn: 0.735 ± 0.029
TyrPro: 1.229 ± 0.036
TyrGln: 0.995 ± 0.03
TyrArg: 2.29 ± 0.045
TyrSer: 1.317 ± 0.035
TyrThr: 1.305 ± 0.033
TyrVal: 1.782 ± 0.041
TyrTrp: 0.42 ± 0.019
TyrTyr: 0.78 ± 0.028
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
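Each table entry carries a dipeptide name, its frequency in per mille, and an error, as in `AlaAla: 17.507 ± 0.18`. A rough sketch for reading such entries into a lookup keyed by (first residue, second residue) is given below; the regular expression and the name `parse_entries` are assumptions for illustration, and the sample lines are copied from the table.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)"
)


def parse_entries(lines):
    """Map (first, second) residue codes to (value, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match is None:
            continue  # row labels such as "Ala" carry no entry
        first, second, value, error = match.groups()
        table[(first, second)] = (float(value), float(error))
    return table


sample = [
    "Ala",
    "AlaAla: 17.507 ± 0.18",
    "AlaCys: 1.198 ± 0.036",
    "Cys",
    "CysAla: 0.987 ± 0.03",
]
table = parse_entries(sample)
print(table[("Ala", "Cys")], table[("Cys", "Ala")])
```

Keying on the ordered pair keeps AlaCys and CysAla apart, which matters because their frequencies differ.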
Statistics based on 3807 proteins (1233367 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
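One plausible reading of a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The exact procedure used by the database is not described here, so the function `bootstrap_error`, the use of the standard deviation, and the toy sequences are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count the dipeptides of each protein once, then resample the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1))
        for seq in sequences
    ]
    estimates = []
    for _ in range(n_replicates):
        resampled = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in resampled)
        hits = sum(counts[pair] for counts in resampled)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy proteome in one-letter code, not taken from the database.
toy = ["MAAL", "AAG", "GGGG", "MAAAK"]
error = bootstrap_error(toy, "AA")
print(round(error, 1))
```

Resampling whole proteins keeps dipeptides from the same protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as independent.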

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski