Amino acid dipeptide frequency for Thermolongibacillus altinsuensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
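As an illustration, dipeptide frequencies can be computed from a set of protein sequences roughly as follows. This is a minimal sketch, not the code used by Proteome-pI: the function names are hypothetical, and it assumes one-letter sequences, overlapping adjacent pairs counted within each protein, and any non-standard letter mapped to X (Xaa).

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_counts(sequence):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    # Assumption: anything outside the 20 standard letters becomes 'X' (Xaa).
    residues = [aa if aa in STANDARD else "X" for aa in sequence.upper()]
    return Counter(a + b for a, b in zip(residues, residues[1:]))

def dipeptide_per_mille(sequences):
    """Pool counts over all proteins and return frequencies in per mille."""
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}

if __name__ == "__main__":
    # Two toy "proteins": pairs are AA, AC (first) and AC (second).
    for pair, freq in sorted(dipeptide_per_mille(["AAC", "AC"]).items()):
        print(pair, round(freq, 3))
```

Pairs are never formed across protein boundaries in this sketch, since the last residue of one protein is not adjacent to the first residue of the next.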

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins randomly and with equal probability, each of the 400 would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
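The expected value and the over/under-representation labels can be reproduced with a few lines of arithmetic. The sketch below assumes that the colouring threshold is simply the uniform expectation described above; the function names are hypothetical and the two observed values are taken from the table (LeuLeu and CysCys).

```python
def uniform_expectation_per_mille(n_residue_types=20):
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "equal"

if __name__ == "__main__":
    expected = uniform_expectation_per_mille()  # 1000 / 400 = 2.5 per mille
    print(classify(10.472, expected))  # LeuLeu
    print(classify(0.122, expected))   # CysCys
```

With 21 residue types (including Xaa) the same calculation gives 1000 / 441, or about 2.27‰ per dipeptide.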

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.889 ± 0.114
AlaCys: 0.676 ± 0.027
AlaAsp: 3.17 ± 0.064
AlaGlu: 5.003 ± 0.085
AlaPhe: 3.367 ± 0.074
AlaGly: 4.842 ± 0.081
AlaHis: 1.594 ± 0.046
AlaIle: 6.268 ± 0.085
AlaLys: 5.65 ± 0.089
AlaLeu: 8.025 ± 0.113
AlaMet: 2.226 ± 0.054
AlaAsn: 2.81 ± 0.065
AlaPro: 2.134 ± 0.055
AlaGln: 2.547 ± 0.059
AlaArg: 3.145 ± 0.059
AlaSer: 3.743 ± 0.059
AlaThr: 3.604 ± 0.072
AlaVal: 5.524 ± 0.094
AlaTrp: 0.66 ± 0.03
AlaTyr: 2.548 ± 0.059
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.544 ± 0.029
CysCys: 0.122 ± 0.013
CysAsp: 0.451 ± 0.024
CysGlu: 0.579 ± 0.029
CysPhe: 0.371 ± 0.023
CysGly: 0.706 ± 0.034
CysHis: 0.225 ± 0.016
CysIle: 0.526 ± 0.026
CysLys: 0.38 ± 0.021
CysLeu: 0.751 ± 0.031
CysMet: 0.204 ± 0.018
CysAsn: 0.293 ± 0.018
CysPro: 0.445 ± 0.023
CysGln: 0.283 ± 0.022
CysArg: 0.367 ± 0.024
CysSer: 0.519 ± 0.022
CysThr: 0.435 ± 0.022
CysVal: 0.455 ± 0.025
CysTrp: 0.064 ± 0.007
CysTyr: 0.276 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.065 ± 0.062
AspCys: 0.41 ± 0.023
AspAsp: 2.229 ± 0.055
AspGlu: 4.874 ± 0.084
AspPhe: 2.123 ± 0.052
AspGly: 3.181 ± 0.075
AspHis: 1.199 ± 0.038
AspIle: 3.539 ± 0.073
AspLys: 2.121 ± 0.049
AspLeu: 4.766 ± 0.081
AspMet: 1.073 ± 0.035
AspAsn: 1.139 ± 0.039
AspPro: 2.055 ± 0.053
AspGln: 1.703 ± 0.056
AspArg: 2.41 ± 0.058
AspSer: 1.869 ± 0.05
AspThr: 1.88 ± 0.053
AspVal: 4.201 ± 0.073
AspTrp: 0.58 ± 0.031
AspTyr: 1.952 ± 0.05
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.2 ± 0.098
GluCys: 0.441 ± 0.027
GluAsp: 3.114 ± 0.063
GluGlu: 7.71 ± 0.139
GluPhe: 2.425 ± 0.054
GluGly: 4.174 ± 0.075
GluHis: 1.903 ± 0.047
GluIle: 5.812 ± 0.096
GluLys: 7.623 ± 0.112
GluLeu: 7.03 ± 0.115
GluMet: 2.58 ± 0.054
GluAsn: 3.35 ± 0.062
GluPro: 2.056 ± 0.054
GluGln: 4.698 ± 0.099
GluArg: 4.675 ± 0.091
GluSer: 2.743 ± 0.065
GluThr: 4.051 ± 0.074
GluVal: 5.112 ± 0.092
GluTrp: 0.959 ± 0.033
GluTyr: 2.228 ± 0.058
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.47 ± 0.067
PheCys: 0.373 ± 0.02
PheAsp: 2.204 ± 0.045
PheGlu: 2.756 ± 0.055
PhePhe: 2.539 ± 0.068
PheGly: 3.309 ± 0.072
PheHis: 1.174 ± 0.035
PheIle: 3.771 ± 0.078
PheLys: 2.068 ± 0.054
PheLeu: 4.782 ± 0.095
PheMet: 1.181 ± 0.04
PheAsn: 1.427 ± 0.044
PhePro: 1.754 ± 0.045
PheGln: 1.713 ± 0.048
PheArg: 1.779 ± 0.054
PheSer: 3.036 ± 0.063
PheThr: 2.403 ± 0.058
PheVal: 3.542 ± 0.066
PheTrp: 0.482 ± 0.027
PheTyr: 1.696 ± 0.058
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.796 ± 0.102
GlyCys: 0.651 ± 0.029
GlyAsp: 2.999 ± 0.056
GlyGlu: 4.614 ± 0.078
GlyPhe: 3.255 ± 0.073
GlyGly: 4.525 ± 0.096
GlyHis: 1.354 ± 0.043
GlyIle: 5.631 ± 0.094
GlyLys: 5.263 ± 0.081
GlyLeu: 6.066 ± 0.102
GlyMet: 2.095 ± 0.053
GlyAsn: 2.397 ± 0.064
GlyPro: 1.682 ± 0.043
GlyGln: 2.13 ± 0.053
GlyArg: 2.814 ± 0.071
GlySer: 3.241 ± 0.061
GlyThr: 3.803 ± 0.077
GlyVal: 5.058 ± 0.075
GlyTrp: 0.768 ± 0.034
GlyTyr: 2.653 ± 0.059
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.472 ± 0.046
HisCys: 0.239 ± 0.017
HisAsp: 1.035 ± 0.037
HisGlu: 1.658 ± 0.038
HisPhe: 1.134 ± 0.036
HisGly: 1.498 ± 0.046
HisHis: 0.821 ± 0.035
HisIle: 1.722 ± 0.04
HisLys: 1.01 ± 0.036
HisLeu: 2.558 ± 0.054
HisMet: 0.582 ± 0.022
HisAsn: 0.689 ± 0.029
HisPro: 1.356 ± 0.042
HisGln: 0.893 ± 0.037
HisArg: 1.166 ± 0.039
HisSer: 1.241 ± 0.035
HisThr: 1.001 ± 0.035
HisVal: 1.793 ± 0.041
HisTrp: 0.263 ± 0.016
HisTyr: 1.0 ± 0.036
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.554 ± 0.106
IleCys: 0.714 ± 0.026
IleAsp: 4.336 ± 0.08
IleGlu: 6.422 ± 0.094
IlePhe: 3.229 ± 0.067
IleGly: 6.027 ± 0.099
IleHis: 1.864 ± 0.048
IleIle: 5.477 ± 0.106
IleLys: 4.124 ± 0.071
IleLeu: 6.537 ± 0.105
IleMet: 1.562 ± 0.045
IleAsn: 2.68 ± 0.052
IlePro: 3.463 ± 0.069
IleGln: 3.045 ± 0.058
IleArg: 3.701 ± 0.071
IleSer: 4.432 ± 0.081
IleThr: 3.885 ± 0.068
IleVal: 6.353 ± 0.094
IleTrp: 0.676 ± 0.033
IleTyr: 2.473 ± 0.052
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.038 ± 0.088
LysCys: 0.378 ± 0.023
LysAsp: 3.071 ± 0.067
LysGlu: 7.377 ± 0.108
LysPhe: 1.834 ± 0.049
LysGly: 4.55 ± 0.069
LysHis: 1.439 ± 0.039
LysIle: 4.844 ± 0.083
LysLys: 6.454 ± 0.096
LysLeu: 5.674 ± 0.09
LysMet: 2.236 ± 0.052
LysAsn: 3.024 ± 0.073
LysPro: 2.278 ± 0.055
LysGln: 3.827 ± 0.079
LysArg: 4.049 ± 0.072
LysSer: 2.78 ± 0.058
LysThr: 3.691 ± 0.066
LysVal: 4.705 ± 0.087
LysTrp: 0.951 ± 0.035
LysTyr: 2.201 ± 0.049
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.07 ± 0.105
LeuCys: 0.745 ± 0.028
LeuAsp: 4.5 ± 0.075
LeuGlu: 6.882 ± 0.125
LeuPhe: 4.972 ± 0.109
LeuGly: 6.205 ± 0.095
LeuHis: 2.249 ± 0.051
LeuIle: 6.876 ± 0.112
LeuLys: 6.872 ± 0.093
LeuLeu: 10.472 ± 0.136
LeuMet: 2.342 ± 0.056
LeuAsn: 3.813 ± 0.071
LeuPro: 4.039 ± 0.076
LeuGln: 4.02 ± 0.074
LeuArg: 4.404 ± 0.074
LeuSer: 6.191 ± 0.094
LeuThr: 5.462 ± 0.071
LeuVal: 6.348 ± 0.087
LeuTrp: 0.833 ± 0.035
LeuTyr: 3.251 ± 0.068
LeuXaa: 0.001 ± 0.001
Met
MetAla: 2.134 ± 0.05
MetCys: 0.149 ± 0.014
MetAsp: 1.248 ± 0.041
MetGlu: 2.038 ± 0.048
MetPhe: 1.13 ± 0.042
MetGly: 1.595 ± 0.048
MetHis: 0.415 ± 0.022
MetIle: 2.324 ± 0.056
MetLys: 2.703 ± 0.057
MetLeu: 2.519 ± 0.067
MetMet: 1.015 ± 0.044
MetAsn: 1.607 ± 0.041
MetPro: 1.038 ± 0.045
MetGln: 0.908 ± 0.042
MetArg: 1.304 ± 0.045
MetSer: 1.473 ± 0.045
MetThr: 1.609 ± 0.047
MetVal: 1.671 ± 0.044
MetTrp: 0.218 ± 0.016
MetTyr: 0.764 ± 0.032
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.599 ± 0.056
AsnCys: 0.269 ± 0.018
AsnAsp: 2.073 ± 0.053
AsnGlu: 3.69 ± 0.072
AsnPhe: 1.393 ± 0.041
AsnGly: 3.204 ± 0.075
AsnHis: 0.968 ± 0.032
AsnIle: 3.195 ± 0.065
AsnLys: 2.414 ± 0.06
AsnLeu: 3.11 ± 0.058
AsnMet: 1.067 ± 0.037
AsnAsn: 1.47 ± 0.05
AsnPro: 2.033 ± 0.055
AsnGln: 1.426 ± 0.043
AsnArg: 1.921 ± 0.046
AsnSer: 1.553 ± 0.044
AsnThr: 1.689 ± 0.049
AsnVal: 3.116 ± 0.068
AsnTrp: 0.42 ± 0.023
AsnTyr: 1.301 ± 0.042
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.123 ± 0.047
ProCys: 0.256 ± 0.016
ProAsp: 1.8 ± 0.051
ProGlu: 2.736 ± 0.07
ProPhe: 2.175 ± 0.058
ProGly: 2.11 ± 0.06
ProHis: 1.016 ± 0.037
ProIle: 3.039 ± 0.055
ProLys: 2.609 ± 0.049
ProLeu: 3.937 ± 0.081
ProMet: 0.888 ± 0.038
ProAsn: 1.924 ± 0.05
ProPro: 1.141 ± 0.044
ProGln: 1.118 ± 0.033
ProArg: 1.261 ± 0.038
ProSer: 2.375 ± 0.057
ProThr: 2.119 ± 0.057
ProVal: 2.654 ± 0.057
ProTrp: 0.397 ± 0.022
ProTyr: 1.598 ± 0.051
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.041 ± 0.067
GlnCys: 0.278 ± 0.021
GlnAsp: 1.333 ± 0.043
GlnGlu: 2.921 ± 0.072
GlnPhe: 1.813 ± 0.043
GlnGly: 2.03 ± 0.054
GlnHis: 0.917 ± 0.033
GlnIle: 2.826 ± 0.069
GlnLys: 3.174 ± 0.07
GlnLeu: 4.84 ± 0.087
GlnMet: 1.344 ± 0.042
GlnAsn: 1.426 ± 0.044
GlnPro: 1.379 ± 0.056
GlnGln: 2.285 ± 0.065
GlnArg: 1.76 ± 0.053
GlnSer: 2.029 ± 0.049
GlnThr: 2.219 ± 0.063
GlnVal: 2.189 ± 0.054
GlnTrp: 0.5 ± 0.022
GlnTyr: 1.487 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.981 ± 0.064
ArgCys: 0.367 ± 0.025
ArgAsp: 2.115 ± 0.048
ArgGlu: 3.852 ± 0.084
ArgPhe: 2.267 ± 0.046
ArgGly: 2.682 ± 0.057
ArgHis: 1.01 ± 0.033
ArgIle: 3.485 ± 0.07
ArgLys: 3.645 ± 0.073
ArgLeu: 4.938 ± 0.086
ArgMet: 1.503 ± 0.041
ArgAsn: 1.967 ± 0.048
ArgPro: 1.595 ± 0.045
ArgGln: 1.96 ± 0.047
ArgArg: 2.213 ± 0.057
ArgSer: 2.329 ± 0.059
ArgThr: 2.267 ± 0.052
ArgVal: 2.908 ± 0.068
ArgTrp: 0.557 ± 0.024
ArgTyr: 1.893 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.273 ± 0.071
SerCys: 0.41 ± 0.022
SerAsp: 2.317 ± 0.057
SerGlu: 3.591 ± 0.067
SerPhe: 3.292 ± 0.071
SerGly: 3.684 ± 0.071
SerHis: 1.152 ± 0.035
SerIle: 4.283 ± 0.075
SerLys: 3.018 ± 0.066
SerLeu: 5.99 ± 0.098
SerMet: 1.459 ± 0.043
SerAsn: 1.842 ± 0.043
SerPro: 2.075 ± 0.049
SerGln: 1.619 ± 0.047
SerArg: 2.074 ± 0.057
SerSer: 3.14 ± 0.067
SerThr: 2.455 ± 0.055
SerVal: 3.749 ± 0.075
SerTrp: 0.528 ± 0.024
SerTyr: 1.938 ± 0.047
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.679 ± 0.063
ThrCys: 0.396 ± 0.022
ThrAsp: 2.292 ± 0.057
ThrGlu: 3.425 ± 0.067
ThrPhe: 2.75 ± 0.064
ThrGly: 3.686 ± 0.07
ThrHis: 1.028 ± 0.034
ThrIle: 4.701 ± 0.081
ThrLys: 3.568 ± 0.062
ThrLeu: 5.093 ± 0.081
ThrMet: 1.398 ± 0.043
ThrAsn: 2.478 ± 0.059
ThrPro: 2.114 ± 0.048
ThrGln: 1.175 ± 0.04
ThrArg: 1.779 ± 0.046
ThrSer: 2.645 ± 0.06
ThrThr: 2.747 ± 0.071
ThrVal: 4.184 ± 0.085
ThrTrp: 0.495 ± 0.025
ThrTyr: 1.956 ± 0.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.521 ± 0.098
ValCys: 0.68 ± 0.03
ValAsp: 3.685 ± 0.072
ValGlu: 4.988 ± 0.084
ValPhe: 3.101 ± 0.067
ValGly: 4.64 ± 0.084
ValHis: 1.648 ± 0.047
ValIle: 5.777 ± 0.082
ValLys: 4.831 ± 0.071
ValLeu: 6.89 ± 0.104
ValMet: 1.835 ± 0.046
ValAsn: 2.817 ± 0.06
ValPro: 2.979 ± 0.062
ValGln: 2.799 ± 0.057
ValArg: 3.306 ± 0.072
ValSer: 4.178 ± 0.066
ValThr: 4.026 ± 0.071
ValVal: 5.575 ± 0.097
ValTrp: 0.685 ± 0.032
ValTyr: 2.525 ± 0.059
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.655 ± 0.035
TrpCys: 0.076 ± 0.01
TrpAsp: 0.445 ± 0.024
TrpGlu: 0.683 ± 0.03
TrpPhe: 0.527 ± 0.029
TrpGly: 0.618 ± 0.029
TrpHis: 0.253 ± 0.021
TrpIle: 0.942 ± 0.034
TrpLys: 0.828 ± 0.037
TrpLeu: 1.258 ± 0.044
TrpMet: 0.407 ± 0.021
TrpAsn: 0.528 ± 0.027
TrpPro: 0.275 ± 0.02
TrpGln: 0.39 ± 0.025
TrpArg: 0.476 ± 0.024
TrpSer: 0.535 ± 0.027
TrpThr: 0.547 ± 0.025
TrpVal: 0.624 ± 0.028
TrpTrp: 0.152 ± 0.013
TrpTyr: 0.351 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.323 ± 0.056
TyrCys: 0.365 ± 0.021
TyrAsp: 1.953 ± 0.052
TyrGlu: 3.084 ± 0.065
TyrPhe: 1.765 ± 0.052
TyrGly: 2.528 ± 0.058
TyrHis: 0.903 ± 0.034
TyrIle: 2.519 ± 0.055
TyrLys: 2.042 ± 0.054
TyrLeu: 3.311 ± 0.067
TyrMet: 0.866 ± 0.031
TyrAsn: 1.228 ± 0.041
TyrPro: 1.385 ± 0.043
TyrGln: 1.25 ± 0.037
TyrArg: 1.929 ± 0.044
TyrSer: 1.837 ± 0.04
TyrThr: 1.696 ± 0.05
TyrVal: 2.722 ± 0.052
TyrTrp: 0.384 ± 0.021
TyrTyr: 1.433 ± 0.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.001
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.004 ± 0.004
Statistics based on 2961 proteins (844262 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
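The exact bootstrap procedure is not described here. One plausible reading is sketched below: whole proteins are resampled with replacement, the frequency of a dipeptide is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter

def pair_counts(sequence):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(sequence, sequence[1:]))

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resamples.

    Resampling is done at the protein level: each replicate draws
    len(sequences) proteins with replacement. Requires n_replicates >= 2.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(counts[pair] for counts in sample)
        total = sum(sum(counts.values()) for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

if __name__ == "__main__":
    toy_proteome = ["AAC", "ACC", "AAA", "CAC"]
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.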

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski