Amino acid dipeptide frequency for Salinimicrobium catena

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
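As a rough illustration of the comparison described above, the following sketch counts dipeptides and labels each one relative to the uniform expectation of 2.5‰. It assumes that dipeptides are counted as overlapping pairs within each protein and that the red/blue marking is a simple comparison against 1/400; the page does not state either detail, and the function names and toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille for each of 400 dipeptides


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted within each protein only, never across
    protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLE", "LLKA"]
freqs = dipeptide_permille(toy)

# Values from the table on this page: LeuLeu (9.704) and CysCys (0.099).
print(classify(9.704), classify(0.099))
```

With real data the input would be the protein sequences of the proteome, for example parsed from a FASTA file.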

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
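A short worked conversion may help here. The sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400; the helper name is hypothetical, and the LeuLeu value is the one listed in the table on this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


leu_leu_permille = 9.704                       # LeuLeu entry from the table
fraction = permille_to_fraction(leu_leu_permille)
percent = fraction * 100.0

# Ratio to the uniform expectation for 400 equally likely dipeptides.
fold = fraction / (1.0 / 400)

print(f"fraction={fraction:.6f} percent={percent:.4f} fold={fold:.2f}")
```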

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.45 ± 0.084
AlaCys: 0.513 ± 0.028
AlaAsp: 3.436 ± 0.069
AlaGlu: 5.33 ± 0.088
AlaPhe: 3.38 ± 0.062
AlaGly: 4.99 ± 0.079
AlaHis: 1.213 ± 0.036
AlaIle: 4.85 ± 0.077
AlaLys: 4.294 ± 0.069
AlaLeu: 6.366 ± 0.09
AlaMet: 1.647 ± 0.043
AlaAsn: 2.96 ± 0.054
AlaPro: 2.23 ± 0.051
AlaGln: 2.404 ± 0.051
AlaArg: 2.656 ± 0.05
AlaSer: 3.976 ± 0.086
AlaThr: 3.562 ± 0.072
AlaVal: 4.897 ± 0.08
AlaTrp: 0.665 ± 0.025
AlaTyr: 2.398 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.383 ± 0.022
CysCys: 0.099 ± 0.011
CysAsp: 0.395 ± 0.02
CysGlu: 0.482 ± 0.031
CysPhe: 0.341 ± 0.019
CysGly: 0.566 ± 0.028
CysHis: 0.147 ± 0.013
CysIle: 0.463 ± 0.025
CysLys: 0.346 ± 0.02
CysLeu: 0.535 ± 0.025
CysMet: 0.138 ± 0.013
CysAsn: 0.292 ± 0.02
CysPro: 0.339 ± 0.029
CysGln: 0.183 ± 0.014
CysArg: 0.253 ± 0.017
CysSer: 0.487 ± 0.023
CysThr: 0.392 ± 0.019
CysVal: 0.379 ± 0.02
CysTrp: 0.05 ± 0.007
CysTyr: 0.262 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.437 ± 0.068
AspCys: 0.363 ± 0.02
AspAsp: 2.64 ± 0.057
AspGlu: 4.086 ± 0.07
AspPhe: 3.479 ± 0.057
AspGly: 3.307 ± 0.07
AspHis: 1.246 ± 0.035
AspIle: 3.675 ± 0.072
AspLys: 3.426 ± 0.062
AspLeu: 6.195 ± 0.091
AspMet: 1.122 ± 0.034
AspAsn: 2.37 ± 0.059
AspPro: 2.426 ± 0.059
AspGln: 1.799 ± 0.047
AspArg: 2.417 ± 0.047
AspSer: 3.05 ± 0.059
AspThr: 2.516 ± 0.051
AspVal: 3.428 ± 0.068
AspTrp: 0.786 ± 0.029
AspTyr: 2.625 ± 0.054
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.39 ± 0.093
GluCys: 0.339 ± 0.021
GluAsp: 4.495 ± 0.063
GluGlu: 7.792 ± 0.111
GluPhe: 3.48 ± 0.064
GluGly: 4.841 ± 0.08
GluHis: 1.357 ± 0.04
GluIle: 6.079 ± 0.078
GluLys: 7.195 ± 0.107
GluLeu: 7.163 ± 0.106
GluMet: 2.051 ± 0.051
GluAsn: 4.899 ± 0.075
GluPro: 2.108 ± 0.049
GluGln: 2.755 ± 0.059
GluArg: 3.273 ± 0.06
GluSer: 3.2 ± 0.051
GluThr: 3.753 ± 0.061
GluVal: 5.657 ± 0.083
GluTrp: 0.772 ± 0.026
GluTyr: 2.682 ± 0.055
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.126 ± 0.065
PheCys: 0.405 ± 0.019
PheAsp: 3.041 ± 0.054
PheGlu: 3.673 ± 0.051
PhePhe: 2.951 ± 0.07
PheGly: 3.648 ± 0.07
PheHis: 0.902 ± 0.034
PheIle: 3.423 ± 0.067
PheLys: 3.07 ± 0.059
PheLeu: 5.344 ± 0.087
PheMet: 1.166 ± 0.037
PheAsn: 2.537 ± 0.056
PhePro: 1.981 ± 0.045
PheGln: 1.676 ± 0.037
PheArg: 2.173 ± 0.051
PheSer: 4.25 ± 0.084
PheThr: 3.028 ± 0.056
PheVal: 2.778 ± 0.058
PheTrp: 0.625 ± 0.025
PheTyr: 2.094 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.434 ± 0.067
GlyCys: 0.567 ± 0.027
GlyAsp: 3.472 ± 0.069
GlyGlu: 4.741 ± 0.077
GlyPhe: 3.558 ± 0.069
GlyGly: 4.672 ± 0.084
GlyHis: 1.199 ± 0.031
GlyIle: 5.413 ± 0.084
GlyLys: 5.241 ± 0.075
GlyLeu: 5.916 ± 0.091
GlyMet: 1.872 ± 0.046
GlyAsn: 3.444 ± 0.073
GlyPro: 1.552 ± 0.043
GlyGln: 1.962 ± 0.056
GlyArg: 2.511 ± 0.056
GlySer: 4.091 ± 0.074
GlyThr: 3.829 ± 0.077
GlyVal: 4.501 ± 0.067
GlyTrp: 0.805 ± 0.033
GlyTyr: 2.747 ± 0.064
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.967 ± 0.032
HisCys: 0.164 ± 0.013
HisAsp: 0.896 ± 0.029
HisGlu: 1.168 ± 0.041
HisPhe: 1.234 ± 0.035
HisGly: 1.169 ± 0.037
HisHis: 0.495 ± 0.023
HisIle: 1.271 ± 0.035
HisLys: 1.126 ± 0.041
HisLeu: 2.057 ± 0.049
HisMet: 0.34 ± 0.019
HisAsn: 0.858 ± 0.031
HisPro: 1.016 ± 0.032
HisGln: 0.717 ± 0.027
HisArg: 0.851 ± 0.029
HisSer: 1.169 ± 0.039
HisThr: 0.96 ± 0.029
HisVal: 1.008 ± 0.031
HisTrp: 0.217 ± 0.016
HisTyr: 0.878 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.06 ± 0.082
IleCys: 0.477 ± 0.023
IleAsp: 4.119 ± 0.065
IleGlu: 4.971 ± 0.081
IlePhe: 3.778 ± 0.068
IleGly: 4.537 ± 0.082
IleHis: 1.252 ± 0.035
IleIle: 4.881 ± 0.085
IleLys: 4.697 ± 0.085
IleLeu: 6.869 ± 0.096
IleMet: 1.266 ± 0.035
IleAsn: 3.686 ± 0.062
IlePro: 3.186 ± 0.06
IleGln: 2.153 ± 0.047
IleArg: 2.893 ± 0.061
IleSer: 5.232 ± 0.074
IleThr: 3.973 ± 0.076
IleVal: 4.068 ± 0.075
IleTrp: 0.676 ± 0.024
IleTyr: 2.618 ± 0.058
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.835 ± 0.084
LysCys: 0.335 ± 0.018
LysAsp: 4.435 ± 0.072
LysGlu: 6.989 ± 0.117
LysPhe: 2.878 ± 0.054
LysGly: 4.422 ± 0.073
LysHis: 1.179 ± 0.032
LysIle: 5.291 ± 0.087
LysLys: 6.794 ± 0.108
LysLeu: 6.141 ± 0.089
LysMet: 2.041 ± 0.043
LysAsn: 4.246 ± 0.069
LysPro: 2.39 ± 0.047
LysGln: 2.402 ± 0.049
LysArg: 3.085 ± 0.056
LysSer: 3.543 ± 0.062
LysThr: 3.759 ± 0.06
LysVal: 4.683 ± 0.072
LysTrp: 0.728 ± 0.031
LysTyr: 2.728 ± 0.052
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.318 ± 0.092
LeuCys: 0.594 ± 0.028
LeuAsp: 5.079 ± 0.079
LeuGlu: 7.414 ± 0.096
LeuPhe: 5.019 ± 0.095
LeuGly: 6.099 ± 0.089
LeuHis: 1.774 ± 0.041
LeuIle: 6.318 ± 0.085
LeuLys: 7.617 ± 0.1
LeuLeu: 9.704 ± 0.132
LeuMet: 2.169 ± 0.043
LeuAsn: 4.967 ± 0.072
LeuPro: 3.921 ± 0.058
LeuGln: 4.038 ± 0.086
LeuArg: 3.909 ± 0.071
LeuSer: 6.542 ± 0.09
LeuThr: 4.781 ± 0.077
LeuVal: 5.793 ± 0.064
LeuTrp: 0.885 ± 0.034
LeuTyr: 3.131 ± 0.066
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.898 ± 0.046
MetCys: 0.13 ± 0.012
MetAsp: 1.213 ± 0.036
MetGlu: 1.821 ± 0.043
MetPhe: 0.851 ± 0.031
MetGly: 1.611 ± 0.039
MetHis: 0.419 ± 0.023
MetIle: 1.545 ± 0.047
MetLys: 2.264 ± 0.054
MetLeu: 2.08 ± 0.059
MetMet: 0.623 ± 0.03
MetAsn: 1.316 ± 0.035
MetPro: 0.901 ± 0.034
MetGln: 0.826 ± 0.027
MetArg: 1.045 ± 0.03
MetSer: 1.247 ± 0.037
MetThr: 1.136 ± 0.029
MetVal: 1.475 ± 0.038
MetTrp: 0.176 ± 0.013
MetTyr: 0.658 ± 0.026
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.189 ± 0.061
AsnCys: 0.37 ± 0.023
AsnAsp: 2.706 ± 0.063
AsnGlu: 3.548 ± 0.06
AsnPhe: 2.935 ± 0.061
AsnGly: 3.332 ± 0.074
AsnHis: 0.94 ± 0.031
AsnIle: 3.835 ± 0.061
AsnLys: 3.286 ± 0.055
AsnLeu: 5.028 ± 0.067
AsnMet: 1.194 ± 0.033
AsnAsn: 2.636 ± 0.062
AsnPro: 2.455 ± 0.05
AsnGln: 1.624 ± 0.044
AsnArg: 2.343 ± 0.054
AsnSer: 3.268 ± 0.058
AsnThr: 2.803 ± 0.062
AsnVal: 3.092 ± 0.058
AsnTrp: 0.709 ± 0.028
AsnTyr: 2.383 ± 0.05
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.543 ± 0.057
ProCys: 0.18 ± 0.015
ProAsp: 2.294 ± 0.054
ProGlu: 3.827 ± 0.064
ProPhe: 1.95 ± 0.051
ProGly: 2.519 ± 0.055
ProHis: 0.737 ± 0.029
ProIle: 2.184 ± 0.046
ProLys: 2.426 ± 0.052
ProLeu: 3.286 ± 0.055
ProMet: 0.756 ± 0.028
ProAsn: 1.744 ± 0.043
ProPro: 0.997 ± 0.035
ProGln: 1.538 ± 0.039
ProArg: 1.154 ± 0.038
ProSer: 2.147 ± 0.054
ProThr: 1.768 ± 0.044
ProVal: 3.153 ± 0.057
ProTrp: 0.368 ± 0.021
ProTyr: 1.455 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.263 ± 0.052
GlnCys: 0.138 ± 0.012
GlnAsp: 1.771 ± 0.041
GlnGlu: 3.099 ± 0.058
GlnPhe: 1.505 ± 0.043
GlnGly: 1.882 ± 0.05
GlnHis: 0.611 ± 0.026
GlnIle: 2.373 ± 0.05
GlnLys: 3.176 ± 0.058
GlnLeu: 3.499 ± 0.073
GlnMet: 0.902 ± 0.029
GlnAsn: 1.945 ± 0.047
GlnPro: 1.258 ± 0.043
GlnGln: 1.659 ± 0.048
GlnArg: 1.483 ± 0.042
GlnSer: 1.611 ± 0.044
GlnThr: 1.68 ± 0.037
GlnVal: 2.288 ± 0.048
GlnTrp: 0.366 ± 0.02
GlnTyr: 1.255 ± 0.037
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.386 ± 0.053
ArgCys: 0.181 ± 0.014
ArgAsp: 2.115 ± 0.047
ArgGlu: 3.463 ± 0.067
ArgPhe: 2.178 ± 0.048
ArgGly: 2.303 ± 0.048
ArgHis: 0.832 ± 0.031
ArgIle: 3.057 ± 0.063
ArgLys: 3.586 ± 0.067
ArgLeu: 3.926 ± 0.069
ArgMet: 1.074 ± 0.035
ArgAsn: 2.448 ± 0.054
ArgPro: 1.357 ± 0.042
ArgGln: 1.475 ± 0.039
ArgArg: 1.847 ± 0.047
ArgSer: 2.606 ± 0.058
ArgThr: 2.105 ± 0.044
ArgVal: 2.411 ± 0.056
ArgTrp: 0.464 ± 0.027
ArgTyr: 1.706 ± 0.048
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.118 ± 0.067
SerCys: 0.572 ± 0.03
SerAsp: 2.99 ± 0.06
SerGlu: 4.906 ± 0.084
SerPhe: 3.595 ± 0.064
SerGly: 4.772 ± 0.085
SerHis: 1.103 ± 0.033
SerIle: 4.246 ± 0.07
SerLys: 3.897 ± 0.068
SerLeu: 5.985 ± 0.085
SerMet: 1.334 ± 0.039
SerAsn: 2.792 ± 0.053
SerPro: 2.233 ± 0.05
SerGln: 1.968 ± 0.042
SerArg: 2.701 ± 0.059
SerSer: 4.083 ± 0.076
SerThr: 3.176 ± 0.057
SerVal: 3.873 ± 0.06
SerTrp: 0.711 ± 0.03
SerTyr: 2.508 ± 0.06
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.081 ± 0.06
ThrCys: 0.312 ± 0.021
ThrAsp: 2.838 ± 0.063
ThrGlu: 3.671 ± 0.061
ThrPhe: 2.704 ± 0.057
ThrGly: 4.556 ± 0.079
ThrHis: 0.955 ± 0.032
ThrIle: 3.663 ± 0.072
ThrLys: 2.668 ± 0.056
ThrLeu: 4.737 ± 0.074
ThrMet: 0.937 ± 0.035
ThrAsn: 2.35 ± 0.051
ThrPro: 2.384 ± 0.051
ThrGln: 1.543 ± 0.043
ThrArg: 2.161 ± 0.051
ThrSer: 3.514 ± 0.063
ThrThr: 3.028 ± 0.07
ThrVal: 3.626 ± 0.074
ThrTrp: 0.562 ± 0.029
ThrTyr: 2.069 ± 0.057
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.445 ± 0.09
ValCys: 0.417 ± 0.025
ValAsp: 3.603 ± 0.06
ValGlu: 4.823 ± 0.074
ValPhe: 3.126 ± 0.058
ValGly: 3.952 ± 0.061
ValHis: 1.147 ± 0.036
ValIle: 4.817 ± 0.079
ValLys: 4.447 ± 0.072
ValLeu: 6.273 ± 0.082
ValMet: 1.545 ± 0.041
ValAsn: 3.366 ± 0.066
ValPro: 2.461 ± 0.056
ValGln: 2.182 ± 0.045
ValArg: 2.404 ± 0.056
ValSer: 4.24 ± 0.064
ValThr: 3.534 ± 0.072
ValVal: 4.538 ± 0.076
ValTrp: 0.659 ± 0.025
ValTyr: 2.439 ± 0.054
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.645 ± 0.03
TrpCys: 0.078 ± 0.008
TrpAsp: 0.607 ± 0.024
TrpGlu: 0.815 ± 0.03
TrpPhe: 0.58 ± 0.025
TrpGly: 0.662 ± 0.024
TrpHis: 0.238 ± 0.017
TrpIle: 0.742 ± 0.03
TrpLys: 0.842 ± 0.027
TrpLeu: 1.058 ± 0.038
TrpMet: 0.341 ± 0.017
TrpAsn: 0.69 ± 0.029
TrpPro: 0.3 ± 0.021
TrpGln: 0.412 ± 0.021
TrpArg: 0.463 ± 0.023
TrpSer: 0.602 ± 0.023
TrpThr: 0.501 ± 0.025
TrpVal: 0.653 ± 0.026
TrpTrp: 0.179 ± 0.014
TrpTyr: 0.482 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.248 ± 0.044
TyrCys: 0.3 ± 0.019
TyrAsp: 2.249 ± 0.05
TyrGlu: 2.664 ± 0.053
TyrPhe: 2.368 ± 0.047
TyrGly: 2.685 ± 0.055
TyrHis: 0.815 ± 0.028
TyrIle: 2.265 ± 0.044
TyrLys: 2.504 ± 0.052
TyrLeu: 3.909 ± 0.072
TyrMet: 0.716 ± 0.031
TyrAsn: 2.129 ± 0.051
TyrPro: 1.532 ± 0.043
TyrGln: 1.451 ± 0.038
TyrArg: 1.9 ± 0.038
TyrSer: 2.705 ± 0.061
TyrThr: 2.069 ± 0.057
TyrVal: 2.12 ± 0.046
TyrTrp: 0.489 ± 0.021
TyrTyr: 1.656 ± 0.046
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2965 proteins (981291 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
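To make the note above more concrete, here is a minimal sketch of a protein-level bootstrap. It assumes that each of the 100 replicates resamples whole proteins with replacement and that the reported error is the standard deviation across replicates; the page does not specify the exact procedure, so both choices and all function names are assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so residues from one
    protein always stay together (resampling "at the protein level").
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)   # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical): identical proteins give zero bootstrap spread.
mean_ll, sd_ll = bootstrap_error(["LLK"] * 5, "LL")
print(mean_ll, sd_ll)
```

Resampling proteins rather than individual residues keeps the correlation between dipeptides of the same protein, which is presumably why the error is reported at that level.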

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski