Amino acid dipeptide frequency for Glaciecola sp. (strain KUL10)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
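The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the mapping of every non-standard letter to `X` (Xaa), and the toy sequences are all assumptions made for the example. It counts overlapping residue pairs within each protein and compares them with the uniform baseline of 1/400 stated in the text.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those that never occur.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform baseline from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2

toy = ["MAAL", "ALLK"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_per_mille(toy)
over = sorted(d for d, f in freqs.items() if f > UNIFORM_PER_MILLE)
print(len(freqs), UNIFORM_PER_MILLE, over)
```

Dipeptides whose frequency exceeds `UNIFORM_PER_MILLE` correspond to the red cells in the table, and those below it to the blue ones, assuming the colouring uses the uniform baseline described above.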

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
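As a small worked example of the unit conversion, the following Python sketch turns the AlaAla value from the table into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical and only used for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 6.424  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)   # fraction of all dipeptides
percent = fraction * 100                    # the same value as a percentage
ratio_to_uniform = fraction / (1 / 400)     # relative to the uniform 0.25%

print(round(fraction, 6), round(percent, 4), round(ratio_to_uniform, 2))
```

Under these numbers AlaAla occurs roughly 2.6 times as often as the uniform baseline would predict.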

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.424 ± 0.101
AlaCys: 1.008 ± 0.031
AlaAsp: 4.497 ± 0.06
AlaGlu: 4.933 ± 0.073
AlaPhe: 3.786 ± 0.063
AlaGly: 5.234 ± 0.088
AlaHis: 1.555 ± 0.041
AlaIle: 5.895 ± 0.078
AlaLys: 5.037 ± 0.074
AlaLeu: 9.287 ± 0.097
AlaMet: 2.309 ± 0.043
AlaAsn: 4.109 ± 0.056
AlaPro: 2.667 ± 0.052
AlaGln: 3.922 ± 0.057
AlaArg: 3.425 ± 0.048
AlaSer: 6.1 ± 0.073
AlaThr: 4.184 ± 0.062
AlaVal: 5.29 ± 0.069
AlaTrp: 0.893 ± 0.025
AlaTyr: 2.582 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.778 ± 0.023
CysCys: 0.128 ± 0.01
CysAsp: 0.575 ± 0.022
CysGlu: 0.624 ± 0.023
CysPhe: 0.466 ± 0.018
CysGly: 0.676 ± 0.028
CysHis: 0.288 ± 0.015
CysIle: 0.673 ± 0.024
CysLys: 0.486 ± 0.02
CysLeu: 1.002 ± 0.029
CysMet: 0.215 ± 0.014
CysAsn: 0.387 ± 0.018
CysPro: 0.374 ± 0.017
CysGln: 0.364 ± 0.019
CysArg: 0.373 ± 0.019
CysSer: 0.684 ± 0.025
CysThr: 0.426 ± 0.018
CysVal: 0.652 ± 0.025
CysTrp: 0.087 ± 0.007
CysTyr: 0.285 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.744 ± 0.067
AspCys: 0.481 ± 0.023
AspAsp: 3.48 ± 0.066
AspGlu: 4.315 ± 0.065
AspPhe: 2.93 ± 0.051
AspGly: 3.828 ± 0.079
AspHis: 0.952 ± 0.03
AspIle: 4.543 ± 0.064
AspLys: 3.559 ± 0.067
AspLeu: 5.494 ± 0.072
AspMet: 1.477 ± 0.033
AspAsn: 2.762 ± 0.049
AspPro: 2.007 ± 0.048
AspGln: 1.879 ± 0.032
AspArg: 2.067 ± 0.043
AspSer: 3.667 ± 0.057
AspThr: 3.106 ± 0.051
AspVal: 3.945 ± 0.063
AspTrp: 0.875 ± 0.025
AspTyr: 2.192 ± 0.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.183 ± 0.075
GluCys: 0.478 ± 0.021
GluAsp: 3.132 ± 0.059
GluGlu: 3.627 ± 0.066
GluPhe: 2.73 ± 0.051
GluGly: 3.284 ± 0.062
GluHis: 1.642 ± 0.039
GluIle: 4.136 ± 0.067
GluLys: 4.006 ± 0.066
GluLeu: 6.935 ± 0.082
GluMet: 1.596 ± 0.032
GluAsn: 3.157 ± 0.051
GluPro: 1.823 ± 0.044
GluGln: 3.934 ± 0.061
GluArg: 3.041 ± 0.053
GluSer: 4.221 ± 0.066
GluThr: 3.332 ± 0.056
GluVal: 4.253 ± 0.063
GluTrp: 0.712 ± 0.02
GluTyr: 2.053 ± 0.043
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.874 ± 0.059
PheCys: 0.499 ± 0.02
PheAsp: 3.334 ± 0.056
PheGlu: 3.285 ± 0.049
PhePhe: 1.952 ± 0.045
PheGly: 3.239 ± 0.064
PheHis: 0.768 ± 0.026
PheIle: 3.106 ± 0.063
PheLys: 2.499 ± 0.044
PheLeu: 3.618 ± 0.058
PheMet: 1.019 ± 0.028
PheAsn: 2.406 ± 0.043
PhePro: 1.426 ± 0.032
PheGln: 1.253 ± 0.036
PheArg: 1.481 ± 0.035
PheSer: 3.731 ± 0.057
PheThr: 2.463 ± 0.049
PheVal: 3.158 ± 0.059
PheTrp: 0.491 ± 0.018
PheTyr: 1.504 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.031 ± 0.066
GlyCys: 0.703 ± 0.026
GlyAsp: 3.684 ± 0.075
GlyGlu: 4.017 ± 0.057
GlyPhe: 3.528 ± 0.055
GlyGly: 4.345 ± 0.083
GlyHis: 1.402 ± 0.034
GlyIle: 4.501 ± 0.068
GlyLys: 3.755 ± 0.059
GlyLeu: 6.539 ± 0.084
GlyMet: 1.707 ± 0.042
GlyAsn: 2.691 ± 0.054
GlyPro: 1.504 ± 0.036
GlyGln: 2.492 ± 0.051
GlyArg: 2.652 ± 0.043
GlySer: 3.867 ± 0.075
GlyThr: 3.248 ± 0.055
GlyVal: 4.797 ± 0.069
GlyTrp: 0.799 ± 0.025
GlyTyr: 2.29 ± 0.05
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.598 ± 0.038
HisCys: 0.256 ± 0.016
HisAsp: 1.099 ± 0.037
HisGlu: 1.186 ± 0.034
HisPhe: 1.074 ± 0.03
HisGly: 1.296 ± 0.036
HisHis: 0.588 ± 0.025
HisIle: 1.416 ± 0.035
HisLys: 1.159 ± 0.031
HisLeu: 2.135 ± 0.051
HisMet: 0.475 ± 0.018
HisAsn: 0.896 ± 0.029
HisPro: 1.045 ± 0.032
HisGln: 1.039 ± 0.034
HisArg: 0.911 ± 0.027
HisSer: 1.346 ± 0.031
HisThr: 1.053 ± 0.032
HisVal: 1.271 ± 0.035
HisTrp: 0.301 ± 0.017
HisTyr: 0.844 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.258 ± 0.084
IleCys: 0.731 ± 0.025
IleAsp: 4.909 ± 0.069
IleGlu: 5.419 ± 0.065
IlePhe: 2.366 ± 0.052
IleGly: 4.495 ± 0.077
IleHis: 1.175 ± 0.031
IleIle: 3.898 ± 0.059
IleLys: 4.206 ± 0.058
IleLeu: 5.229 ± 0.078
IleMet: 1.284 ± 0.031
IleAsn: 3.736 ± 0.06
IlePro: 2.578 ± 0.046
IleGln: 2.481 ± 0.05
IleArg: 2.539 ± 0.049
IleSer: 5.193 ± 0.069
IleThr: 3.684 ± 0.063
IleVal: 4.321 ± 0.07
IleTrp: 0.619 ± 0.025
IleTyr: 1.761 ± 0.044
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.192 ± 0.082
LysCys: 0.381 ± 0.016
LysAsp: 3.064 ± 0.053
LysGlu: 3.439 ± 0.057
LysPhe: 1.86 ± 0.041
LysGly: 3.473 ± 0.054
LysHis: 1.603 ± 0.039
LysIle: 3.361 ± 0.056
LysLys: 3.268 ± 0.068
LysLeu: 5.829 ± 0.077
LysMet: 1.385 ± 0.034
LysAsn: 2.531 ± 0.046
LysPro: 2.341 ± 0.044
LysGln: 3.518 ± 0.06
LysArg: 2.956 ± 0.045
LysSer: 3.772 ± 0.065
LysThr: 3.356 ± 0.055
LysVal: 3.902 ± 0.065
LysTrp: 0.668 ± 0.024
LysTyr: 1.593 ± 0.039
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.996 ± 0.103
LeuCys: 1.026 ± 0.024
LeuAsp: 5.797 ± 0.068
LeuGlu: 5.968 ± 0.073
LeuPhe: 4.451 ± 0.075
LeuGly: 6.428 ± 0.069
LeuHis: 1.882 ± 0.039
LeuIle: 6.417 ± 0.093
LeuLys: 5.697 ± 0.083
LeuLeu: 9.692 ± 0.118
LeuMet: 2.375 ± 0.042
LeuAsn: 5.365 ± 0.066
LeuPro: 4.313 ± 0.063
LeuGln: 3.64 ± 0.053
LeuArg: 4.15 ± 0.062
LeuSer: 8.798 ± 0.097
LeuThr: 5.434 ± 0.063
LeuVal: 6.755 ± 0.077
LeuTrp: 0.87 ± 0.028
LeuTyr: 2.63 ± 0.049
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.008 ± 0.044
MetCys: 0.208 ± 0.013
MetAsp: 1.123 ± 0.031
MetGlu: 1.02 ± 0.028
MetPhe: 0.983 ± 0.029
MetGly: 1.582 ± 0.035
MetHis: 0.548 ± 0.02
MetIle: 1.467 ± 0.041
MetLys: 1.38 ± 0.035
MetLeu: 2.716 ± 0.05
MetMet: 0.652 ± 0.022
MetAsn: 1.165 ± 0.031
MetPro: 1.217 ± 0.031
MetGln: 1.301 ± 0.033
MetArg: 1.1 ± 0.024
MetSer: 1.929 ± 0.038
MetThr: 1.465 ± 0.037
MetVal: 1.512 ± 0.036
MetTrp: 0.21 ± 0.014
MetTyr: 0.553 ± 0.022
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.143 ± 0.062
AsnCys: 0.441 ± 0.021
AsnAsp: 2.874 ± 0.049
AsnGlu: 3.267 ± 0.045
AsnPhe: 1.884 ± 0.043
AsnGly: 3.256 ± 0.06
AsnHis: 0.965 ± 0.025
AsnIle: 3.343 ± 0.053
AsnLys: 2.99 ± 0.055
AsnLeu: 4.366 ± 0.06
AsnMet: 1.126 ± 0.031
AsnAsn: 2.465 ± 0.05
AsnPro: 2.051 ± 0.043
AsnGln: 2.495 ± 0.045
AsnArg: 2.078 ± 0.04
AsnSer: 3.054 ± 0.051
AsnThr: 2.863 ± 0.051
AsnVal: 2.942 ± 0.049
AsnTrp: 0.596 ± 0.022
AsnTyr: 1.47 ± 0.035
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.618 ± 0.047
ProCys: 0.273 ± 0.014
ProAsp: 2.36 ± 0.044
ProGlu: 2.753 ± 0.053
ProPhe: 1.778 ± 0.038
ProGly: 1.92 ± 0.045
ProHis: 0.699 ± 0.024
ProIle: 2.658 ± 0.051
ProLys: 2.05 ± 0.043
ProLeu: 3.52 ± 0.055
ProMet: 0.863 ± 0.026
ProAsn: 2.052 ± 0.041
ProPro: 1.074 ± 0.032
ProGln: 1.434 ± 0.031
ProArg: 1.22 ± 0.035
ProSer: 2.874 ± 0.049
ProThr: 2.008 ± 0.046
ProVal: 2.545 ± 0.043
ProTrp: 0.397 ± 0.018
ProTyr: 1.117 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.405 ± 0.068
GlnCys: 0.344 ± 0.017
GlnAsp: 2.192 ± 0.048
GlnGlu: 2.435 ± 0.046
GlnPhe: 1.789 ± 0.042
GlnGly: 2.622 ± 0.05
GlnHis: 1.083 ± 0.03
GlnIle: 2.922 ± 0.049
GlnLys: 2.383 ± 0.049
GlnLeu: 4.546 ± 0.069
GlnMet: 1.071 ± 0.03
GlnAsn: 2.047 ± 0.04
GlnPro: 1.317 ± 0.033
GlnGln: 2.654 ± 0.05
GlnArg: 2.14 ± 0.041
GlnSer: 3.268 ± 0.057
GlnThr: 2.711 ± 0.051
GlnVal: 2.974 ± 0.045
GlnTrp: 0.54 ± 0.02
GlnTyr: 1.382 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.262 ± 0.047
ArgCys: 0.349 ± 0.017
ArgAsp: 2.268 ± 0.042
ArgGlu: 2.631 ± 0.05
ArgPhe: 2.328 ± 0.048
ArgGly: 2.376 ± 0.05
ArgHis: 0.909 ± 0.03
ArgIle: 2.913 ± 0.052
ArgLys: 2.283 ± 0.052
ArgLeu: 4.692 ± 0.067
ArgMet: 1.011 ± 0.03
ArgAsn: 1.909 ± 0.038
ArgPro: 1.482 ± 0.036
ArgGln: 1.914 ± 0.039
ArgArg: 1.96 ± 0.044
ArgSer: 2.51 ± 0.048
ArgThr: 1.993 ± 0.044
ArgVal: 3.009 ± 0.047
ArgTrp: 0.518 ± 0.021
ArgTyr: 1.629 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.999 ± 0.079
SerCys: 0.575 ± 0.022
SerAsp: 4.277 ± 0.067
SerGlu: 4.497 ± 0.071
SerPhe: 3.461 ± 0.062
SerGly: 4.834 ± 0.066
SerHis: 1.403 ± 0.035
SerIle: 5.078 ± 0.07
SerLys: 4.066 ± 0.06
SerLeu: 7.799 ± 0.089
SerMet: 1.81 ± 0.038
SerAsn: 3.373 ± 0.058
SerPro: 2.47 ± 0.042
SerGln: 3.139 ± 0.053
SerArg: 2.764 ± 0.052
SerSer: 5.153 ± 0.078
SerThr: 3.813 ± 0.057
SerVal: 5.024 ± 0.07
SerTrp: 0.732 ± 0.023
SerTyr: 2.11 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.081 ± 0.062
ThrCys: 0.478 ± 0.018
ThrAsp: 3.029 ± 0.059
ThrGlu: 3.102 ± 0.052
ThrPhe: 2.302 ± 0.049
ThrGly: 3.797 ± 0.061
ThrHis: 1.283 ± 0.033
ThrIle: 3.59 ± 0.059
ThrLys: 2.809 ± 0.051
ThrLeu: 5.954 ± 0.073
ThrMet: 1.087 ± 0.028
ThrAsn: 2.499 ± 0.045
ThrPro: 2.524 ± 0.045
ThrGln: 2.674 ± 0.05
ThrArg: 2.21 ± 0.036
ThrSer: 3.82 ± 0.058
ThrThr: 2.752 ± 0.053
ThrVal: 3.483 ± 0.062
ThrTrp: 0.635 ± 0.026
ThrTyr: 1.584 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.465 ± 0.089
ValCys: 0.734 ± 0.026
ValAsp: 4.206 ± 0.058
ValGlu: 4.329 ± 0.079
ValPhe: 3.273 ± 0.053
ValGly: 4.225 ± 0.065
ValHis: 1.228 ± 0.034
ValIle: 4.636 ± 0.061
ValLys: 3.747 ± 0.047
ValLeu: 6.573 ± 0.092
ValMet: 1.685 ± 0.037
ValAsn: 3.48 ± 0.066
ValPro: 2.379 ± 0.044
ValGln: 2.115 ± 0.04
ValArg: 2.623 ± 0.054
ValSer: 5.423 ± 0.062
ValThr: 3.761 ± 0.064
ValVal: 4.652 ± 0.067
ValTrp: 0.67 ± 0.022
ValTyr: 1.888 ± 0.038
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.704 ± 0.024
TrpCys: 0.119 ± 0.011
TrpAsp: 0.6 ± 0.026
TrpGlu: 0.546 ± 0.02
TrpPhe: 0.594 ± 0.02
TrpGly: 0.704 ± 0.025
TrpHis: 0.3 ± 0.017
TrpIle: 0.62 ± 0.023
TrpLys: 0.491 ± 0.02
TrpLeu: 1.45 ± 0.037
TrpMet: 0.338 ± 0.017
TrpAsn: 0.435 ± 0.02
TrpPro: 0.387 ± 0.02
TrpGln: 0.78 ± 0.027
TrpArg: 0.636 ± 0.025
TrpSer: 0.692 ± 0.026
TrpThr: 0.505 ± 0.019
TrpVal: 0.758 ± 0.021
TrpTrp: 0.166 ± 0.013
TrpTyr: 0.379 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.373 ± 0.042
TyrCys: 0.334 ± 0.015
TyrAsp: 1.802 ± 0.043
TyrGlu: 1.884 ± 0.036
TyrPhe: 1.571 ± 0.037
TyrGly: 1.939 ± 0.04
TyrHis: 0.715 ± 0.025
TyrIle: 1.706 ± 0.04
TyrLys: 1.621 ± 0.037
TyrLeu: 3.311 ± 0.059
TyrMet: 0.654 ± 0.022
TyrAsn: 1.204 ± 0.033
TyrPro: 1.234 ± 0.03
TyrGln: 1.798 ± 0.035
TyrArg: 1.619 ± 0.04
TyrSer: 2.274 ± 0.044
TyrThr: 1.535 ± 0.034
TyrVal: 1.847 ± 0.038
TyrTrp: 0.425 ± 0.022
TyrTyr: 1.025 ± 0.032
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3631 proteins (1250739 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
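A protein-level bootstrap of this kind can be sketched as follows. The page states only that bootstrapping was repeated 100 times at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the reported error is taken to be the standard deviation across resamples. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Std. dev. of one dipeptide's per-mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse below
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported "±" is the standard deviation of the estimates.
    return statistics.stdev(estimates)


toy = ["MAALK", "AAGLL", "MKALA", "LLAAV"]  # made-up sequences
err = bootstrap_error(toy, "AA")
print(err)
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, which is one plausible reading of "at the protein level".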

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski