Amino acid dipeptide frequency for Enhydrobacter aerosaccus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
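
As an illustration of how such frequencies could be computed, here is a minimal Python sketch. It assumes overlapping windows of two residues, pairs that never span two proteins, one-letter codes, and every non-standard letter mapped to X (Xaa). The function name and the toy sequences are hypothetical, and this is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" of two short sequences.
toy_proteome = ["MKAAL", "AALV"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting within each protein separately avoids creating artificial pairs from the last residue of one protein and the first residue of the next.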

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
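
The arithmetic behind this baseline can be sketched in a few lines of Python. The sketch assumes that "more than expected" simply means above the uniform baseline of 1/400; the helper names are hypothetical, and the four example values are taken from the table on this page.

```python
N_STANDARD = 20

# Uniform baseline: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD ** 2

# With Xaa as a 21st symbol there are 21 * 21 = 441 possible pairs.
expected_permille_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2


def to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform baseline (an assumption)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table on this page.
examples = {"AlaAla": 17.145, "LeuLeu": 10.261, "CysCys": 0.125, "TrpTrp": 0.254}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_permille, labels)
```

Under this assumption AlaAla and LeuLeu fall above the baseline, while CysCys and TrpTrp fall below it.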

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| 1st↓ 2nd→ | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 17.145 ± 0.129 | 1.136 ± 0.025 | 6.666 ± 0.057 | 7.028 ± 0.066 | 4.567 ± 0.052 | 10.434 ± 0.077 | 2.275 ± 0.038 | 6.322 ± 0.059 | 4.072 ± 0.058 | 13.535 ± 0.098 | 3.412 ± 0.039 | 2.885 ± 0.044 | 5.792 ± 0.06 | 3.998 ± 0.051 | 8.688 ± 0.072 | 6.192 ± 0.061 | 6.275 ± 0.063 | 9.032 ± 0.069 | 1.675 ± 0.031 | 2.644 ± 0.037 | 0.0 ± 0.0 |
| Cys | 0.988 ± 0.024 | 0.125 ± 0.009 | 0.537 ± 0.017 | 0.418 ± 0.014 | 0.369 ± 0.016 | 0.995 ± 0.022 | 0.255 ± 0.015 | 0.401 ± 0.014 | 0.22 ± 0.01 | 0.852 ± 0.018 | 0.155 ± 0.009 | 0.221 ± 0.011 | 0.466 ± 0.015 | 0.243 ± 0.01 | 0.696 ± 0.017 | 0.51 ± 0.017 | 0.46 ± 0.017 | 0.628 ± 0.018 | 0.137 ± 0.007 | 0.207 ± 0.01 | 0.0 ± 0.0 |
| Asp | 6.055 ± 0.062 | 0.479 ± 0.017 | 2.799 ± 0.045 | 3.015 ± 0.04 | 2.118 ± 0.032 | 4.951 ± 0.055 | 1.252 ± 0.026 | 2.819 ± 0.034 | 1.913 ± 0.042 | 5.731 ± 0.058 | 1.287 ± 0.028 | 1.216 ± 0.027 | 3.53 ± 0.042 | 1.611 ± 0.029 | 4.587 ± 0.054 | 2.219 ± 0.033 | 2.455 ± 0.037 | 4.128 ± 0.045 | 1.04 ± 0.022 | 1.471 ± 0.032 | 0.0 ± 0.0 |
| Glu | 7.064 ± 0.07 | 0.342 ± 0.014 | 2.161 ± 0.034 | 2.852 ± 0.049 | 1.644 ± 0.024 | 3.989 ± 0.052 | 1.183 ± 0.024 | 3.215 ± 0.04 | 2.149 ± 0.031 | 5.179 ± 0.049 | 1.433 ± 0.025 | 1.223 ± 0.032 | 2.554 ± 0.036 | 2.105 ± 0.033 | 4.899 ± 0.063 | 2.269 ± 0.033 | 2.938 ± 0.039 | 3.754 ± 0.048 | 0.712 ± 0.019 | 1.007 ± 0.021 | 0.0 ± 0.0 |
| Phe | 4.56 ± 0.05 | 0.445 ± 0.015 | 2.589 ± 0.037 | 2.016 ± 0.032 | 1.41 ± 0.027 | 3.805 ± 0.042 | 0.789 ± 0.022 | 1.616 ± 0.027 | 1.098 ± 0.023 | 3.413 ± 0.05 | 0.757 ± 0.018 | 1.071 ± 0.025 | 1.66 ± 0.031 | 0.988 ± 0.024 | 2.212 ± 0.036 | 2.055 ± 0.035 | 2.065 ± 0.034 | 3.027 ± 0.038 | 0.576 ± 0.017 | 0.897 ± 0.02 | 0.0 ± 0.0 |
| Gly | 9.233 ± 0.068 | 0.872 ± 0.024 | 4.121 ± 0.051 | 4.208 ± 0.053 | 3.624 ± 0.042 | 7.58 ± 0.11 | 1.947 ± 0.031 | 4.508 ± 0.056 | 3.423 ± 0.045 | 9.342 ± 0.084 | 2.31 ± 0.032 | 2.163 ± 0.036 | 3.929 ± 0.046 | 2.976 ± 0.042 | 6.478 ± 0.063 | 4.67 ± 0.054 | 4.789 ± 0.069 | 6.312 ± 0.057 | 1.568 ± 0.031 | 2.397 ± 0.038 | 0.0 ± 0.0 |
| His | 2.291 ± 0.034 | 0.253 ± 0.012 | 1.225 ± 0.029 | 1.014 ± 0.026 | 0.866 ± 0.022 | 1.98 ± 0.037 | 0.6 ± 0.02 | 0.967 ± 0.021 | 0.561 ± 0.018 | 2.106 ± 0.034 | 0.48 ± 0.015 | 0.511 ± 0.015 | 1.454 ± 0.025 | 0.573 ± 0.019 | 1.579 ± 0.033 | 0.898 ± 0.022 | 0.891 ± 0.022 | 1.572 ± 0.029 | 0.404 ± 0.013 | 0.598 ± 0.018 | 0.0 ± 0.0 |
| Ile | 7.073 ± 0.068 | 0.529 ± 0.019 | 3.64 ± 0.043 | 3.371 ± 0.034 | 1.554 ± 0.029 | 5.012 ± 0.051 | 0.862 ± 0.018 | 1.736 ± 0.034 | 1.48 ± 0.028 | 4.138 ± 0.054 | 0.875 ± 0.021 | 1.313 ± 0.029 | 2.285 ± 0.034 | 1.245 ± 0.026 | 2.95 ± 0.04 | 2.501 ± 0.03 | 2.379 ± 0.035 | 4.83 ± 0.057 | 0.607 ± 0.017 | 1.125 ± 0.025 | 0.0 ± 0.0 |
| Lys | 4.382 ± 0.056 | 0.172 ± 0.009 | 1.791 ± 0.034 | 1.749 ± 0.033 | 0.965 ± 0.022 | 2.744 ± 0.043 | 0.624 ± 0.018 | 1.775 ± 0.035 | 1.407 ± 0.034 | 3.493 ± 0.047 | 0.786 ± 0.02 | 0.75 ± 0.016 | 2.29 ± 0.036 | 0.99 ± 0.019 | 2.515 ± 0.034 | 1.691 ± 0.032 | 1.819 ± 0.03 | 2.714 ± 0.04 | 0.444 ± 0.017 | 0.702 ± 0.019 | 0.0 ± 0.0 |
| Leu | 14.003 ± 0.103 | 0.977 ± 0.023 | 5.96 ± 0.058 | 5.091 ± 0.05 | 3.682 ± 0.048 | 8.899 ± 0.077 | 1.984 ± 0.032 | 4.707 ± 0.053 | 3.714 ± 0.048 | 10.261 ± 0.104 | 2.437 ± 0.033 | 2.37 ± 0.04 | 5.975 ± 0.057 | 3.068 ± 0.042 | 7.153 ± 0.066 | 6.183 ± 0.059 | 5.224 ± 0.049 | 7.947 ± 0.07 | 1.318 ± 0.027 | 2.215 ± 0.032 | 0.0 ± 0.0 |
| Met | 3.298 ± 0.04 | 0.178 ± 0.01 | 1.07 ± 0.023 | 1.019 ± 0.022 | 0.734 ± 0.018 | 1.859 ± 0.028 | 0.419 ± 0.015 | 1.297 ± 0.02 | 1.009 ± 0.022 | 2.444 ± 0.027 | 0.654 ± 0.018 | 0.685 ± 0.019 | 1.577 ± 0.03 | 0.787 ± 0.018 | 1.795 ± 0.029 | 1.629 ± 0.029 | 1.811 ± 0.024 | 1.754 ± 0.03 | 0.244 ± 0.01 | 0.341 ± 0.014 | 0.0 ± 0.0 |
| Asn | 2.962 ± 0.041 | 0.259 ± 0.012 | 1.318 ± 0.028 | 1.161 ± 0.026 | 0.926 ± 0.02 | 2.352 ± 0.043 | 0.502 ± 0.016 | 1.2 ± 0.025 | 0.751 ± 0.019 | 2.458 ± 0.034 | 0.576 ± 0.017 | 0.713 ± 0.02 | 1.776 ± 0.031 | 0.75 ± 0.018 | 1.755 ± 0.031 | 1.086 ± 0.029 | 1.226 ± 0.026 | 2.036 ± 0.036 | 0.425 ± 0.015 | 0.672 ± 0.021 | 0.0 ± 0.0 |
| Pro | 6.557 ± 0.068 | 0.337 ± 0.012 | 3.647 ± 0.041 | 3.424 ± 0.043 | 2.046 ± 0.033 | 4.727 ± 0.043 | 1.166 ± 0.028 | 2.48 ± 0.036 | 1.866 ± 0.035 | 5.2 ± 0.052 | 1.364 ± 0.028 | 1.428 ± 0.026 | 3.248 ± 0.06 | 1.782 ± 0.027 | 3.273 ± 0.042 | 3.095 ± 0.039 | 2.975 ± 0.041 | 4.261 ± 0.041 | 0.864 ± 0.022 | 1.342 ± 0.026 | 0.0 ± 0.0 |
| Gln | 4.315 ± 0.057 | 0.196 ± 0.01 | 1.301 ± 0.022 | 1.413 ± 0.03 | 1.083 ± 0.023 | 2.441 ± 0.034 | 0.685 ± 0.019 | 1.669 ± 0.028 | 1.123 ± 0.029 | 2.991 ± 0.039 | 0.839 ± 0.02 | 0.772 ± 0.018 | 1.942 ± 0.032 | 1.333 ± 0.027 | 2.685 ± 0.04 | 1.737 ± 0.03 | 1.69 ± 0.027 | 2.272 ± 0.04 | 0.456 ± 0.016 | 0.651 ± 0.018 | 0.0 ± 0.0 |
| Arg | 7.974 ± 0.073 | 0.598 ± 0.016 | 4.056 ± 0.041 | 3.916 ± 0.054 | 2.926 ± 0.04 | 4.918 ± 0.056 | 1.884 ± 0.03 | 3.807 ± 0.048 | 2.375 ± 0.039 | 8.606 ± 0.088 | 1.81 ± 0.025 | 1.807 ± 0.032 | 3.956 ± 0.047 | 2.73 ± 0.042 | 6.548 ± 0.07 | 3.746 ± 0.036 | 3.712 ± 0.039 | 4.82 ± 0.052 | 1.128 ± 0.025 | 1.813 ± 0.026 | 0.0 ± 0.0 |
| Ser | 6.009 ± 0.059 | 0.468 ± 0.015 | 2.749 ± 0.033 | 2.386 ± 0.04 | 2.283 ± 0.036 | 5.336 ± 0.07 | 1.074 ± 0.023 | 2.677 ± 0.041 | 1.497 ± 0.028 | 5.485 ± 0.054 | 1.357 ± 0.021 | 1.32 ± 0.026 | 2.932 ± 0.041 | 1.494 ± 0.028 | 3.539 ± 0.045 | 2.909 ± 0.051 | 2.833 ± 0.04 | 3.894 ± 0.048 | 0.829 ± 0.021 | 1.312 ± 0.027 | 0.0 ± 0.0 |
| Thr | 6.215 ± 0.058 | 0.415 ± 0.013 | 2.661 ± 0.037 | 2.372 ± 0.04 | 2.044 ± 0.034 | 4.918 ± 0.063 | 1.012 ± 0.024 | 2.912 ± 0.04 | 1.443 ± 0.029 | 5.989 ± 0.063 | 1.253 ± 0.027 | 1.332 ± 0.028 | 3.369 ± 0.042 | 1.388 ± 0.026 | 3.199 ± 0.037 | 2.721 ± 0.039 | 2.95 ± 0.049 | 4.573 ± 0.053 | 0.741 ± 0.019 | 1.329 ± 0.031 | 0.0 ± 0.0 |
| Val | 9.677 ± 0.079 | 0.703 ± 0.019 | 4.249 ± 0.049 | 4.394 ± 0.048 | 2.698 ± 0.044 | 6.258 ± 0.064 | 1.441 ± 0.028 | 3.777 ± 0.053 | 2.462 ± 0.044 | 7.771 ± 0.069 | 1.929 ± 0.031 | 1.996 ± 0.033 | 4.359 ± 0.051 | 2.162 ± 0.037 | 5.299 ± 0.051 | 4.176 ± 0.038 | 4.361 ± 0.049 | 6.504 ± 0.065 | 0.967 ± 0.024 | 1.507 ± 0.022 | 0.0 ± 0.0 |
| Trp | 1.326 ± 0.026 | 0.131 ± 0.008 | 0.705 ± 0.018 | 0.584 ± 0.018 | 0.548 ± 0.017 | 1.027 ± 0.024 | 0.393 ± 0.013 | 0.767 ± 0.019 | 0.542 ± 0.017 | 1.841 ± 0.034 | 0.406 ± 0.014 | 0.518 ± 0.016 | 0.815 ± 0.021 | 0.655 ± 0.019 | 1.39 ± 0.027 | 0.896 ± 0.023 | 0.83 ± 0.022 | 0.868 ± 0.019 | 0.254 ± 0.012 | 0.315 ± 0.015 | 0.0 ± 0.0 |
| Tyr | 2.65 ± 0.039 | 0.268 ± 0.012 | 1.43 ± 0.028 | 1.241 ± 0.025 | 0.962 ± 0.022 | 2.225 ± 0.038 | 0.478 ± 0.016 | 0.854 ± 0.023 | 0.697 ± 0.023 | 2.333 ± 0.033 | 0.483 ± 0.017 | 0.609 ± 0.017 | 1.174 ± 0.025 | 0.716 ± 0.021 | 1.889 ± 0.029 | 1.2 ± 0.027 | 1.15 ± 0.029 | 1.766 ± 0.029 | 0.425 ± 0.015 | 0.621 ± 0.019 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 6507 proteins (2051812 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
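
A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of the estimates is reported. Taking the sample standard deviation as the ± value is an assumption, as are the function names, the seed, and the toy sequences; the page itself only states that the bootstrap was run ×100 at the protein level.

```python
import random
import statistics


def dipeptide_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping windows."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy proteome; real input would be the full set of proteins.
toy_proteome = ["MKAAL", "AALV", "GAAGL", "MLLAA", "KVAAL"]
value = dipeptide_permille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the residues of each protein together, so the estimate reflects variation between proteins.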

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski