Amino acid dipeptide frequency for Anaeromassilibacillus sp. An250

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
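A minimal Python sketch of how such a frequency could be computed is given here. It is not the database's own code: the function name `dipeptide_per_mille`, the choice to count overlapping pairs within each protein only, and the mapping of any non-standard letter to Xaa are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumptions (not confirmed by the source): overlapping pairs are
    counted within each protein only, and any letter outside the 20
    standard residues is treated as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        residues = [aa if aa in ONE_TO_THREE else "X" for aa in seq.upper()]
        for first, second in zip(residues, residues[1:]):
            counts[ONE_TO_THREE[first] + ONE_TO_THREE[second]] += 1
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values())
    # Report all 441 ordered pairs, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }

# Toy input, not real proteome data: pairs are MA, AA, AL, AA, AK.
freqs = dipeptide_per_mille(["MAAL", "AAK"])
print(freqs["AlaAla"])  # 400.0 (2 of 5 pairs)
```

Because the pairs are ordered, AlaCys and CysAla are counted separately, which matches the table having distinct entries for both.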

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
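The arithmetic behind this baseline can be sketched in a few lines of Python. The helper names `expected_per_mille` and `classify` are hypothetical, and it is an assumption that the page colors cells by comparing them against exactly this uniform baseline.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_per_mille, baseline):
    """Label a dipeptide relative to the uniform baseline."""
    if observed_per_mille > baseline:
        return "overrepresented"
    if observed_per_mille < baseline:
        return "underrepresented"
    return "as expected"

baseline = expected_per_mille(20)            # 1000 / 400 = 2.5 per mille
baseline_with_xaa = expected_per_mille(21)   # 1000 / 441, about 2.268 per mille

# Values taken from the table for this proteome.
for name, value in [("AlaAla", 10.142), ("LeuLeu", 9.8), ("TrpTrp", 0.137)]:
    print(name, classify(value, baseline))
```

With the 20-residue baseline, AlaAla and LeuLeu come out well above expectation and TrpTrp well below it.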

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
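As a short worked example of the unit conversion, the following Python snippet turns a per mille value into a fraction and a percentage. The function names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0

ala_ala = 10.142  # AlaAla value from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # fraction of all dipeptides
print(f"{per_mille_to_percent(ala_ala):.4f}")   # same value in percent
```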

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.142 ± 0.134
AlaCys: 1.407 ± 0.047
AlaAsp: 5.285 ± 0.071
AlaGlu: 6.602 ± 0.09
AlaPhe: 3.545 ± 0.065
AlaGly: 6.712 ± 0.079
AlaHis: 1.482 ± 0.042
AlaIle: 5.226 ± 0.069
AlaLys: 4.663 ± 0.075
AlaLeu: 8.868 ± 0.109
AlaMet: 2.586 ± 0.059
AlaAsn: 2.94 ± 0.068
AlaPro: 3.119 ± 0.057
AlaGln: 3.623 ± 0.064
AlaArg: 4.044 ± 0.078
AlaSer: 4.588 ± 0.079
AlaThr: 3.984 ± 0.094
AlaVal: 7.194 ± 0.091
AlaTrp: 0.871 ± 0.032
AlaTyr: 3.106 ± 0.067
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.378 ± 0.037
CysCys: 0.328 ± 0.02
CysAsp: 0.839 ± 0.029
CysGlu: 0.905 ± 0.029
CysPhe: 0.653 ± 0.023
CysGly: 1.641 ± 0.047
CysHis: 0.282 ± 0.018
CysIle: 0.946 ± 0.036
CysLys: 0.719 ± 0.029
CysLeu: 1.206 ± 0.038
CysMet: 0.434 ± 0.021
CysAsn: 0.527 ± 0.022
CysPro: 0.788 ± 0.035
CysGln: 0.438 ± 0.022
CysArg: 0.888 ± 0.03
CysSer: 0.953 ± 0.031
CysThr: 0.852 ± 0.032
CysVal: 1.055 ± 0.035
CysTrp: 0.124 ± 0.011
CysTyr: 0.581 ± 0.025
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.381 ± 0.082
AspCys: 0.878 ± 0.031
AspAsp: 3.008 ± 0.074
AspGlu: 4.32 ± 0.078
AspPhe: 2.674 ± 0.053
AspGly: 5.125 ± 0.095
AspHis: 0.968 ± 0.03
AspIle: 3.569 ± 0.07
AspLys: 2.617 ± 0.053
AspLeu: 5.036 ± 0.075
AspMet: 1.621 ± 0.04
AspAsn: 1.883 ± 0.056
AspPro: 2.43 ± 0.05
AspGln: 1.507 ± 0.04
AspArg: 2.891 ± 0.056
AspSer: 2.868 ± 0.055
AspThr: 3.178 ± 0.071
AspVal: 4.004 ± 0.076
AspTrp: 0.71 ± 0.029
AspTyr: 2.485 ± 0.051
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.634 ± 0.107
GluCys: 0.775 ± 0.029
GluAsp: 4.157 ± 0.078
GluGlu: 6.322 ± 0.102
GluPhe: 2.178 ± 0.051
GluGly: 4.612 ± 0.075
GluHis: 1.368 ± 0.04
GluIle: 4.653 ± 0.076
GluLys: 4.861 ± 0.076
GluLeu: 6.548 ± 0.082
GluMet: 2.112 ± 0.049
GluAsn: 3.456 ± 0.059
GluPro: 2.564 ± 0.058
GluGln: 3.596 ± 0.062
GluArg: 4.009 ± 0.073
GluSer: 3.338 ± 0.058
GluThr: 3.797 ± 0.075
GluVal: 4.218 ± 0.069
GluTrp: 0.765 ± 0.029
GluTyr: 2.609 ± 0.062
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.301 ± 0.058
PheCys: 0.796 ± 0.029
PheAsp: 2.49 ± 0.048
PheGlu: 2.405 ± 0.05
PhePhe: 1.738 ± 0.051
PheGly: 2.908 ± 0.063
PheHis: 0.853 ± 0.028
PheIle: 2.154 ± 0.056
PheLys: 1.575 ± 0.042
PheLeu: 3.871 ± 0.083
PheMet: 0.88 ± 0.03
PheAsn: 1.244 ± 0.042
PhePro: 1.486 ± 0.034
PheGln: 1.503 ± 0.039
PheArg: 1.802 ± 0.045
PheSer: 2.93 ± 0.064
PheThr: 2.534 ± 0.058
PheVal: 2.717 ± 0.051
PheTrp: 0.438 ± 0.022
PheTyr: 1.559 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.785 ± 0.088
GlyCys: 1.248 ± 0.04
GlyAsp: 4.0 ± 0.071
GlyGlu: 5.197 ± 0.079
GlyPhe: 2.947 ± 0.057
GlyGly: 5.51 ± 0.092
GlyHis: 1.249 ± 0.037
GlyIle: 5.064 ± 0.085
GlyLys: 4.817 ± 0.079
GlyLeu: 6.251 ± 0.093
GlyMet: 2.331 ± 0.05
GlyAsn: 3.083 ± 0.06
GlyPro: 1.721 ± 0.067
GlyGln: 2.516 ± 0.051
GlyArg: 3.669 ± 0.067
GlySer: 4.214 ± 0.077
GlyThr: 4.561 ± 0.088
GlyVal: 5.276 ± 0.075
GlyTrp: 0.9 ± 0.032
GlyTyr: 3.01 ± 0.061
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.487 ± 0.036
HisCys: 0.408 ± 0.019
HisAsp: 0.894 ± 0.028
HisGlu: 0.979 ± 0.033
HisPhe: 0.923 ± 0.031
HisGly: 1.46 ± 0.04
HisHis: 0.382 ± 0.019
HisIle: 1.243 ± 0.042
HisLys: 0.794 ± 0.028
HisLeu: 1.673 ± 0.044
HisMet: 0.482 ± 0.022
HisAsn: 0.727 ± 0.029
HisPro: 1.08 ± 0.033
HisGln: 0.551 ± 0.023
HisArg: 1.072 ± 0.033
HisSer: 1.026 ± 0.032
HisThr: 1.065 ± 0.035
HisVal: 1.13 ± 0.036
HisTrp: 0.207 ± 0.015
HisTyr: 0.761 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.586 ± 0.072
IleCys: 1.071 ± 0.036
IleAsp: 3.772 ± 0.064
IleGlu: 3.884 ± 0.068
IlePhe: 2.19 ± 0.053
IleGly: 4.338 ± 0.079
IleHis: 1.184 ± 0.033
IleIle: 3.146 ± 0.067
IleLys: 2.565 ± 0.061
IleLeu: 5.93 ± 0.111
IleMet: 1.314 ± 0.04
IleAsn: 2.12 ± 0.054
IlePro: 3.035 ± 0.055
IleGln: 2.319 ± 0.052
IleArg: 3.404 ± 0.055
IleSer: 3.983 ± 0.068
IleThr: 3.434 ± 0.062
IleVal: 4.143 ± 0.066
IleTrp: 0.519 ± 0.022
IleTyr: 2.07 ± 0.049
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.799 ± 0.065
LysCys: 0.594 ± 0.025
LysAsp: 2.868 ± 0.053
LysGlu: 4.347 ± 0.075
LysPhe: 1.404 ± 0.046
LysGly: 3.465 ± 0.061
LysHis: 0.926 ± 0.034
LysIle: 3.404 ± 0.059
LysLys: 3.897 ± 0.083
LysLeu: 4.537 ± 0.075
LysMet: 1.545 ± 0.039
LysAsn: 2.344 ± 0.05
LysPro: 2.139 ± 0.05
LysGln: 2.174 ± 0.056
LysArg: 3.225 ± 0.063
LysSer: 2.812 ± 0.046
LysThr: 3.153 ± 0.056
LysVal: 3.218 ± 0.066
LysTrp: 0.555 ± 0.024
LysTyr: 1.95 ± 0.05
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.039 ± 0.121
LeuCys: 1.836 ± 0.047
LeuAsp: 5.472 ± 0.067
LeuGlu: 6.434 ± 0.084
LeuPhe: 3.974 ± 0.072
LeuGly: 5.82 ± 0.089
LeuHis: 1.854 ± 0.049
LeuIle: 4.855 ± 0.09
LeuLys: 4.781 ± 0.071
LeuLeu: 9.8 ± 0.152
LeuMet: 2.254 ± 0.048
LeuAsn: 3.46 ± 0.066
LeuPro: 4.447 ± 0.069
LeuGln: 3.37 ± 0.06
LeuArg: 5.194 ± 0.089
LeuSer: 6.565 ± 0.079
LeuThr: 5.352 ± 0.076
LeuVal: 5.888 ± 0.084
LeuTrp: 0.892 ± 0.032
LeuTyr: 3.205 ± 0.063
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.513 ± 0.051
MetCys: 0.304 ± 0.018
MetAsp: 1.716 ± 0.041
MetGlu: 2.342 ± 0.049
MetPhe: 0.852 ± 0.032
MetGly: 1.959 ± 0.045
MetHis: 0.451 ± 0.018
MetIle: 1.373 ± 0.039
MetLys: 1.859 ± 0.049
MetLeu: 2.637 ± 0.057
MetMet: 0.789 ± 0.034
MetAsn: 1.16 ± 0.031
MetPro: 1.178 ± 0.041
MetGln: 1.123 ± 0.036
MetArg: 1.452 ± 0.04
MetSer: 1.451 ± 0.036
MetThr: 1.556 ± 0.042
MetVal: 1.685 ± 0.047
MetTrp: 0.194 ± 0.014
MetTyr: 0.659 ± 0.025
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.508 ± 0.067
AsnCys: 0.58 ± 0.025
AsnAsp: 1.895 ± 0.052
AsnGlu: 2.232 ± 0.05
AsnPhe: 1.402 ± 0.038
AsnGly: 3.606 ± 0.081
AsnHis: 0.754 ± 0.03
AsnIle: 2.443 ± 0.053
AsnLys: 1.644 ± 0.041
AsnLeu: 3.663 ± 0.068
AsnMet: 0.973 ± 0.028
AsnAsn: 1.422 ± 0.04
AsnPro: 2.101 ± 0.052
AsnGln: 1.325 ± 0.041
AsnArg: 2.181 ± 0.055
AsnSer: 2.038 ± 0.051
AsnThr: 2.188 ± 0.057
AsnVal: 2.496 ± 0.056
AsnTrp: 0.432 ± 0.021
AsnTyr: 1.426 ± 0.045
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.751 ± 0.073
ProCys: 0.543 ± 0.022
ProAsp: 2.847 ± 0.058
ProGlu: 4.082 ± 0.067
ProPhe: 1.758 ± 0.044
ProGly: 2.849 ± 0.056
ProHis: 0.785 ± 0.03
ProIle: 2.104 ± 0.047
ProLys: 1.957 ± 0.054
ProLeu: 3.284 ± 0.06
ProMet: 0.99 ± 0.031
ProAsn: 1.465 ± 0.04
ProPro: 1.428 ± 0.045
ProGln: 1.576 ± 0.043
ProArg: 1.491 ± 0.042
ProSer: 2.381 ± 0.055
ProThr: 1.998 ± 0.059
ProVal: 3.419 ± 0.066
ProTrp: 0.365 ± 0.019
ProTyr: 1.557 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.516 ± 0.069
GlnCys: 0.404 ± 0.022
GlnAsp: 1.875 ± 0.042
GlnGlu: 3.09 ± 0.061
GlnPhe: 1.237 ± 0.039
GlnGly: 2.276 ± 0.048
GlnHis: 0.585 ± 0.023
GlnIle: 2.435 ± 0.057
GlnLys: 2.646 ± 0.064
GlnLeu: 3.215 ± 0.065
GlnMet: 1.149 ± 0.033
GlnAsn: 1.779 ± 0.042
GlnPro: 1.41 ± 0.043
GlnGln: 1.696 ± 0.045
GlnArg: 2.109 ± 0.054
GlnSer: 2.014 ± 0.043
GlnThr: 2.003 ± 0.047
GlnVal: 2.402 ± 0.044
GlnTrp: 0.359 ± 0.019
GlnTyr: 1.513 ± 0.044
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.314 ± 0.072
ArgCys: 0.739 ± 0.027
ArgAsp: 2.573 ± 0.056
ArgGlu: 4.064 ± 0.081
ArgPhe: 2.175 ± 0.048
ArgGly: 3.079 ± 0.055
ArgHis: 0.998 ± 0.033
ArgIle: 3.443 ± 0.063
ArgLys: 3.247 ± 0.067
ArgLeu: 5.09 ± 0.084
ArgMet: 1.788 ± 0.046
ArgAsn: 1.971 ± 0.047
ArgPro: 1.936 ± 0.046
ArgGln: 2.139 ± 0.052
ArgArg: 3.474 ± 0.074
ArgSer: 2.729 ± 0.05
ArgThr: 2.749 ± 0.054
ArgVal: 3.386 ± 0.063
ArgTrp: 0.557 ± 0.022
ArgTyr: 2.027 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.924 ± 0.072
SerCys: 0.8 ± 0.029
SerAsp: 3.221 ± 0.061
SerGlu: 3.646 ± 0.062
SerPhe: 2.573 ± 0.054
SerGly: 5.242 ± 0.077
SerHis: 1.078 ± 0.035
SerIle: 3.68 ± 0.065
SerLys: 2.669 ± 0.047
SerLeu: 5.335 ± 0.075
SerMet: 1.664 ± 0.044
SerAsn: 2.076 ± 0.051
SerPro: 2.218 ± 0.045
SerGln: 1.922 ± 0.041
SerArg: 2.913 ± 0.059
SerSer: 3.653 ± 0.082
SerThr: 3.076 ± 0.07
SerVal: 4.322 ± 0.064
SerTrp: 0.58 ± 0.028
SerTyr: 2.174 ± 0.051
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.363 ± 0.097
ThrCys: 0.665 ± 0.031
ThrAsp: 3.095 ± 0.064
ThrGlu: 3.683 ± 0.08
ThrPhe: 2.259 ± 0.048
ThrGly: 4.786 ± 0.088
ThrHis: 0.931 ± 0.03
ThrIle: 3.608 ± 0.069
ThrLys: 2.352 ± 0.041
ThrLeu: 5.21 ± 0.071
ThrMet: 1.399 ± 0.038
ThrAsn: 1.869 ± 0.048
ThrPro: 2.816 ± 0.062
ThrGln: 1.797 ± 0.045
ThrArg: 2.321 ± 0.05
ThrSer: 2.964 ± 0.066
ThrThr: 3.025 ± 0.075
ThrVal: 5.086 ± 0.101
ThrTrp: 0.522 ± 0.024
ThrTyr: 1.985 ± 0.047
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.689 ± 0.091
ValCys: 1.331 ± 0.041
ValAsp: 4.144 ± 0.069
ValGlu: 4.826 ± 0.079
ValPhe: 2.731 ± 0.061
ValGly: 4.424 ± 0.076
ValHis: 1.208 ± 0.037
ValIle: 3.955 ± 0.075
ValLys: 3.527 ± 0.062
ValLeu: 7.009 ± 0.101
ValMet: 1.782 ± 0.044
ValAsn: 2.678 ± 0.059
ValPro: 2.969 ± 0.054
ValGln: 2.564 ± 0.048
ValArg: 3.513 ± 0.064
ValSer: 4.577 ± 0.063
ValThr: 4.287 ± 0.081
ValVal: 4.992 ± 0.083
ValTrp: 0.76 ± 0.03
ValTyr: 2.625 ± 0.058
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.805 ± 0.027
TrpCys: 0.165 ± 0.012
TrpAsp: 0.624 ± 0.029
TrpGlu: 0.718 ± 0.028
TrpPhe: 0.446 ± 0.021
TrpGly: 0.709 ± 0.027
TrpHis: 0.187 ± 0.013
TrpIle: 0.588 ± 0.023
TrpLys: 0.642 ± 0.024
TrpLeu: 0.952 ± 0.037
TrpMet: 0.312 ± 0.017
TrpAsn: 0.538 ± 0.028
TrpPro: 0.246 ± 0.018
TrpGln: 0.472 ± 0.02
TrpArg: 0.574 ± 0.022
TrpSer: 0.572 ± 0.026
TrpThr: 0.55 ± 0.027
TrpVal: 0.636 ± 0.029
TrpTrp: 0.137 ± 0.015
TrpTyr: 0.385 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.072 ± 0.064
TyrCys: 0.667 ± 0.026
TyrAsp: 2.375 ± 0.049
TyrGlu: 2.491 ± 0.056
TyrPhe: 1.525 ± 0.044
TyrGly: 2.83 ± 0.054
TyrHis: 0.82 ± 0.025
TyrIle: 2.119 ± 0.044
TyrLys: 1.526 ± 0.037
TyrLeu: 3.373 ± 0.068
TyrMet: 0.847 ± 0.025
TyrAsn: 1.567 ± 0.044
TyrPro: 1.516 ± 0.037
TyrGln: 1.503 ± 0.036
TyrArg: 2.253 ± 0.053
TyrSer: 2.111 ± 0.05
TyrThr: 2.393 ± 0.055
TyrVal: 2.327 ± 0.049
TyrTrp: 0.372 ± 0.02
TyrTyr: 1.593 ± 0.044
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3281 proteins (1017378 amino acids)
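For readers who want to work with the listed values programmatically, the following Python sketch parses entries containing a residue pair, a value, and an error (for example `AlaAla: 10.142 ± 0.134`) into a lookup table. The function name `parse_entries` is hypothetical, and the pattern uses a search rather than a full-line match so that any text preceding the residue pair on a line is ignored.

```python
import re

# Two three-letter residue codes, then "value ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")

def parse_entries(lines):
    """Map (first, second) residue names to (value, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:  # row labels and other text are skipped
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table

# A few lines copied from the table above.
sample = ["Ala", "AlaAla: 10.142 ± 0.134", "AlaCys: 1.407 ± 0.047",
          "Cys", "CysAla: 1.378 ± 0.037"]
table = parse_entries(sample)

# Order matters: AlaCys and CysAla are distinct entries.
print(table[("Ala", "Cys")], table[("Cys", "Ala")])
```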

Note: The error was estimated by bootstrapping (×100) at the protein level.
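One way such an estimate could be obtained is sketched below in Python. This is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the reported error is taken to be the sample standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs (one-letter codes) in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Mean and standard deviation of a dipeptide frequency (per mille)
    across bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy sequences, not real proteome data.
toy = ["MAAL", "AAK", "MKLV", "GAAA"]
mean, sd = bootstrap_error(toy, "AA")
print(f"{mean:.1f} ± {sd:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.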

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski