Amino acid dipeptide frequency for Neomicrococcus aestuarii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
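The expectation described above can be written down as a minimal Python sketch. The function names and the "over"/"under" labels are hypothetical, and the sketch assumes that the page's colouring compares each value against the uniform expectation; the exact rule used for the table is not stated here.

```python
def expected_uniform_per_mille(alphabet_size: int = 20) -> float:
    """Expected frequency (in per mille) of one dipeptide if all
    alphabet_size**2 ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille: float, alphabet_size: int = 20) -> str:
    """Label a dipeptide relative to the uniform expectation.

    Assumption: "over" corresponds to the red cells and "under" to the
    blue cells; the real threshold used by the page may differ.
    """
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(expected_uniform_per_mille(20))  # 400 possible dipeptides
    print(expected_uniform_per_mille(21))  # 441 possible dipeptides with Xaa
    print(classify(15.607))                # AlaAla value from the table
    print(classify(0.062))                 # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (15.607‰) falls well above it and CysCys (0.062‰) well below it.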

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
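As a small illustration of the unit conversion, the following Python sketch turns a per mille value into a fraction or a percentage. The helper names are hypothetical and not part of Proteome-pI.

```python
def per_mille_to_fraction(value_per_mille: float) -> float:
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille: float) -> float:
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value_per_mille * 0.1


if __name__ == "__main__":
    # AlaAla is listed as 15.607 per mille in the table.
    print(per_mille_to_fraction(15.607))
    print(per_mille_to_percent(15.607))
```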

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
15.607AlaAla: 15.607 ± 0.22
0.652AlaCys: 0.652 ± 0.035
6.365AlaAsp: 6.365 ± 0.095
7.933AlaGlu: 7.933 ± 0.131
3.743AlaPhe: 3.743 ± 0.083
9.752AlaGly: 9.752 ± 0.126
2.272AlaHis: 2.272 ± 0.054
5.789AlaIle: 5.789 ± 0.104
4.108AlaLys: 4.108 ± 0.08
11.928AlaLeu: 11.928 ± 0.145
2.603AlaMet: 2.603 ± 0.064
3.021AlaAsn: 3.021 ± 0.063
5.197AlaPro: 5.197 ± 0.104
4.207AlaGln: 4.207 ± 0.074
6.905AlaArg: 6.905 ± 0.107
7.439AlaSer: 7.439 ± 0.113
6.713AlaThr: 6.713 ± 0.102
9.317AlaVal: 9.317 ± 0.127
1.537AlaTrp: 1.537 ± 0.05
2.277AlaTyr: 2.277 ± 0.055
0.0AlaXaa: 0.0 ± 0.0
Cys
0.644CysAla: 0.644 ± 0.03
0.062CysCys: 0.062 ± 0.01
0.284CysAsp: 0.284 ± 0.016
0.308CysGlu: 0.308 ± 0.021
0.19CysPhe: 0.19 ± 0.014
0.631CysGly: 0.631 ± 0.028
0.152CysHis: 0.152 ± 0.015
0.267CysIle: 0.267 ± 0.017
0.118CysLys: 0.118 ± 0.013
0.517CysLeu: 0.517 ± 0.029
0.093CysMet: 0.093 ± 0.011
0.144CysAsn: 0.144 ± 0.014
0.267CysPro: 0.267 ± 0.019
0.148CysGln: 0.148 ± 0.015
0.308CysArg: 0.308 ± 0.021
0.357CysSer: 0.357 ± 0.022
0.352CysThr: 0.352 ± 0.022
0.457CysVal: 0.457 ± 0.023
0.076CysTrp: 0.076 ± 0.01
0.107CysTyr: 0.107 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
7.218AspAla: 7.218 ± 0.121
0.278AspCys: 0.278 ± 0.018
3.255AspAsp: 3.255 ± 0.082
3.999AspGlu: 3.999 ± 0.085
2.034AspPhe: 2.034 ± 0.057
4.654AspGly: 4.654 ± 0.079
1.268AspHis: 1.268 ± 0.042
2.59AspIle: 2.59 ± 0.062
1.561AspLys: 1.561 ± 0.048
5.594AspLeu: 5.594 ± 0.089
0.98AspMet: 0.98 ± 0.035
1.185AspAsn: 1.185 ± 0.038
3.46AspPro: 3.46 ± 0.067
1.828AspGln: 1.828 ± 0.049
3.401AspArg: 3.401 ± 0.078
3.431AspSer: 3.431 ± 0.066
2.62AspThr: 2.62 ± 0.055
4.879AspVal: 4.879 ± 0.07
0.706AspTrp: 0.706 ± 0.033
1.351AspTyr: 1.351 ± 0.046
0.0AspXaa: 0.0 ± 0.0
Glu
7.367GluAla: 7.367 ± 0.12
0.308GluCys: 0.308 ± 0.021
3.737GluAsp: 3.737 ± 0.081
4.327GluGlu: 4.327 ± 0.092
2.237GluPhe: 2.237 ± 0.062
4.361GluGly: 4.361 ± 0.084
1.577GluHis: 1.577 ± 0.044
3.512GluIle: 3.512 ± 0.069
2.433GluLys: 2.433 ± 0.06
7.17GluLeu: 7.17 ± 0.097
1.182GluMet: 1.182 ± 0.043
2.161GluAsn: 2.161 ± 0.056
2.753GluPro: 2.753 ± 0.072
2.427GluGln: 2.427 ± 0.064
4.637GluArg: 4.637 ± 0.091
4.057GluSer: 4.057 ± 0.087
3.441GluThr: 3.441 ± 0.073
4.795GluVal: 4.795 ± 0.081
0.948GluTrp: 0.948 ± 0.034
1.464GluTyr: 1.464 ± 0.039
0.0GluXaa: 0.0 ± 0.0
Phe
4.108PheAla: 4.108 ± 0.082
0.232PheCys: 0.232 ± 0.016
2.176PheAsp: 2.176 ± 0.05
2.328PheGlu: 2.328 ± 0.058
1.316PhePhe: 1.316 ± 0.044
3.516PheGly: 3.516 ± 0.074
0.626PheHis: 0.626 ± 0.027
1.619PheIle: 1.619 ± 0.053
0.972PheLys: 0.972 ± 0.034
3.093PheLeu: 3.093 ± 0.063
0.702PheMet: 0.702 ± 0.032
1.063PheAsn: 1.063 ± 0.041
1.501PhePro: 1.501 ± 0.044
1.004PheGln: 1.004 ± 0.032
1.764PheArg: 1.764 ± 0.05
2.212PheSer: 2.212 ± 0.048
2.25PheThr: 2.25 ± 0.061
3.066PheVal: 3.066 ± 0.069
0.49PheTrp: 0.49 ± 0.026
0.741PheTyr: 0.741 ± 0.036
0.0PheXaa: 0.0 ± 0.0
Gly
8.636GlyAla: 8.636 ± 0.122
0.549GlyCys: 0.549 ± 0.029
4.067GlyAsp: 4.067 ± 0.075
4.904GlyGlu: 4.904 ± 0.094
3.271GlyPhe: 3.271 ± 0.073
6.542GlyGly: 6.542 ± 0.109
1.799GlyHis: 1.799 ± 0.057
4.782GlyIle: 4.782 ± 0.099
3.225GlyLys: 3.225 ± 0.066
8.093GlyLeu: 8.093 ± 0.124
1.926GlyMet: 1.926 ± 0.052
2.282GlyAsn: 2.282 ± 0.055
3.199GlyPro: 3.199 ± 0.071
2.81GlyGln: 2.81 ± 0.06
4.798GlyArg: 4.798 ± 0.084
5.372GlySer: 5.372 ± 0.095
5.29GlyThr: 5.29 ± 0.088
6.97GlyVal: 6.97 ± 0.097
1.392GlyTrp: 1.392 ± 0.048
2.159GlyTyr: 2.159 ± 0.059
0.0GlyXaa: 0.0 ± 0.0
His
2.12HisAla: 2.12 ± 0.055
0.118HisCys: 0.118 ± 0.014
1.175HisAsp: 1.175 ± 0.044
1.374HisGlu: 1.374 ± 0.045
0.74HisPhe: 0.74 ± 0.035
1.84HisGly: 1.84 ± 0.053
0.676HisHis: 0.676 ± 0.03
0.841HisIle: 0.841 ± 0.032
0.47HisLys: 0.47 ± 0.024
2.076HisLeu: 2.076 ± 0.05
0.456HisMet: 0.456 ± 0.025
0.462HisAsn: 0.462 ± 0.028
1.48HisPro: 1.48 ± 0.055
0.708HisGln: 0.708 ± 0.028
1.488HisArg: 1.488 ± 0.044
1.269HisSer: 1.269 ± 0.04
1.061HisThr: 1.061 ± 0.042
1.791HisVal: 1.791 ± 0.056
0.291HisTrp: 0.291 ± 0.02
0.466HisTyr: 0.466 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
6.436IleAla: 6.436 ± 0.122
0.323IleCys: 0.323 ± 0.022
3.228IleAsp: 3.228 ± 0.068
3.104IleGlu: 3.104 ± 0.071
1.718IlePhe: 1.718 ± 0.059
4.368IleGly: 4.368 ± 0.1
0.93IleHis: 0.93 ± 0.04
2.4IleIle: 2.4 ± 0.061
1.464IleLys: 1.464 ± 0.043
4.523IleLeu: 4.523 ± 0.095
1.023IleMet: 1.023 ± 0.039
1.564IleAsn: 1.564 ± 0.053
2.696IlePro: 2.696 ± 0.062
1.406IleGln: 1.406 ± 0.042
2.743IleArg: 2.743 ± 0.059
3.33IleSer: 3.33 ± 0.071
3.113IleThr: 3.113 ± 0.071
4.241IleVal: 4.241 ± 0.088
0.496IleTrp: 0.496 ± 0.025
0.932IleTyr: 0.932 ± 0.033
0.0IleXaa: 0.0 ± 0.0
Lys
3.613LysAla: 3.613 ± 0.078
0.118LysCys: 0.118 ± 0.012
2.127LysAsp: 2.127 ± 0.055
1.929LysGlu: 1.929 ± 0.053
1.061LysPhe: 1.061 ± 0.04
2.191LysGly: 2.191 ± 0.061
0.732LysHis: 0.732 ± 0.033
1.811LysIle: 1.811 ± 0.054
1.562LysLys: 1.562 ± 0.053
3.17LysLeu: 3.17 ± 0.068
0.799LysMet: 0.799 ± 0.035
1.272LysAsn: 1.272 ± 0.042
1.688LysPro: 1.688 ± 0.055
1.019LysGln: 1.019 ± 0.037
2.123LysArg: 2.123 ± 0.06
2.094LysSer: 2.094 ± 0.054
2.021LysThr: 2.021 ± 0.054
2.671LysVal: 2.671 ± 0.062
0.428LysTrp: 0.428 ± 0.022
0.837LysTyr: 0.837 ± 0.034
0.0LysXaa: 0.0 ± 0.0
Leu
12.194LeuAla: 12.194 ± 0.144
0.521LeuCys: 0.521 ± 0.023
5.798LeuAsp: 5.798 ± 0.096
6.356LeuGlu: 6.356 ± 0.098
3.038LeuPhe: 3.038 ± 0.071
8.462LeuGly: 8.462 ± 0.108
1.837LeuHis: 1.837 ± 0.049
4.832LeuIle: 4.832 ± 0.087
3.293LeuLys: 3.293 ± 0.067
9.177LeuLeu: 9.177 ± 0.137
2.082LeuMet: 2.082 ± 0.053
2.894LeuAsn: 2.894 ± 0.063
5.144LeuPro: 5.144 ± 0.087
2.835LeuGln: 2.835 ± 0.063
6.193LeuArg: 6.193 ± 0.097
6.504LeuSer: 6.504 ± 0.1
6.058LeuThr: 6.058 ± 0.084
8.224LeuVal: 8.224 ± 0.147
1.219LeuTrp: 1.219 ± 0.047
1.714LeuTyr: 1.714 ± 0.058
0.0LeuXaa: 0.0 ± 0.0
Met
2.288MetAla: 2.288 ± 0.054
0.107MetCys: 0.107 ± 0.011
1.091MetAsp: 1.091 ± 0.034
0.987MetGlu: 0.987 ± 0.035
0.672MetPhe: 0.672 ± 0.035
1.688MetGly: 1.688 ± 0.051
0.403MetHis: 0.403 ± 0.023
1.112MetIle: 1.112 ± 0.04
0.714MetLys: 0.714 ± 0.034
1.975MetLeu: 1.975 ± 0.059
0.419MetMet: 0.419 ± 0.022
0.732MetAsn: 0.732 ± 0.029
1.086MetPro: 1.086 ± 0.038
0.608MetGln: 0.608 ± 0.027
1.354MetArg: 1.354 ± 0.046
1.694MetSer: 1.694 ± 0.046
1.63MetThr: 1.63 ± 0.048
1.663MetVal: 1.663 ± 0.045
0.24MetTrp: 0.24 ± 0.018
0.386MetTyr: 0.386 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
3.287AsnAla: 3.287 ± 0.07
0.145AsnCys: 0.145 ± 0.014
1.451AsnAsp: 1.451 ± 0.045
1.73AsnGlu: 1.73 ± 0.055
1.035AsnPhe: 1.035 ± 0.036
2.571AsnGly: 2.571 ± 0.059
0.6AsnHis: 0.6 ± 0.03
1.454AsnIle: 1.454 ± 0.042
0.926AsnLys: 0.926 ± 0.042
2.652AsnLeu: 2.652 ± 0.061
0.572AsnMet: 0.572 ± 0.024
0.902AsnAsn: 0.902 ± 0.036
1.942AsnPro: 1.942 ± 0.047
1.032AsnGln: 1.032 ± 0.044
1.649AsnArg: 1.649 ± 0.05
1.791AsnSer: 1.791 ± 0.051
1.522AsnThr: 1.522 ± 0.049
2.409AsnVal: 2.409 ± 0.059
0.42AsnTrp: 0.42 ± 0.024
0.695AsnTyr: 0.695 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
5.829ProAla: 5.829 ± 0.102
0.192ProCys: 0.192 ± 0.016
2.878ProAsp: 2.878 ± 0.065
4.354ProGlu: 4.354 ± 0.091
1.705ProPhe: 1.705 ± 0.053
4.091ProGly: 4.091 ± 0.079
1.137ProHis: 1.137 ± 0.042
2.051ProIle: 2.051 ± 0.049
1.557ProLys: 1.557 ± 0.047
4.624ProLeu: 4.624 ± 0.083
0.904ProMet: 0.904 ± 0.038
1.418ProAsn: 1.418 ± 0.048
1.505ProPro: 1.505 ± 0.055
1.753ProGln: 1.753 ± 0.046
2.665ProArg: 2.665 ± 0.065
3.278ProSer: 3.278 ± 0.07
2.932ProThr: 2.932 ± 0.08
4.285ProVal: 4.285 ± 0.067
0.745ProTrp: 0.745 ± 0.035
1.012ProTyr: 1.012 ± 0.037
0.0ProXaa: 0.0 ± 0.0
Gln
3.748GlnAla: 3.748 ± 0.082
0.164GlnCys: 0.164 ± 0.015
1.804GlnAsp: 1.804 ± 0.044
2.132GlnGlu: 2.132 ± 0.054
1.1GlnPhe: 1.1 ± 0.041
2.336GlnGly: 2.336 ± 0.051
0.782GlnHis: 0.782 ± 0.032
1.612GlnIle: 1.612 ± 0.052
1.116GlnLys: 1.116 ± 0.038
3.576GlnLeu: 3.576 ± 0.078
0.604GlnMet: 0.604 ± 0.03
0.938GlnAsn: 0.938 ± 0.04
1.646GlnPro: 1.646 ± 0.055
1.531GlnGln: 1.531 ± 0.055
2.611GlnArg: 2.611 ± 0.066
1.971GlnSer: 1.971 ± 0.052
1.596GlnThr: 1.596 ± 0.045
2.43GlnVal: 2.43 ± 0.056
0.574GlnTrp: 0.574 ± 0.03
0.727GlnTyr: 0.727 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
6.441ArgAla: 6.441 ± 0.103
0.304ArgCys: 0.304 ± 0.022
3.396ArgAsp: 3.396 ± 0.072
4.397ArgGlu: 4.397 ± 0.084
2.267ArgPhe: 2.267 ± 0.056
4.478ArgGly: 4.478 ± 0.083
1.332ArgHis: 1.332 ± 0.039
3.344ArgIle: 3.344 ± 0.068
2.123ArgLys: 2.123 ± 0.066
6.062ArgLeu: 6.062 ± 0.106
1.431ArgMet: 1.431 ± 0.037
1.812ArgAsn: 1.812 ± 0.06
2.759ArgPro: 2.759 ± 0.077
2.114ArgGln: 2.114 ± 0.051
4.586ArgArg: 4.586 ± 0.086
3.828ArgSer: 3.828 ± 0.086
3.545ArgThr: 3.545 ± 0.065
4.93ArgVal: 4.93 ± 0.085
1.023ArgTrp: 1.023 ± 0.037
1.459ArgTyr: 1.459 ± 0.057
0.0ArgXaa: 0.0 ± 0.0
Ser
7.763SerAla: 7.763 ± 0.128
0.322SerCys: 0.322 ± 0.025
3.056SerAsp: 3.056 ± 0.076
4.037SerGlu: 4.037 ± 0.078
2.323SerPhe: 2.323 ± 0.06
6.039SerGly: 6.039 ± 0.098
1.29SerHis: 1.29 ± 0.045
3.025SerIle: 3.025 ± 0.075
2.116SerLys: 2.116 ± 0.058
6.113SerLeu: 6.113 ± 0.078
1.549SerMet: 1.549 ± 0.047
1.772SerAsn: 1.772 ± 0.046
3.096SerPro: 3.096 ± 0.069
2.106SerGln: 2.106 ± 0.057
3.784SerArg: 3.784 ± 0.078
5.144SerSer: 5.144 ± 0.133
4.15SerThr: 4.15 ± 0.088
5.275SerVal: 5.275 ± 0.093
0.935SerTrp: 0.935 ± 0.038
1.447SerTyr: 1.447 ± 0.048
0.0SerXaa: 0.0 ± 0.0
Thr
6.829ThrAla: 6.829 ± 0.097
0.295ThrCys: 0.295 ± 0.02
3.185ThrAsp: 3.185 ± 0.076
3.677ThrGlu: 3.677 ± 0.076
2.052ThrPhe: 2.052 ± 0.051
5.107ThrGly: 5.107 ± 0.075
1.194ThrHis: 1.194 ± 0.04
2.936ThrIle: 2.936 ± 0.072
1.807ThrLys: 1.807 ± 0.052
5.649ThrLeu: 5.649 ± 0.085
1.088ThrMet: 1.088 ± 0.038
1.638ThrAsn: 1.638 ± 0.049
3.453ThrPro: 3.453 ± 0.076
1.846ThrGln: 1.846 ± 0.046
3.107ThrArg: 3.107 ± 0.065
3.952ThrSer: 3.952 ± 0.076
3.712ThrThr: 3.712 ± 0.087
5.417ThrVal: 5.417 ± 0.081
0.792ThrTrp: 0.792 ± 0.038
1.395ThrTyr: 1.395 ± 0.049
0.0ThrXaa: 0.0 ± 0.0
Val
9.63ValAla: 9.63 ± 0.134
0.54ValCys: 0.54 ± 0.029
5.08ValAsp: 5.08 ± 0.089
4.994ValGlu: 4.994 ± 0.095
2.794ValPhe: 2.794 ± 0.061
6.47ValGly: 6.47 ± 0.107
1.638ValHis: 1.638 ± 0.05
4.434ValIle: 4.434 ± 0.088
2.597ValLys: 2.597 ± 0.062
8.497ValLeu: 8.497 ± 0.124
1.763ValMet: 1.763 ± 0.055
2.446ValAsn: 2.446 ± 0.063
4.344ValPro: 4.344 ± 0.074
2.349ValGln: 2.349 ± 0.057
5.046ValArg: 5.046 ± 0.1
5.467ValSer: 5.467 ± 0.089
5.182ValThr: 5.182 ± 0.098
7.624ValVal: 7.624 ± 0.115
0.982ValTrp: 0.982 ± 0.042
1.506ValTyr: 1.506 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
1.357TrpAla: 1.357 ± 0.044
0.114TrpCys: 0.114 ± 0.012
0.799TrpAsp: 0.799 ± 0.032
0.733TrpGlu: 0.733 ± 0.033
0.588TrpPhe: 0.588 ± 0.027
0.99TrpGly: 0.99 ± 0.044
0.293TrpHis: 0.293 ± 0.02
0.808TrpIle: 0.808 ± 0.035
0.483TrpLys: 0.483 ± 0.026
1.587TrpLeu: 1.587 ± 0.056
0.331TrpMet: 0.331 ± 0.022
0.513TrpAsn: 0.513 ± 0.03
0.606TrpPro: 0.606 ± 0.029
0.47TrpGln: 0.47 ± 0.026
0.927TrpArg: 0.927 ± 0.039
0.824TrpSer: 0.824 ± 0.028
0.83TrpThr: 0.83 ± 0.039
1.158TrpVal: 1.158 ± 0.044
0.305TrpTrp: 0.305 ± 0.021
0.3TrpTyr: 0.3 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.252TyrAla: 2.252 ± 0.055
0.14TyrCys: 0.14 ± 0.013
1.338TyrAsp: 1.338 ± 0.04
1.289TyrGlu: 1.289 ± 0.047
0.896TyrPhe: 0.896 ± 0.038
1.981TyrGly: 1.981 ± 0.056
0.363TyrHis: 0.363 ± 0.023
0.821TyrIle: 0.821 ± 0.036
0.619TyrLys: 0.619 ± 0.029
2.301TyrLeu: 2.301 ± 0.059
0.34TyrMet: 0.34 ± 0.022
0.564TyrAsn: 0.564 ± 0.031
1.093TyrPro: 1.093 ± 0.039
0.795TyrGln: 0.795 ± 0.033
1.543TyrArg: 1.543 ± 0.047
1.328TyrSer: 1.328 ± 0.04
1.116TyrThr: 1.116 ± 0.034
1.778TyrVal: 1.778 ± 0.05
0.418TyrTrp: 0.418 ± 0.026
0.516TyrTyr: 0.516 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2289 proteins (763650 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
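Protein-level bootstrapping can be sketched as follows in Python. This is only an illustration under several assumptions, not the Proteome-pI implementation: sequences are given as one-letter strings, dipeptides are counted as overlapping adjacent pairs within each protein, "x100" is read as 100 resampling replicates, and the reported ± value is taken to be the standard deviation across replicates. All function names are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def dipeptide_counts(sequence: str) -> Counter:
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))


def frequencies_per_mille(proteins: list[str]) -> dict[str, float]:
    """Pooled dipeptide frequencies (per mille) over a list of proteins."""
    total = Counter()
    for sequence in proteins:
        total.update(dipeptide_counts(sequence))
    n = sum(total.values())
    if n == 0:
        return {}
    return {dp: 1000.0 * count / n for dp, count in total.items()}


def bootstrap_error(proteins: list[str], dipeptide: str,
                    replicates: int = 100, seed: int = 0) -> tuple[float, float]:
    """Resample whole proteins with replacement and return the mean and
    standard deviation of the dipeptide frequency across replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Each replicate draws as many proteins as the original set contains.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequencies_per_mille(sample).get(dipeptide, 0.0))
    return mean(estimates), stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "ACAC", "CCCA"]  # made-up sequences
    print(frequencies_per_mille(toy_proteome)["AA"])
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.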

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski