Amino acid dipeptide frequency for Falsibacillus pallidus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
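As a minimal sketch of how such frequencies could be computed (this is not the code used by Proteome-pI), the example below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter residue codes, and the toy sequences are assumptions made for illustration.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for an iterable of sequences.

    Dipeptides are overlapping pairs of adjacent residues. They are counted
    within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent residue pair in the sequence
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome)
freqs = dipeptide_frequencies(["MKLLA", "MKA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

With real data, the sequences would come from the proteome FASTA file, and the one-letter pairs would be mapped to the three-letter labels used in the table.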

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
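The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and whether the page's colour coding uses 400 or 441 as the denominator is an assumption; the example uses 400 to match the 0.25% quoted above.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected_permille:
        return "over"   # assumed to correspond to red in the table
    if observed_permille < expected_permille:
        return "under"  # assumed to correspond to blue in the table
    return "equal"


expected_20 = uniform_expectation_permille(20)  # 1000 / 400 = 2.5 per mille (0.25%)
expected_21 = uniform_expectation_permille(21)  # 1000 / 441, roughly 2.27 per mille

# Two values copied from the table
print(classify(10.08, expected_20))  # LeuLeu
print(classify(0.107, expected_20))  # CysCys
```

Under this uniform model LeuLeu is overrepresented and CysCys underrepresented. A model that accounts for single amino acid frequencies would give different expectations.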

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
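As a small worked example of the unit conversion (the function names are illustrative only), a per-mille value is multiplied by 10⁻³ to obtain a fraction, or divided by 10 to obtain a percentage:

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


leu_leu = 10.08  # LeuLeu value copied from the table, in per mille
fraction = permille_to_fraction(leu_leu)  # about 0.01008
percent = permille_to_percent(leu_leu)    # about 1.008
print(f"{fraction:.5f} {percent:.3f}")
```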

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.889 ± 0.098
AlaCys: 0.547 ± 0.026
AlaAsp: 3.58 ± 0.064
AlaGlu: 4.948 ± 0.072
AlaPhe: 3.401 ± 0.059
AlaGly: 5.647 ± 0.081
AlaHis: 1.295 ± 0.034
AlaIle: 5.508 ± 0.075
AlaLys: 4.977 ± 0.083
AlaLeu: 6.934 ± 0.081
AlaMet: 2.128 ± 0.045
AlaAsn: 2.559 ± 0.049
AlaPro: 2.226 ± 0.059
AlaGln: 2.072 ± 0.049
AlaArg: 2.538 ± 0.053
AlaSer: 4.344 ± 0.065
AlaThr: 2.768 ± 0.053
AlaVal: 5.609 ± 0.077
AlaTrp: 0.589 ± 0.026
AlaTyr: 2.214 ± 0.043
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.466 ± 0.024
CysCys: 0.107 ± 0.01
CysAsp: 0.382 ± 0.022
CysGlu: 0.474 ± 0.019
CysPhe: 0.328 ± 0.018
CysGly: 0.648 ± 0.023
CysHis: 0.201 ± 0.015
CysIle: 0.54 ± 0.022
CysLys: 0.368 ± 0.021
CysLeu: 0.662 ± 0.026
CysMet: 0.186 ± 0.014
CysAsn: 0.236 ± 0.017
CysPro: 0.401 ± 0.02
CysGln: 0.241 ± 0.015
CysArg: 0.321 ± 0.019
CysSer: 0.582 ± 0.023
CysThr: 0.398 ± 0.021
CysVal: 0.402 ± 0.021
CysTrp: 0.073 ± 0.008
CysTyr: 0.234 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.618 ± 0.063
AspCys: 0.41 ± 0.02
AspAsp: 2.405 ± 0.053
AspGlu: 4.402 ± 0.077
AspPhe: 2.543 ± 0.054
AspGly: 3.6 ± 0.057
AspHis: 1.215 ± 0.037
AspIle: 4.011 ± 0.056
AspLys: 3.193 ± 0.06
AspLeu: 5.058 ± 0.065
AspMet: 1.412 ± 0.041
AspAsn: 1.716 ± 0.045
AspPro: 2.101 ± 0.042
AspGln: 2.155 ± 0.046
AspArg: 2.246 ± 0.047
AspSer: 2.954 ± 0.053
AspThr: 2.278 ± 0.047
AspVal: 3.599 ± 0.059
AspTrp: 0.693 ± 0.029
AspTyr: 2.113 ± 0.045
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.411 ± 0.073
GluCys: 0.433 ± 0.022
GluAsp: 3.911 ± 0.065
GluGlu: 7.096 ± 0.115
GluPhe: 2.837 ± 0.039
GluGly: 4.577 ± 0.068
GluHis: 1.456 ± 0.034
GluIle: 5.638 ± 0.084
GluLys: 7.166 ± 0.096
GluLeu: 7.138 ± 0.101
GluMet: 2.545 ± 0.046
GluAsn: 3.722 ± 0.066
GluPro: 1.946 ± 0.048
GluGln: 2.913 ± 0.06
GluArg: 3.396 ± 0.068
GluSer: 3.767 ± 0.066
GluThr: 3.478 ± 0.064
GluVal: 4.726 ± 0.076
GluTrp: 0.921 ± 0.029
GluTyr: 2.227 ± 0.04
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.104 ± 0.057
PheCys: 0.372 ± 0.019
PheAsp: 2.389 ± 0.046
PheGlu: 2.939 ± 0.05
PhePhe: 2.48 ± 0.053
PheGly: 3.34 ± 0.067
PheHis: 1.2 ± 0.035
PheIle: 3.967 ± 0.076
PheLys: 2.691 ± 0.052
PheLeu: 4.751 ± 0.074
PheMet: 1.307 ± 0.04
PheAsn: 1.95 ± 0.045
PhePro: 1.749 ± 0.043
PheGln: 1.767 ± 0.044
PheArg: 1.656 ± 0.044
PheSer: 3.625 ± 0.063
PheThr: 2.519 ± 0.053
PheVal: 2.942 ± 0.055
PheTrp: 0.523 ± 0.023
PheTyr: 1.75 ± 0.042
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.906 ± 0.075
GlyCys: 0.625 ± 0.025
GlyAsp: 3.358 ± 0.059
GlyGlu: 4.65 ± 0.072
GlyPhe: 3.537 ± 0.069
GlyGly: 4.859 ± 0.084
GlyHis: 1.443 ± 0.039
GlyIle: 6.141 ± 0.086
GlyLys: 5.544 ± 0.069
GlyLeu: 6.639 ± 0.09
GlyMet: 2.302 ± 0.045
GlyAsn: 2.69 ± 0.052
GlyPro: 1.85 ± 0.061
GlyGln: 2.199 ± 0.049
GlyArg: 2.707 ± 0.055
GlySer: 4.32 ± 0.078
GlyThr: 4.039 ± 0.079
GlyVal: 4.849 ± 0.073
GlyTrp: 0.859 ± 0.032
GlyTyr: 2.77 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.314 ± 0.03
HisCys: 0.207 ± 0.013
HisAsp: 1.061 ± 0.029
HisGlu: 1.433 ± 0.037
HisPhe: 1.181 ± 0.031
HisGly: 1.439 ± 0.04
HisHis: 0.721 ± 0.029
HisIle: 1.458 ± 0.041
HisLys: 1.114 ± 0.028
HisLeu: 2.156 ± 0.046
HisMet: 0.532 ± 0.022
HisAsn: 0.749 ± 0.029
HisPro: 1.206 ± 0.04
HisGln: 0.869 ± 0.027
HisArg: 0.874 ± 0.03
HisSer: 1.354 ± 0.038
HisThr: 1.04 ± 0.03
HisVal: 1.335 ± 0.038
HisTrp: 0.262 ± 0.016
HisTyr: 0.922 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.699 ± 0.077
IleCys: 0.671 ± 0.027
IleAsp: 4.128 ± 0.066
IleGlu: 5.627 ± 0.08
IlePhe: 3.502 ± 0.06
IleGly: 6.052 ± 0.096
IleHis: 1.87 ± 0.044
IleIle: 5.941 ± 0.093
IleLys: 4.816 ± 0.066
IleLeu: 7.54 ± 0.092
IleMet: 1.954 ± 0.04
IleAsn: 3.113 ± 0.062
IlePro: 3.377 ± 0.051
IleGln: 2.922 ± 0.062
IleArg: 3.068 ± 0.053
IleSer: 5.272 ± 0.082
IleThr: 3.95 ± 0.067
IleVal: 5.08 ± 0.068
IleTrp: 0.681 ± 0.026
IleTyr: 2.495 ± 0.054
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.995 ± 0.074
LysCys: 0.371 ± 0.019
LysAsp: 4.396 ± 0.063
LysGlu: 7.317 ± 0.098
LysPhe: 2.117 ± 0.049
LysGly: 5.01 ± 0.066
LysHis: 1.307 ± 0.034
LysIle: 4.882 ± 0.068
LysLys: 6.614 ± 0.093
LysLeu: 5.984 ± 0.075
LysMet: 2.543 ± 0.048
LysAsn: 3.669 ± 0.067
LysPro: 2.413 ± 0.044
LysGln: 2.743 ± 0.057
LysArg: 3.309 ± 0.048
LysSer: 3.928 ± 0.063
LysThr: 3.737 ± 0.056
LysVal: 4.737 ± 0.064
LysTrp: 0.978 ± 0.034
LysTyr: 2.181 ± 0.047
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.22 ± 0.091
LeuCys: 0.658 ± 0.026
LeuAsp: 4.854 ± 0.073
LeuGlu: 6.52 ± 0.088
LeuPhe: 5.025 ± 0.091
LeuGly: 6.292 ± 0.087
LeuHis: 1.971 ± 0.048
LeuIle: 7.332 ± 0.097
LeuLys: 7.374 ± 0.078
LeuLeu: 10.08 ± 0.121
LeuMet: 2.77 ± 0.06
LeuAsn: 4.369 ± 0.065
LeuPro: 3.912 ± 0.069
LeuGln: 3.401 ± 0.055
LeuArg: 3.435 ± 0.064
LeuSer: 6.945 ± 0.091
LeuThr: 5.263 ± 0.067
LeuVal: 5.806 ± 0.075
LeuTrp: 0.852 ± 0.029
LeuTyr: 3.138 ± 0.053
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.257 ± 0.045
MetCys: 0.172 ± 0.013
MetAsp: 1.746 ± 0.039
MetGlu: 2.381 ± 0.047
MetPhe: 1.067 ± 0.035
MetGly: 2.005 ± 0.047
MetHis: 0.481 ± 0.021
MetIle: 2.323 ± 0.046
MetLys: 2.871 ± 0.051
MetLeu: 2.621 ± 0.048
MetMet: 1.005 ± 0.037
MetAsn: 1.826 ± 0.039
MetPro: 1.088 ± 0.03
MetGln: 0.884 ± 0.03
MetArg: 1.079 ± 0.034
MetSer: 1.69 ± 0.04
MetThr: 1.677 ± 0.038
MetVal: 1.755 ± 0.038
MetTrp: 0.197 ± 0.012
MetTyr: 0.706 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.789 ± 0.047
AsnCys: 0.303 ± 0.019
AsnAsp: 2.15 ± 0.05
AsnGlu: 3.583 ± 0.062
AsnPhe: 1.504 ± 0.037
AsnGly: 3.721 ± 0.076
AsnHis: 1.07 ± 0.03
AsnIle: 3.09 ± 0.056
AsnLys: 2.794 ± 0.051
AsnLeu: 3.806 ± 0.063
AsnMet: 1.217 ± 0.035
AsnAsn: 1.672 ± 0.045
AsnPro: 2.175 ± 0.041
AsnGln: 1.867 ± 0.043
AsnArg: 1.942 ± 0.035
AsnSer: 2.318 ± 0.05
AsnThr: 1.957 ± 0.049
AsnVal: 2.785 ± 0.054
AsnTrp: 0.491 ± 0.02
AsnTyr: 1.379 ± 0.041
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.462 ± 0.044
ProCys: 0.212 ± 0.016
ProAsp: 2.215 ± 0.047
ProGlu: 3.043 ± 0.051
ProPhe: 2.074 ± 0.044
ProGly: 2.472 ± 0.059
ProHis: 0.816 ± 0.024
ProIle: 2.851 ± 0.054
ProLys: 2.3 ± 0.047
ProLeu: 3.537 ± 0.056
ProMet: 0.914 ± 0.031
ProAsn: 1.577 ± 0.038
ProPro: 1.075 ± 0.033
ProGln: 1.157 ± 0.036
ProArg: 1.101 ± 0.033
ProSer: 2.547 ± 0.048
ProThr: 1.661 ± 0.038
ProVal: 2.949 ± 0.141
ProTrp: 0.392 ± 0.02
ProTyr: 1.489 ± 0.042
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.473 ± 0.041
GlnCys: 0.195 ± 0.013
GlnAsp: 1.621 ± 0.038
GlnGlu: 2.702 ± 0.05
GlnPhe: 1.655 ± 0.042
GlnGly: 2.146 ± 0.046
GlnHis: 0.706 ± 0.026
GlnIle: 2.39 ± 0.056
GlnLys: 2.979 ± 0.06
GlnLeu: 3.671 ± 0.063
GlnMet: 1.276 ± 0.034
GlnAsn: 1.644 ± 0.036
GlnPro: 1.17 ± 0.036
GlnGln: 1.521 ± 0.045
GlnArg: 1.416 ± 0.033
GlnSer: 2.244 ± 0.049
GlnThr: 1.889 ± 0.042
GlnVal: 2.074 ± 0.046
GlnTrp: 0.419 ± 0.018
GlnTyr: 1.35 ± 0.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.312 ± 0.052
ArgCys: 0.264 ± 0.015
ArgAsp: 2.025 ± 0.038
ArgGlu: 3.151 ± 0.059
ArgPhe: 1.929 ± 0.038
ArgGly: 2.365 ± 0.05
ArgHis: 0.776 ± 0.029
ArgIle: 3.078 ± 0.051
ArgLys: 3.329 ± 0.053
ArgLeu: 3.878 ± 0.06
ArgMet: 1.365 ± 0.036
ArgAsn: 1.832 ± 0.039
ArgPro: 1.283 ± 0.033
ArgGln: 1.394 ± 0.038
ArgArg: 1.797 ± 0.042
ArgSer: 2.283 ± 0.046
ArgThr: 1.984 ± 0.044
ArgVal: 2.456 ± 0.05
ArgTrp: 0.399 ± 0.02
ArgTyr: 1.505 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.054 ± 0.062
SerCys: 0.437 ± 0.021
SerAsp: 2.985 ± 0.057
SerGlu: 4.0 ± 0.058
SerPhe: 3.594 ± 0.06
SerGly: 4.749 ± 0.079
SerHis: 1.321 ± 0.036
SerIle: 5.531 ± 0.077
SerLys: 4.109 ± 0.064
SerLeu: 6.406 ± 0.09
SerMet: 1.932 ± 0.043
SerAsn: 2.487 ± 0.056
SerPro: 2.36 ± 0.045
SerGln: 2.015 ± 0.041
SerArg: 2.425 ± 0.049
SerSer: 4.387 ± 0.075
SerThr: 3.106 ± 0.054
SerVal: 4.144 ± 0.063
SerTrp: 0.68 ± 0.026
SerTyr: 2.221 ± 0.043
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.92 ± 0.07
ThrCys: 0.315 ± 0.016
ThrAsp: 2.525 ± 0.051
ThrGlu: 3.157 ± 0.055
ThrPhe: 2.51 ± 0.048
ThrGly: 4.053 ± 0.071
ThrHis: 0.987 ± 0.032
ThrIle: 4.249 ± 0.071
ThrLys: 3.132 ± 0.059
ThrLeu: 4.87 ± 0.051
ThrMet: 1.287 ± 0.032
ThrAsn: 2.02 ± 0.045
ThrPro: 2.195 ± 0.045
ThrGln: 1.274 ± 0.04
ThrArg: 1.707 ± 0.037
ThrSer: 3.123 ± 0.048
ThrThr: 2.399 ± 0.06
ThrVal: 3.814 ± 0.064
ThrTrp: 0.465 ± 0.021
ThrTyr: 1.676 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.235 ± 0.066
ValCys: 0.568 ± 0.024
ValAsp: 3.571 ± 0.058
ValGlu: 4.73 ± 0.078
ValPhe: 3.297 ± 0.063
ValGly: 4.197 ± 0.072
ValHis: 1.351 ± 0.037
ValIle: 5.424 ± 0.065
ValLys: 4.914 ± 0.068
ValLeu: 6.745 ± 0.091
ValMet: 1.901 ± 0.037
ValAsn: 2.844 ± 0.058
ValPro: 2.622 ± 0.053
ValGln: 2.372 ± 0.047
ValArg: 2.445 ± 0.046
ValSer: 4.392 ± 0.073
ValThr: 3.273 ± 0.05
ValVal: 4.644 ± 0.074
ValTrp: 0.611 ± 0.025
ValTyr: 2.231 ± 0.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.64 ± 0.024
TrpCys: 0.077 ± 0.008
TrpAsp: 0.562 ± 0.026
TrpGlu: 0.676 ± 0.025
TrpPhe: 0.521 ± 0.025
TrpGly: 0.731 ± 0.03
TrpHis: 0.21 ± 0.016
TrpIle: 0.983 ± 0.033
TrpLys: 0.845 ± 0.029
TrpLeu: 1.123 ± 0.035
TrpMet: 0.388 ± 0.022
TrpAsn: 0.576 ± 0.026
TrpPro: 0.273 ± 0.016
TrpGln: 0.322 ± 0.017
TrpArg: 0.398 ± 0.019
TrpSer: 0.613 ± 0.025
TrpThr: 0.561 ± 0.022
TrpVal: 0.642 ± 0.028
TrpTrp: 0.153 ± 0.013
TrpTyr: 0.337 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.01 ± 0.048
TyrCys: 0.308 ± 0.016
TyrAsp: 1.859 ± 0.048
TyrGlu: 2.474 ± 0.049
TyrPhe: 1.919 ± 0.048
TyrGly: 2.384 ± 0.054
TyrHis: 0.829 ± 0.028
TyrIle: 2.421 ± 0.048
TyrLys: 2.101 ± 0.049
TyrLeu: 3.617 ± 0.063
TyrMet: 0.868 ± 0.028
TyrAsn: 1.36 ± 0.032
TyrPro: 1.421 ± 0.038
TyrGln: 1.452 ± 0.034
TyrArg: 1.549 ± 0.038
TyrSer: 2.254 ± 0.042
TyrThr: 1.7 ± 0.041
TyrVal: 2.02 ± 0.049
TyrTrp: 0.394 ± 0.02
TyrTyr: 1.429 ± 0.035
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3956 proteins (1115423 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
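A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the reported error is taken to be the standard deviation of the resampled frequencies, and the function names, the seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_dipeptide_error(proteins, pair, n_resamples=100, seed=0):
    """Return (frequency, bootstrap standard deviation), both in per mille.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together in a resample.
    """
    if not proteins:
        raise ValueError("at least one protein sequence is required")
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]

    def frequency(sample):
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a missing key counts as 0
        return 1000.0 * hits / total if total else 0.0

    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        estimates.append(frequency(sample))
    return frequency(per_protein), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome)
toy = ["MKLLA", "MKA", "LLLL", "AKLL"]
point, err = bootstrap_dipeptide_error(toy, "LL")
print(f"LL: {point:.1f} ± {err:.1f}")
```

Resampling proteins rather than individual residues keeps the pairs within each protein together, which is presumably why the error is estimated at the protein level.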

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski