Amino acid dipepetide frequency for Flavobacterium cerinum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.712AlaAla: 5.712 ± 0.114
0.624AlaCys: 0.624 ± 0.03
3.745AlaAsp: 3.745 ± 0.061
4.103AlaGlu: 4.103 ± 0.078
3.408AlaPhe: 3.408 ± 0.057
5.077AlaGly: 5.077 ± 0.104
1.068AlaHis: 1.068 ± 0.034
5.507AlaIle: 5.507 ± 0.079
4.95AlaLys: 4.95 ± 0.092
6.663AlaLeu: 6.663 ± 0.099
1.629AlaMet: 1.629 ± 0.044
3.745AlaAsn: 3.745 ± 0.085
2.259AlaPro: 2.259 ± 0.073
2.692AlaGln: 2.692 ± 0.051
1.862AlaArg: 1.862 ± 0.047
4.383AlaSer: 4.383 ± 0.07
4.498AlaThr: 4.498 ± 0.167
4.89AlaVal: 4.89 ± 0.087
0.639AlaTrp: 0.639 ± 0.022
2.568AlaTyr: 2.568 ± 0.043
0.0AlaXaa: 0.0 ± 0.0
Cys
0.531CysAla: 0.531 ± 0.028
0.104CysCys: 0.104 ± 0.011
0.433CysAsp: 0.433 ± 0.02
0.493CysGlu: 0.493 ± 0.028
0.411CysPhe: 0.411 ± 0.019
0.663CysGly: 0.663 ± 0.03
0.176CysHis: 0.176 ± 0.013
0.657CysIle: 0.657 ± 0.03
0.468CysLys: 0.468 ± 0.022
0.696CysLeu: 0.696 ± 0.025
0.143CysMet: 0.143 ± 0.011
0.51CysAsn: 0.51 ± 0.026
0.32CysPro: 0.32 ± 0.021
0.199CysGln: 0.199 ± 0.013
0.237CysArg: 0.237 ± 0.014
0.7CysSer: 0.7 ± 0.033
0.572CysThr: 0.572 ± 0.036
0.487CysVal: 0.487 ± 0.028
0.069CysTrp: 0.069 ± 0.008
0.311CysTyr: 0.311 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
3.918AspAla: 3.918 ± 0.061
0.424AspCys: 0.424 ± 0.022
2.714AspAsp: 2.714 ± 0.051
3.369AspGlu: 3.369 ± 0.062
3.294AspPhe: 3.294 ± 0.059
3.755AspGly: 3.755 ± 0.08
0.884AspHis: 0.884 ± 0.03
4.471AspIle: 4.471 ± 0.063
4.305AspLys: 4.305 ± 0.081
4.663AspLeu: 4.663 ± 0.068
1.254AspMet: 1.254 ± 0.036
3.194AspAsn: 3.194 ± 0.058
1.763AspPro: 1.763 ± 0.045
1.394AspGln: 1.394 ± 0.034
1.854AspArg: 1.854 ± 0.04
2.958AspSer: 2.958 ± 0.064
2.921AspThr: 2.921 ± 0.048
3.554AspVal: 3.554 ± 0.054
0.644AspTrp: 0.644 ± 0.025
2.638AspTyr: 2.638 ± 0.049
0.0AspXaa: 0.0 ± 0.0
Glu
4.249GluAla: 4.249 ± 0.071
0.376GluCys: 0.376 ± 0.019
3.175GluAsp: 3.175 ± 0.065
4.208GluGlu: 4.208 ± 0.089
2.808GluPhe: 2.808 ± 0.051
3.722GluGly: 3.722 ± 0.065
0.991GluHis: 0.991 ± 0.029
5.085GluIle: 5.085 ± 0.085
5.67GluLys: 5.67 ± 0.101
5.605GluLeu: 5.605 ± 0.086
1.652GluMet: 1.652 ± 0.035
4.117GluAsn: 4.117 ± 0.08
1.637GluPro: 1.637 ± 0.042
2.237GluGln: 2.237 ± 0.046
2.302GluArg: 2.302 ± 0.054
3.025GluSer: 3.025 ± 0.054
3.482GluThr: 3.482 ± 0.059
4.076GluVal: 4.076 ± 0.06
0.649GluTrp: 0.649 ± 0.024
2.578GluTyr: 2.578 ± 0.054
0.0GluXaa: 0.0 ± 0.0
Phe
3.174PheAla: 3.174 ± 0.052
0.522PheCys: 0.522 ± 0.028
3.115PheAsp: 3.115 ± 0.05
3.195PheGlu: 3.195 ± 0.058
2.653PhePhe: 2.653 ± 0.055
3.436PheGly: 3.436 ± 0.064
0.79PheHis: 0.79 ± 0.03
3.865PheIle: 3.865 ± 0.073
3.471PheLys: 3.471 ± 0.064
4.254PheLeu: 4.254 ± 0.076
1.145PheMet: 1.145 ± 0.034
3.22PheAsn: 3.22 ± 0.063
1.623PhePro: 1.623 ± 0.04
1.27PheGln: 1.27 ± 0.035
1.561PheArg: 1.561 ± 0.038
3.791PheSer: 3.791 ± 0.066
3.675PheThr: 3.675 ± 0.087
2.869PheVal: 2.869 ± 0.063
0.513PheTrp: 0.513 ± 0.02
2.114PheTyr: 2.114 ± 0.047
0.0PheXaa: 0.0 ± 0.0
Gly
4.508GlyAla: 4.508 ± 0.085
0.769GlyCys: 0.769 ± 0.045
3.304GlyAsp: 3.304 ± 0.054
3.507GlyGlu: 3.507 ± 0.056
3.508GlyPhe: 3.508 ± 0.064
4.77GlyGly: 4.77 ± 0.114
1.097GlyHis: 1.097 ± 0.032
5.52GlyIle: 5.52 ± 0.073
4.944GlyLys: 4.944 ± 0.08
5.746GlyLeu: 5.746 ± 0.07
1.65GlyMet: 1.65 ± 0.039
3.936GlyAsn: 3.936 ± 0.093
1.431GlyPro: 1.431 ± 0.053
2.034GlyGln: 2.034 ± 0.049
2.021GlyArg: 2.021 ± 0.048
4.412GlySer: 4.412 ± 0.084
5.188GlyThr: 5.188 ± 0.211
4.328GlyVal: 4.328 ± 0.075
0.777GlyTrp: 0.777 ± 0.027
2.905GlyTyr: 2.905 ± 0.049
0.0GlyXaa: 0.0 ± 0.0
His
0.945HisAla: 0.945 ± 0.029
0.178HisCys: 0.178 ± 0.012
0.879HisAsp: 0.879 ± 0.031
0.971HisGlu: 0.971 ± 0.036
1.013HisPhe: 1.013 ± 0.031
1.011HisGly: 1.011 ± 0.033
0.405HisHis: 0.405 ± 0.021
1.344HisIle: 1.344 ± 0.032
1.128HisLys: 1.128 ± 0.035
1.604HisLeu: 1.604 ± 0.045
0.317HisMet: 0.317 ± 0.016
0.993HisAsn: 0.993 ± 0.027
0.796HisPro: 0.796 ± 0.026
0.567HisGln: 0.567 ± 0.024
0.599HisArg: 0.599 ± 0.022
1.049HisSer: 1.049 ± 0.029
0.973HisThr: 0.973 ± 0.029
0.874HisVal: 0.874 ± 0.029
0.173HisTrp: 0.173 ± 0.013
0.799HisTyr: 0.799 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
5.97IleAla: 5.97 ± 0.082
0.67IleCys: 0.67 ± 0.029
4.6IleAsp: 4.6 ± 0.07
5.114IleGlu: 5.114 ± 0.076
3.345IlePhe: 3.345 ± 0.068
4.892IleGly: 4.892 ± 0.075
1.276IleHis: 1.276 ± 0.036
6.016IleIle: 6.016 ± 0.091
5.851IleLys: 5.851 ± 0.1
6.501IleLeu: 6.501 ± 0.102
1.527IleMet: 1.527 ± 0.042
4.799IleAsn: 4.799 ± 0.075
3.169IlePro: 3.169 ± 0.053
2.471IleGln: 2.471 ± 0.052
2.474IleArg: 2.474 ± 0.058
5.28IleSer: 5.28 ± 0.071
5.651IleThr: 5.651 ± 0.125
5.006IleVal: 5.006 ± 0.076
0.637IleTrp: 0.637 ± 0.025
2.897IleTyr: 2.897 ± 0.049
0.0IleXaa: 0.0 ± 0.0
Lys
5.211LysAla: 5.211 ± 0.083
0.37LysCys: 0.37 ± 0.018
4.491LysAsp: 4.491 ± 0.082
5.918LysGlu: 5.918 ± 0.103
2.815LysPhe: 2.815 ± 0.06
4.415LysGly: 4.415 ± 0.067
1.24LysHis: 1.24 ± 0.04
5.988LysIle: 5.988 ± 0.1
6.735LysLys: 6.735 ± 0.105
6.251LysLeu: 6.251 ± 0.089
2.218LysMet: 2.218 ± 0.049
4.949LysAsn: 4.949 ± 0.076
2.519LysPro: 2.519 ± 0.049
2.721LysGln: 2.721 ± 0.06
2.61LysArg: 2.61 ± 0.062
4.124LysSer: 4.124 ± 0.075
4.677LysThr: 4.677 ± 0.078
4.525LysVal: 4.525 ± 0.07
0.799LysTrp: 0.799 ± 0.028
2.983LysTyr: 2.983 ± 0.054
0.0LysXaa: 0.0 ± 0.0
Leu
6.024LeuAla: 6.024 ± 0.082
0.716LeuCys: 0.716 ± 0.026
4.632LeuAsp: 4.632 ± 0.076
5.404LeuGlu: 5.404 ± 0.082
4.708LeuPhe: 4.708 ± 0.073
5.49LeuGly: 5.49 ± 0.085
1.555LeuHis: 1.555 ± 0.039
6.387LeuIle: 6.387 ± 0.095
7.056LeuLys: 7.056 ± 0.115
8.934LeuLeu: 8.934 ± 0.126
2.09LeuMet: 2.09 ± 0.055
5.348LeuAsn: 5.348 ± 0.079
3.729LeuPro: 3.729 ± 0.053
3.257LeuGln: 3.257 ± 0.052
2.945LeuArg: 2.945 ± 0.052
6.608LeuSer: 6.608 ± 0.085
5.615LeuThr: 5.615 ± 0.109
5.131LeuVal: 5.131 ± 0.067
0.775LeuTrp: 0.775 ± 0.029
3.267LeuTyr: 3.267 ± 0.055
0.0LeuXaa: 0.0 ± 0.0
Met
1.827MetAla: 1.827 ± 0.039
0.144MetCys: 0.144 ± 0.01
1.19MetAsp: 1.19 ± 0.033
1.501MetGlu: 1.501 ± 0.041
0.902MetPhe: 0.902 ± 0.028
1.509MetGly: 1.509 ± 0.042
0.415MetHis: 0.415 ± 0.022
1.553MetIle: 1.553 ± 0.044
2.325MetLys: 2.325 ± 0.047
2.061MetLeu: 2.061 ± 0.05
0.608MetMet: 0.608 ± 0.025
1.255MetAsn: 1.255 ± 0.031
0.923MetPro: 0.923 ± 0.033
0.828MetGln: 0.828 ± 0.026
0.889MetArg: 0.889 ± 0.026
1.431MetSer: 1.431 ± 0.031
1.331MetThr: 1.331 ± 0.032
1.407MetVal: 1.407 ± 0.035
0.186MetTrp: 0.186 ± 0.014
0.768MetTyr: 0.768 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
4.126AsnAla: 4.126 ± 0.065
0.465AsnCys: 0.465 ± 0.025
3.3AsnAsp: 3.3 ± 0.051
3.539AsnGlu: 3.539 ± 0.06
2.905AsnPhe: 2.905 ± 0.056
4.463AsnGly: 4.463 ± 0.107
0.951AsnHis: 0.951 ± 0.031
4.677AsnIle: 4.677 ± 0.071
4.166AsnLys: 4.166 ± 0.07
4.99AsnLeu: 4.99 ± 0.075
1.273AsnMet: 1.273 ± 0.032
4.063AsnAsn: 4.063 ± 0.082
2.78AsnPro: 2.78 ± 0.058
1.814AsnGln: 1.814 ± 0.044
2.043AsnArg: 2.043 ± 0.043
3.812AsnSer: 3.812 ± 0.064
3.994AsnThr: 3.994 ± 0.085
3.735AsnVal: 3.735 ± 0.067
0.719AsnTrp: 0.719 ± 0.031
2.892AsnTyr: 2.892 ± 0.065
0.0AsnXaa: 0.0 ± 0.0
Pro
2.731ProAla: 2.731 ± 0.082
0.227ProCys: 0.227 ± 0.016
2.188ProAsp: 2.188 ± 0.043
2.677ProGlu: 2.677 ± 0.052
1.881ProPhe: 1.881 ± 0.038
2.413ProGly: 2.413 ± 0.064
0.553ProHis: 0.553 ± 0.024
2.585ProIle: 2.585 ± 0.05
2.391ProLys: 2.391 ± 0.052
3.038ProLeu: 3.038 ± 0.058
0.742ProMet: 0.742 ± 0.024
2.164ProAsn: 2.164 ± 0.043
0.924ProPro: 0.924 ± 0.035
1.244ProGln: 1.244 ± 0.038
0.866ProArg: 0.866 ± 0.03
2.157ProSer: 2.157 ± 0.046
2.241ProThr: 2.241 ± 0.081
2.974ProVal: 2.974 ± 0.07
0.283ProTrp: 0.283 ± 0.013
1.45ProTyr: 1.45 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
2.144GlnAla: 2.144 ± 0.049
0.227GlnCys: 0.227 ± 0.017
1.628GlnAsp: 1.628 ± 0.038
2.176GlnGlu: 2.176 ± 0.051
1.632GlnPhe: 1.632 ± 0.037
1.875GlnGly: 1.875 ± 0.057
0.574GlnHis: 0.574 ± 0.024
2.408GlnIle: 2.408 ± 0.046
2.733GlnLys: 2.733 ± 0.059
3.334GlnLeu: 3.334 ± 0.058
0.876GlnMet: 0.876 ± 0.029
2.089GlnAsn: 2.089 ± 0.049
1.209GlnPro: 1.209 ± 0.038
1.496GlnGln: 1.496 ± 0.039
1.155GlnArg: 1.155 ± 0.034
1.867GlnSer: 1.867 ± 0.042
1.938GlnThr: 1.938 ± 0.052
1.983GlnVal: 1.983 ± 0.036
0.4GlnTrp: 0.4 ± 0.021
1.43GlnTyr: 1.43 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
1.959ArgAla: 1.959 ± 0.042
0.187ArgCys: 0.187 ± 0.013
1.69ArgAsp: 1.69 ± 0.043
2.129ArgGlu: 2.129 ± 0.049
1.824ArgPhe: 1.824 ± 0.039
1.739ArgGly: 1.739 ± 0.041
0.569ArgHis: 0.569 ± 0.023
2.725ArgIle: 2.725 ± 0.056
2.638ArgLys: 2.638 ± 0.056
3.073ArgLeu: 3.073 ± 0.063
0.911ArgMet: 0.911 ± 0.029
1.95ArgAsn: 1.95 ± 0.041
1.072ArgPro: 1.072 ± 0.032
1.14ArgGln: 1.14 ± 0.032
1.225ArgArg: 1.225 ± 0.035
1.801ArgSer: 1.801 ± 0.046
1.787ArgThr: 1.787 ± 0.037
2.028ArgVal: 2.028 ± 0.044
0.372ArgTrp: 0.372 ± 0.021
1.456ArgTyr: 1.456 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
4.179SerAla: 4.179 ± 0.076
0.724SerCys: 0.724 ± 0.033
3.231SerAsp: 3.231 ± 0.049
3.445SerGlu: 3.445 ± 0.052
3.696SerPhe: 3.696 ± 0.055
5.071SerGly: 5.071 ± 0.107
1.127SerHis: 1.127 ± 0.036
4.995SerIle: 4.995 ± 0.073
4.277SerLys: 4.277 ± 0.076
5.871SerLeu: 5.871 ± 0.087
1.307SerMet: 1.307 ± 0.033
3.534SerAsn: 3.534 ± 0.069
2.344SerPro: 2.344 ± 0.057
2.073SerGln: 2.073 ± 0.042
2.082SerArg: 2.082 ± 0.047
4.03SerSer: 4.03 ± 0.076
3.831SerThr: 3.831 ± 0.094
4.082SerVal: 4.082 ± 0.074
0.713SerTrp: 0.713 ± 0.036
2.71SerTyr: 2.71 ± 0.054
0.0SerXaa: 0.0 ± 0.0
Thr
5.287ThrAla: 5.287 ± 0.196
0.43ThrCys: 0.43 ± 0.03
3.576ThrAsp: 3.576 ± 0.063
3.487ThrGlu: 3.487 ± 0.052
3.145ThrPhe: 3.145 ± 0.065
5.189ThrGly: 5.189 ± 0.158
0.977ThrHis: 0.977 ± 0.032
5.574ThrIle: 5.574 ± 0.122
3.897ThrLys: 3.897 ± 0.07
5.761ThrLeu: 5.761 ± 0.096
1.125ThrMet: 1.125 ± 0.031
3.569ThrAsn: 3.569 ± 0.084
2.979ThrPro: 2.979 ± 0.084
2.036ThrGln: 2.036 ± 0.055
1.652ThrArg: 1.652 ± 0.035
3.937ThrSer: 3.937 ± 0.1
4.44ThrThr: 4.44 ± 0.13
4.989ThrVal: 4.989 ± 0.157
0.59ThrTrp: 0.59 ± 0.026
2.628ThrTyr: 2.628 ± 0.087
0.0ThrXaa: 0.0 ± 0.0
Val
4.339ValAla: 4.339 ± 0.074
0.578ValCys: 0.578 ± 0.028
3.258ValAsp: 3.258 ± 0.061
3.565ValGlu: 3.565 ± 0.052
3.345ValPhe: 3.345 ± 0.057
3.703ValGly: 3.703 ± 0.066
0.989ValHis: 0.989 ± 0.03
5.13ValIle: 5.13 ± 0.078
4.691ValLys: 4.691 ± 0.083
6.017ValLeu: 6.017 ± 0.077
1.469ValMet: 1.469 ± 0.037
3.876ValAsn: 3.876 ± 0.069
2.524ValPro: 2.524 ± 0.055
1.916ValGln: 1.916 ± 0.043
1.998ValArg: 1.998 ± 0.049
4.479ValSer: 4.479 ± 0.072
4.833ValThr: 4.833 ± 0.16
4.53ValVal: 4.53 ± 0.081
0.664ValTrp: 0.664 ± 0.028
2.532ValTyr: 2.532 ± 0.046
0.0ValXaa: 0.0 ± 0.0
Trp
0.646TrpAla: 0.646 ± 0.026
0.098TrpCys: 0.098 ± 0.01
0.587TrpAsp: 0.587 ± 0.023
0.588TrpGlu: 0.588 ± 0.026
0.541TrpPhe: 0.541 ± 0.022
0.651TrpGly: 0.651 ± 0.028
0.206TrpHis: 0.206 ± 0.015
0.712TrpIle: 0.712 ± 0.025
0.776TrpLys: 0.776 ± 0.029
0.944TrpLeu: 0.944 ± 0.032
0.283TrpMet: 0.283 ± 0.014
0.669TrpAsn: 0.669 ± 0.022
0.223TrpPro: 0.223 ± 0.014
0.404TrpGln: 0.404 ± 0.018
0.349TrpArg: 0.349 ± 0.017
0.639TrpSer: 0.639 ± 0.031
0.608TrpThr: 0.608 ± 0.028
0.641TrpVal: 0.641 ± 0.027
0.116TrpTrp: 0.116 ± 0.01
0.476TrpTyr: 0.476 ± 0.033
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.542TyrAla: 2.542 ± 0.048
0.347TyrCys: 0.347 ± 0.019
2.235TyrAsp: 2.235 ± 0.052
2.256TyrGlu: 2.256 ± 0.053
2.421TyrPhe: 2.421 ± 0.048
2.509TyrGly: 2.509 ± 0.054
0.774TyrHis: 0.774 ± 0.029
2.945TyrIle: 2.945 ± 0.057
3.087TyrLys: 3.087 ± 0.06
3.665TyrLeu: 3.665 ± 0.056
0.832TyrMet: 0.832 ± 0.027
2.728TyrAsn: 2.728 ± 0.056
1.506TyrPro: 1.506 ± 0.037
1.383TyrGln: 1.383 ± 0.044
1.551TyrArg: 1.551 ± 0.041
2.838TyrSer: 2.838 ± 0.057
3.014TyrThr: 3.014 ± 0.079
2.33TyrVal: 2.33 ± 0.052
0.44TyrTrp: 0.44 ± 0.022
2.047TyrTyr: 2.047 ± 0.057
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3673 proteins (1213498 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski