Amino acid dipepetide frequency for Vombatus ursinus (Common wombat)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.892AlaAla: 5.892 ± 0.03
1.285AlaCys: 1.285 ± 0.011
2.761AlaAsp: 2.761 ± 0.013
4.398AlaGlu: 4.398 ± 0.026
2.698AlaPhe: 2.698 ± 0.017
4.148AlaGly: 4.148 ± 0.021
1.432AlaHis: 1.432 ± 0.009
2.878AlaIle: 2.878 ± 0.015
3.393AlaLys: 3.393 ± 0.018
6.764AlaLeu: 6.764 ± 0.027
1.446AlaMet: 1.446 ± 0.009
2.072AlaAsn: 2.072 ± 0.011
3.516AlaPro: 3.516 ± 0.027
2.963AlaGln: 2.963 ± 0.02
3.181AlaArg: 3.181 ± 0.016
5.436AlaSer: 5.436 ± 0.021
3.29AlaThr: 3.29 ± 0.014
4.372AlaVal: 4.372 ± 0.019
0.739AlaTrp: 0.739 ± 0.007
1.522AlaTyr: 1.522 ± 0.011
0.001AlaXaa: 0.001 ± 0.0
Cys
1.159CysAla: 1.159 ± 0.01
0.612CysCys: 0.612 ± 0.009
1.057CysAsp: 1.057 ± 0.011
1.277CysGlu: 1.277 ± 0.012
0.876CysPhe: 0.876 ± 0.007
1.735CysGly: 1.735 ± 0.021
0.681CysHis: 0.681 ± 0.008
1.026CysIle: 1.026 ± 0.01
1.189CysLys: 1.189 ± 0.011
2.189CysLeu: 2.189 ± 0.013
0.428CysMet: 0.428 ± 0.005
0.986CysAsn: 0.986 ± 0.012
1.335CysPro: 1.335 ± 0.016
1.095CysGln: 1.095 ± 0.012
1.21CysArg: 1.21 ± 0.01
1.972CysSer: 1.972 ± 0.015
1.089CysThr: 1.089 ± 0.009
1.262CysVal: 1.262 ± 0.011
0.278CysTrp: 0.278 ± 0.005
0.609CysTyr: 0.609 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
2.673AspAla: 2.673 ± 0.015
1.062AspCys: 1.062 ± 0.011
2.696AspAsp: 2.696 ± 0.018
3.505AspGlu: 3.505 ± 0.019
2.182AspPhe: 2.182 ± 0.012
3.306AspGly: 3.306 ± 0.017
1.153AspHis: 1.153 ± 0.008
2.731AspIle: 2.731 ± 0.014
2.577AspLys: 2.577 ± 0.016
5.081AspLeu: 5.081 ± 0.019
1.118AspMet: 1.118 ± 0.008
1.8AspAsn: 1.8 ± 0.011
2.817AspPro: 2.817 ± 0.014
1.905AspGln: 1.905 ± 0.011
2.376AspArg: 2.376 ± 0.013
4.162AspSer: 4.162 ± 0.021
2.421AspThr: 2.421 ± 0.013
3.032AspVal: 3.032 ± 0.015
0.643AspTrp: 0.643 ± 0.006
1.534AspTyr: 1.534 ± 0.011
0.0AspXaa: 0.0 ± 0.0
Glu
4.865GluAla: 4.865 ± 0.028
1.567GluCys: 1.567 ± 0.022
4.379GluAsp: 4.379 ± 0.02
7.909GluGlu: 7.909 ± 0.046
2.1GluPhe: 2.1 ± 0.012
4.136GluGly: 4.136 ± 0.018
1.523GluHis: 1.523 ± 0.009
3.379GluIle: 3.379 ± 0.017
5.788GluLys: 5.788 ± 0.038
6.483GluLeu: 6.483 ± 0.032
1.752GluMet: 1.752 ± 0.01
3.306GluAsn: 3.306 ± 0.018
2.999GluPro: 2.999 ± 0.017
3.113GluGln: 3.113 ± 0.017
3.961GluArg: 3.961 ± 0.025
4.503GluSer: 4.503 ± 0.021
3.476GluThr: 3.476 ± 0.014
4.131GluVal: 4.131 ± 0.017
0.697GluTrp: 0.697 ± 0.007
1.635GluTyr: 1.635 ± 0.012
0.001GluXaa: 0.001 ± 0.0
Phe
1.974PheAla: 1.974 ± 0.011
0.96PheCys: 0.96 ± 0.009
1.751PheAsp: 1.751 ± 0.011
2.044PheGlu: 2.044 ± 0.011
1.77PhePhe: 1.77 ± 0.016
2.28PheGly: 2.28 ± 0.017
1.083PheHis: 1.083 ± 0.008
2.005PheIle: 2.005 ± 0.013
1.842PheLys: 1.842 ± 0.012
4.347PheLeu: 4.347 ± 0.023
0.789PheMet: 0.789 ± 0.007
1.417PheAsn: 1.417 ± 0.009
2.064PhePro: 2.064 ± 0.013
1.871PheGln: 1.871 ± 0.011
1.982PheArg: 1.982 ± 0.013
3.62PheSer: 3.62 ± 0.017
2.143PheThr: 2.143 ± 0.014
2.203PheVal: 2.203 ± 0.012
0.504PheTrp: 0.504 ± 0.006
1.262PheTyr: 1.262 ± 0.01
0.001PheXaa: 0.001 ± 0.0
Gly
3.971GlyAla: 3.971 ± 0.022
1.215GlyCys: 1.215 ± 0.01
3.084GlyAsp: 3.084 ± 0.015
4.239GlyGlu: 4.239 ± 0.027
2.423GlyPhe: 2.423 ± 0.014
4.928GlyGly: 4.928 ± 0.036
1.666GlyHis: 1.666 ± 0.011
2.954GlyIle: 2.954 ± 0.016
3.99GlyLys: 3.99 ± 0.02
5.663GlyLeu: 5.663 ± 0.021
1.328GlyMet: 1.328 ± 0.009
2.539GlyAsn: 2.539 ± 0.014
3.816GlyPro: 3.816 ± 0.04
2.737GlyGln: 2.737 ± 0.017
3.494GlyArg: 3.494 ± 0.02
5.639GlySer: 5.639 ± 0.024
3.522GlyThr: 3.522 ± 0.021
3.413GlyVal: 3.413 ± 0.018
0.78GlyTrp: 0.78 ± 0.009
1.787GlyTyr: 1.787 ± 0.012
0.002GlyXaa: 0.002 ± 0.0
His
1.238HisAla: 1.238 ± 0.009
0.73HisCys: 0.73 ± 0.008
0.891HisAsp: 0.891 ± 0.008
1.348HisGlu: 1.348 ± 0.009
1.124HisPhe: 1.124 ± 0.01
1.511HisGly: 1.511 ± 0.011
0.919HisHis: 0.919 ± 0.01
1.336HisIle: 1.336 ± 0.01
1.31HisLys: 1.31 ± 0.01
2.938HisLeu: 2.938 ± 0.014
0.576HisMet: 0.576 ± 0.006
0.929HisAsn: 0.929 ± 0.007
1.599HisPro: 1.599 ± 0.011
1.516HisGln: 1.516 ± 0.013
1.523HisArg: 1.523 ± 0.011
2.313HisSer: 2.313 ± 0.012
1.557HisThr: 1.557 ± 0.016
1.449HisVal: 1.449 ± 0.01
0.356HisTrp: 0.356 ± 0.005
0.833HisTyr: 0.833 ± 0.007
0.0HisXaa: 0.0 ± 0.0
Ile
2.676IleAla: 2.676 ± 0.014
1.168IleCys: 1.168 ± 0.009
2.198IleAsp: 2.198 ± 0.015
2.743IleGlu: 2.743 ± 0.016
2.077IlePhe: 2.077 ± 0.014
2.399IleGly: 2.399 ± 0.013
1.552IleHis: 1.552 ± 0.015
2.66IleIle: 2.66 ± 0.014
2.773IleLys: 2.773 ± 0.015
4.993IleLeu: 4.993 ± 0.023
1.072IleMet: 1.072 ± 0.009
1.993IleAsn: 1.993 ± 0.013
2.779IlePro: 2.779 ± 0.014
2.494IleGln: 2.494 ± 0.015
2.484IleArg: 2.484 ± 0.013
4.04IleSer: 4.04 ± 0.017
2.761IleThr: 2.761 ± 0.015
2.633IleVal: 2.633 ± 0.014
0.559IleTrp: 0.559 ± 0.006
1.505IleTyr: 1.505 ± 0.011
0.001IleXaa: 0.001 ± 0.0
Lys
4.032LysAla: 4.032 ± 0.022
1.222LysCys: 1.222 ± 0.012
3.239LysAsp: 3.239 ± 0.017
5.338LysGlu: 5.338 ± 0.031
1.777LysPhe: 1.777 ± 0.011
3.338LysGly: 3.338 ± 0.02
1.449LysHis: 1.449 ± 0.011
3.001LysIle: 3.001 ± 0.018
4.895LysLys: 4.895 ± 0.027
5.419LysLeu: 5.419 ± 0.025
1.511LysMet: 1.511 ± 0.01
2.585LysAsn: 2.585 ± 0.014
3.174LysPro: 3.174 ± 0.023
2.698LysGln: 2.698 ± 0.015
3.303LysArg: 3.303 ± 0.018
4.049LysSer: 4.049 ± 0.02
3.188LysThr: 3.188 ± 0.015
3.602LysVal: 3.602 ± 0.017
0.645LysTrp: 0.645 ± 0.006
1.635LysTyr: 1.635 ± 0.012
0.0LysXaa: 0.0 ± 0.0
Leu
6.433LeuAla: 6.433 ± 0.025
2.155LeuCys: 2.155 ± 0.012
4.802LeuAsp: 4.802 ± 0.021
7.281LeuGlu: 7.281 ± 0.039
3.577LeuPhe: 3.577 ± 0.022
5.763LeuGly: 5.763 ± 0.023
2.769LeuHis: 2.769 ± 0.017
4.27LeuIle: 4.27 ± 0.022
6.075LeuLys: 6.075 ± 0.028
10.75LeuLeu: 10.75 ± 0.047
2.069LeuMet: 2.069 ± 0.011
3.812LeuAsn: 3.812 ± 0.016
5.849LeuPro: 5.849 ± 0.025
5.743LeuGln: 5.743 ± 0.03
5.688LeuArg: 5.688 ± 0.026
8.114LeuSer: 8.114 ± 0.03
5.278LeuThr: 5.278 ± 0.02
5.514LeuVal: 5.514 ± 0.022
1.146LeuTrp: 1.146 ± 0.008
2.663LeuTyr: 2.663 ± 0.014
0.001LeuXaa: 0.001 ± 0.0
Met
1.841MetAla: 1.841 ± 0.011
0.4MetCys: 0.4 ± 0.006
1.278MetAsp: 1.278 ± 0.008
1.94MetGlu: 1.94 ± 0.011
0.767MetPhe: 0.767 ± 0.008
1.315MetGly: 1.315 ± 0.009
0.464MetHis: 0.464 ± 0.005
0.941MetIle: 0.941 ± 0.008
1.538MetLys: 1.538 ± 0.01
2.024MetLeu: 2.024 ± 0.012
0.614MetMet: 0.614 ± 0.007
0.958MetAsn: 0.958 ± 0.008
1.062MetPro: 1.062 ± 0.009
0.933MetGln: 0.933 ± 0.008
1.001MetArg: 1.001 ± 0.008
1.582MetSer: 1.582 ± 0.009
1.177MetThr: 1.177 ± 0.008
1.415MetVal: 1.415 ± 0.01
0.241MetTrp: 0.241 ± 0.004
0.616MetTyr: 0.616 ± 0.006
0.0MetXaa: 0.0 ± 0.0
Asn
2.069AsnAla: 2.069 ± 0.012
0.91AsnCys: 0.91 ± 0.009
1.625AsnAsp: 1.625 ± 0.01
2.508AsnGlu: 2.508 ± 0.015
1.61AsnPhe: 1.61 ± 0.009
2.597AsnGly: 2.597 ± 0.015
1.026AsnHis: 1.026 ± 0.008
2.312AsnIle: 2.312 ± 0.014
2.363AsnLys: 2.363 ± 0.014
4.01AsnLeu: 4.01 ± 0.017
0.965AsnMet: 0.965 ± 0.007
1.664AsnAsn: 1.664 ± 0.011
2.251AsnPro: 2.251 ± 0.012
1.81AsnGln: 1.81 ± 0.011
1.919AsnArg: 1.919 ± 0.01
3.366AsnSer: 3.366 ± 0.02
2.09AsnThr: 2.09 ± 0.011
2.337AsnVal: 2.337 ± 0.012
0.484AsnTrp: 0.484 ± 0.006
1.186AsnTyr: 1.186 ± 0.01
0.0AsnXaa: 0.0 ± 0.0
Pro
4.11ProAla: 4.11 ± 0.028
1.091ProCys: 1.091 ± 0.012
2.718ProAsp: 2.718 ± 0.013
4.199ProGlu: 4.199 ± 0.018
2.009ProPhe: 2.009 ± 0.014
4.75ProGly: 4.75 ± 0.054
1.409ProHis: 1.409 ± 0.01
2.02ProIle: 2.02 ± 0.013
2.795ProLys: 2.795 ± 0.018
5.183ProLeu: 5.183 ± 0.022
1.045ProMet: 1.045 ± 0.008
1.954ProAsn: 1.954 ± 0.012
5.641ProPro: 5.641 ± 0.054
2.691ProGln: 2.691 ± 0.019
2.966ProArg: 2.966 ± 0.019
5.5ProSer: 5.5 ± 0.03
2.97ProThr: 2.97 ± 0.018
3.624ProVal: 3.624 ± 0.021
0.66ProTrp: 0.66 ± 0.006
1.583ProTyr: 1.583 ± 0.014
0.001ProXaa: 0.001 ± 0.0
Gln
3.274GlnAla: 3.274 ± 0.019
1.003GlnCys: 1.003 ± 0.01
2.312GlnAsp: 2.312 ± 0.011
3.886GlnGlu: 3.886 ± 0.024
1.414GlnPhe: 1.414 ± 0.009
2.768GlnGly: 2.768 ± 0.018
1.361GlnHis: 1.361 ± 0.011
2.15GlnIle: 2.15 ± 0.014
3.068GlnLys: 3.068 ± 0.02
4.81GlnLeu: 4.81 ± 0.028
1.146GlnMet: 1.146 ± 0.009
1.99GlnAsn: 1.99 ± 0.012
2.617GlnPro: 2.617 ± 0.019
3.121GlnGln: 3.121 ± 0.032
2.939GlnArg: 2.939 ± 0.016
3.232GlnSer: 3.232 ± 0.016
2.327GlnThr: 2.327 ± 0.014
2.824GlnVal: 2.824 ± 0.014
0.552GlnTrp: 0.552 ± 0.005
1.213GlnTyr: 1.213 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
3.426ArgAla: 3.426 ± 0.018
1.156ArgCys: 1.156 ± 0.012
2.636ArgAsp: 2.636 ± 0.014
3.856ArgGlu: 3.856 ± 0.024
1.837ArgPhe: 1.837 ± 0.01
3.316ArgGly: 3.316 ± 0.021
1.469ArgHis: 1.469 ± 0.009
2.62ArgIle: 2.62 ± 0.015
3.677ArgLys: 3.677 ± 0.02
5.171ArgLeu: 5.171 ± 0.024
1.166ArgMet: 1.166 ± 0.007
2.182ArgAsn: 2.182 ± 0.013
2.859ArgPro: 2.859 ± 0.02
2.549ArgGln: 2.549 ± 0.014
3.966ArgArg: 3.966 ± 0.023
4.056ArgSer: 4.056 ± 0.027
2.752ArgThr: 2.752 ± 0.013
3.003ArgVal: 3.003 ± 0.015
0.661ArgTrp: 0.661 ± 0.006
1.491ArgTyr: 1.491 ± 0.01
0.0ArgXaa: 0.0 ± 0.0
Ser
4.914SerAla: 4.914 ± 0.019
1.852SerCys: 1.852 ± 0.014
3.952SerAsp: 3.952 ± 0.018
5.195SerGlu: 5.195 ± 0.021
3.216SerPhe: 3.216 ± 0.015
5.545SerGly: 5.545 ± 0.024
2.176SerHis: 2.176 ± 0.012
3.496SerIle: 3.496 ± 0.018
4.203SerLys: 4.203 ± 0.021
8.393SerLeu: 8.393 ± 0.029
1.636SerMet: 1.636 ± 0.01
2.882SerAsn: 2.882 ± 0.014
5.855SerPro: 5.855 ± 0.034
3.94SerGln: 3.94 ± 0.019
4.335SerArg: 4.335 ± 0.025
9.49SerSer: 9.49 ± 0.051
4.51SerThr: 4.51 ± 0.022
4.889SerVal: 4.889 ± 0.02
1.053SerTrp: 1.053 ± 0.009
2.179SerTyr: 2.179 ± 0.013
0.001SerXaa: 0.001 ± 0.0
Thr
3.561ThrAla: 3.561 ± 0.015
1.305ThrCys: 1.305 ± 0.013
2.509ThrAsp: 2.509 ± 0.011
3.61ThrGlu: 3.61 ± 0.014
2.23ThrPhe: 2.23 ± 0.012
3.597ThrGly: 3.597 ± 0.018
1.27ThrHis: 1.27 ± 0.01
2.604ThrIle: 2.604 ± 0.014
2.743ThrLys: 2.743 ± 0.015
5.412ThrLeu: 5.412 ± 0.021
1.153ThrMet: 1.153 ± 0.007
1.849ThrAsn: 1.849 ± 0.011
3.387ThrPro: 3.387 ± 0.021
2.29ThrGln: 2.29 ± 0.012
2.38ThrArg: 2.38 ± 0.013
4.66ThrSer: 4.66 ± 0.022
2.983ThrThr: 2.983 ± 0.019
3.834ThrVal: 3.834 ± 0.016
0.692ThrTrp: 0.692 ± 0.007
1.489ThrTyr: 1.489 ± 0.008
0.001ThrXaa: 0.001 ± 0.0
Val
3.963ValAla: 3.963 ± 0.019
1.436ValCys: 1.436 ± 0.01
2.872ValAsp: 2.872 ± 0.015
3.832ValGlu: 3.832 ± 0.018
2.468ValPhe: 2.468 ± 0.016
3.315ValGly: 3.315 ± 0.015
1.512ValHis: 1.512 ± 0.01
3.128ValIle: 3.128 ± 0.015
3.455ValLys: 3.455 ± 0.016
6.117ValLeu: 6.117 ± 0.025
1.346ValMet: 1.346 ± 0.011
2.399ValAsn: 2.399 ± 0.013
3.444ValPro: 3.444 ± 0.021
2.7ValGln: 2.7 ± 0.016
2.859ValArg: 2.859 ± 0.012
4.84ValSer: 4.84 ± 0.018
3.797ValThr: 3.797 ± 0.019
3.861ValVal: 3.861 ± 0.02
0.693ValTrp: 0.693 ± 0.008
1.665ValTyr: 1.665 ± 0.011
0.0ValXaa: 0.0 ± 0.0
Trp
0.735TrpAla: 0.735 ± 0.006
0.24TrpCys: 0.24 ± 0.004
0.662TrpAsp: 0.662 ± 0.007
0.814TrpGlu: 0.814 ± 0.007
0.435TrpPhe: 0.435 ± 0.005
0.738TrpGly: 0.738 ± 0.009
0.294TrpHis: 0.294 ± 0.004
0.589TrpIle: 0.589 ± 0.007
0.859TrpLys: 0.859 ± 0.007
1.205TrpLeu: 1.205 ± 0.009
0.317TrpMet: 0.317 ± 0.005
0.586TrpAsn: 0.586 ± 0.006
0.503TrpPro: 0.503 ± 0.006
0.523TrpGln: 0.523 ± 0.005
0.696TrpArg: 0.696 ± 0.007
0.878TrpSer: 0.878 ± 0.009
0.669TrpThr: 0.669 ± 0.007
0.675TrpVal: 0.675 ± 0.007
0.197TrpTrp: 0.197 ± 0.003
0.338TrpTyr: 0.338 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.382TyrAla: 1.382 ± 0.01
0.698TyrCys: 0.698 ± 0.007
1.352TyrAsp: 1.352 ± 0.01
1.781TyrGlu: 1.781 ± 0.012
1.32TyrPhe: 1.32 ± 0.01
1.705TyrGly: 1.705 ± 0.012
0.787TyrHis: 0.787 ± 0.008
1.462TyrIle: 1.462 ± 0.01
1.567TyrLys: 1.567 ± 0.015
2.798TyrLeu: 2.798 ± 0.015
0.626TyrMet: 0.626 ± 0.006
1.197TyrAsn: 1.197 ± 0.009
1.35TyrPro: 1.35 ± 0.011
1.314TyrGln: 1.314 ± 0.01
1.586TyrArg: 1.586 ± 0.011
2.26TyrSer: 2.26 ± 0.012
1.56TyrThr: 1.56 ± 0.011
1.627TyrVal: 1.627 ± 0.011
0.373TyrTrp: 0.373 ± 0.005
0.99TyrTyr: 0.99 ± 0.009
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.101XaaXaa: 0.101 ± 0.016
Statistics based on 31139 proteins (18770870 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski