Amino acid dipeptide frequency for Chloebia gouldiae (Gouldian finch) (Erythrura gouldiae)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
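As an illustration of what such a calculation involves, here is a minimal Python sketch. It assumes overlapping windows of two residues that never span two proteins, and it maps every letter outside the 20 standard one-letter codes to X (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the counting procedure actually used by the database may differ.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein strings."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never cross protein boundaries.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 ordered pairs, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Toy input (not real finch proteins); "U" is non-standard and becomes X.
toy = ["MKLLA", "MKUA"]
freqs = dipeptide_per_mille(toy)
print(len(freqs))            # 441
print(round(freqs["MK"], 3))
```

Reporting unobserved pairs as zero keeps the result the same shape as the 21 x 21 table on this page.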

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
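The uniform expectation and the over/under comparison can be sketched in a few lines of Python. The helper names `expected_per_mille` and `classify` are hypothetical, and using the uniform expectation as the exact colouring threshold is an assumption based on the paragraph above. The example values are copied from the table on this page.

```python
def expected_per_mille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Compare an observed frequency (per mille) with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"    # would be marked in red
    if freq_per_mille < expected:
        return "under"   # would be marked in blue
    return "as expected"


print(expected_per_mille(20))            # 2.5 (i.e. 0.25%)
print(round(expected_per_mille(21), 3))  # with Xaa as a 21st residue

# Observed values taken from the table (per mille).
examples = {"LeuLeu": 10.71, "AlaAla": 8.013, "TrpTrp": 0.225, "CysMet": 0.439}
for name, value in examples.items():
    print(name, value, classify(value))
```

A uniform baseline ignores the fact that single amino acids are themselves unequally common, so a rare residue such as Trp appears underrepresented in every pair.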

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
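For example, the unit conversion can be written as a short Python sketch. The helper names are hypothetical, and the value used is the LeuLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


leu_leu = 10.71  # LeuLeu frequency from the table, in per mille
print(f"{per_mille_to_fraction(leu_leu):.5f}")  # as a fraction
print(f"{per_mille_to_percent(leu_leu):.3f}%")  # as a percentage
```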

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 8.013 ± 0.056
AlaCys: 1.377 ± 0.017
AlaAsp: 2.954 ± 0.021
AlaGlu: 5.207 ± 0.037
AlaPhe: 2.568 ± 0.019
AlaGly: 5.618 ± 0.041
AlaHis: 1.485 ± 0.014
AlaIle: 2.709 ± 0.018
AlaLys: 3.529 ± 0.025
AlaLeu: 7.371 ± 0.035
AlaMet: 1.53 ± 0.014
AlaAsn: 2.002 ± 0.017
AlaPro: 4.278 ± 0.039
AlaGln: 3.197 ± 0.028
AlaArg: 3.824 ± 0.029
AlaSer: 5.438 ± 0.029
AlaThr: 3.377 ± 0.025
AlaVal: 5.218 ± 0.028
AlaTrp: 0.863 ± 0.013
AlaTyr: 1.434 ± 0.015
AlaXaa: 0.001 ± 0.0

Cys
CysAla: 1.369 ± 0.015
CysCys: 0.739 ± 0.011
CysAsp: 1.0 ± 0.016
CysGlu: 1.232 ± 0.018
CysPhe: 0.89 ± 0.012
CysGly: 1.626 ± 0.021
CysHis: 0.742 ± 0.011
CysIle: 0.98 ± 0.012
CysLys: 1.186 ± 0.014
CysLeu: 2.259 ± 0.018
CysMet: 0.439 ± 0.007
CysAsn: 0.788 ± 0.011
CysPro: 1.715 ± 0.032
CysGln: 1.151 ± 0.017
CysArg: 1.429 ± 0.017
CysSer: 2.216 ± 0.026
CysThr: 1.145 ± 0.014
CysVal: 1.36 ± 0.018
CysTrp: 0.366 ± 0.008
CysTyr: 0.593 ± 0.01
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.863 ± 0.022
AspCys: 1.037 ± 0.015
AspAsp: 2.496 ± 0.025
AspGlu: 3.386 ± 0.026
AspPhe: 2.087 ± 0.016
AspGly: 3.309 ± 0.027
AspHis: 1.058 ± 0.012
AspIle: 2.565 ± 0.021
AspLys: 2.527 ± 0.023
AspLeu: 4.697 ± 0.028
AspMet: 1.107 ± 0.013
AspAsn: 1.708 ± 0.014
AspPro: 2.841 ± 0.024
AspGln: 1.688 ± 0.018
AspArg: 2.364 ± 0.023
AspSer: 4.055 ± 0.03
AspThr: 2.522 ± 0.017
AspVal: 3.05 ± 0.026
AspTrp: 0.654 ± 0.011
AspTyr: 1.399 ± 0.013
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.941 ± 0.038
GluCys: 1.261 ± 0.019
GluAsp: 4.213 ± 0.028
GluGlu: 8.02 ± 0.074
GluPhe: 2.074 ± 0.016
GluGly: 4.251 ± 0.028
GluHis: 1.561 ± 0.016
GluIle: 3.159 ± 0.023
GluLys: 5.284 ± 0.053
GluLeu: 6.544 ± 0.052
GluMet: 1.753 ± 0.019
GluAsn: 3.104 ± 0.024
GluPro: 3.182 ± 0.026
GluGln: 3.344 ± 0.03
GluArg: 4.074 ± 0.037
GluSer: 4.37 ± 0.033
GluThr: 3.262 ± 0.025
GluVal: 4.126 ± 0.029
GluTrp: 0.743 ± 0.011
GluTyr: 1.571 ± 0.016
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.058 ± 0.017
PheCys: 0.963 ± 0.012
PheAsp: 1.594 ± 0.014
PheGlu: 1.923 ± 0.016
PhePhe: 1.588 ± 0.018
PheGly: 2.394 ± 0.028
PheHis: 1.056 ± 0.013
PheIle: 1.793 ± 0.018
PheLys: 1.718 ± 0.017
PheLeu: 4.048 ± 0.027
PheMet: 0.752 ± 0.01
PheAsn: 1.267 ± 0.012
PhePro: 2.195 ± 0.02
PheGln: 1.793 ± 0.013
PheArg: 1.851 ± 0.017
PheSer: 3.327 ± 0.022
PheThr: 1.957 ± 0.018
PheVal: 2.172 ± 0.018
PheTrp: 0.579 ± 0.012
PheTyr: 1.144 ± 0.013
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.004 ± 0.04
GlyCys: 1.548 ± 0.02
GlyAsp: 3.287 ± 0.025
GlyGlu: 4.131 ± 0.029
GlyPhe: 2.524 ± 0.026
GlyGly: 5.377 ± 0.057
GlyHis: 1.809 ± 0.017
GlyIle: 3.028 ± 0.029
GlyLys: 3.863 ± 0.024
GlyLeu: 5.751 ± 0.035
GlyMet: 1.505 ± 0.018
GlyAsn: 2.445 ± 0.022
GlyPro: 3.596 ± 0.044
GlyGln: 2.871 ± 0.024
GlyArg: 4.092 ± 0.034
GlySer: 6.146 ± 0.042
GlyThr: 3.935 ± 0.028
GlyVal: 3.893 ± 0.034
GlyTrp: 1.055 ± 0.017
GlyTyr: 1.742 ± 0.021
GlyXaa: 0.002 ± 0.0

His
HisAla: 1.34 ± 0.014
HisCys: 0.737 ± 0.012
HisAsp: 0.864 ± 0.01
HisGlu: 1.292 ± 0.013
HisPhe: 1.05 ± 0.012
HisGly: 1.753 ± 0.017
HisHis: 0.911 ± 0.015
HisIle: 1.212 ± 0.013
HisLys: 1.276 ± 0.013
HisLeu: 2.848 ± 0.023
HisMet: 0.572 ± 0.009
HisAsn: 0.884 ± 0.012
HisPro: 1.764 ± 0.019
HisGln: 1.204 ± 0.013
HisArg: 1.603 ± 0.015
HisSer: 2.262 ± 0.018
HisThr: 1.275 ± 0.013
HisVal: 1.459 ± 0.014
HisTrp: 0.385 ± 0.007
HisTyr: 0.766 ± 0.011
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.639 ± 0.021
IleCys: 1.075 ± 0.013
IleAsp: 1.954 ± 0.019
IleGlu: 2.474 ± 0.021
IlePhe: 1.883 ± 0.02
IleGly: 2.193 ± 0.024
IleHis: 1.221 ± 0.013
IleIle: 2.295 ± 0.024
IleLys: 2.585 ± 0.022
IleLeu: 4.446 ± 0.026
IleMet: 0.979 ± 0.011
IleAsn: 1.802 ± 0.015
IlePro: 2.917 ± 0.027
IleGln: 2.22 ± 0.019
IleArg: 2.267 ± 0.018
IleSer: 3.739 ± 0.026
IleThr: 2.498 ± 0.022
IleVal: 2.492 ± 0.022
IleTrp: 0.552 ± 0.009
IleTyr: 1.329 ± 0.015
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.975 ± 0.026
LysCys: 1.107 ± 0.014
LysAsp: 3.121 ± 0.027
LysGlu: 5.135 ± 0.049
LysPhe: 1.685 ± 0.016
LysGly: 3.32 ± 0.032
LysHis: 1.418 ± 0.018
LysIle: 2.776 ± 0.021
LysLys: 4.73 ± 0.041
LysLeu: 5.109 ± 0.035
LysMet: 1.465 ± 0.015
LysAsn: 2.377 ± 0.021
LysPro: 2.914 ± 0.026
LysGln: 2.707 ± 0.023
LysArg: 3.322 ± 0.024
LysSer: 3.932 ± 0.031
LysThr: 2.977 ± 0.022
LysVal: 3.419 ± 0.025
LysTrp: 0.63 ± 0.009
LysTyr: 1.582 ± 0.018
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.74 ± 0.04
LeuCys: 2.448 ± 0.021
LeuAsp: 4.464 ± 0.027
LeuGlu: 7.025 ± 0.053
LeuPhe: 3.347 ± 0.027
LeuGly: 6.176 ± 0.044
LeuHis: 2.704 ± 0.019
LeuIle: 3.697 ± 0.028
LeuLys: 5.722 ± 0.04
LeuLeu: 10.71 ± 0.058
LeuMet: 2.006 ± 0.017
LeuAsn: 3.388 ± 0.026
LeuPro: 6.151 ± 0.038
LeuGln: 5.796 ± 0.041
LeuArg: 5.821 ± 0.035
LeuSer: 7.993 ± 0.035
LeuThr: 4.648 ± 0.027
LeuVal: 5.295 ± 0.03
LeuTrp: 1.244 ± 0.013
LeuTyr: 2.491 ± 0.02
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.887 ± 0.017
MetCys: 0.431 ± 0.008
MetAsp: 1.266 ± 0.013
MetGlu: 1.907 ± 0.02
MetPhe: 0.759 ± 0.01
MetGly: 1.438 ± 0.018
MetHis: 0.465 ± 0.008
MetIle: 0.839 ± 0.009
MetLys: 1.457 ± 0.017
MetLeu: 1.967 ± 0.017
MetMet: 0.576 ± 0.01
MetAsn: 0.867 ± 0.011
MetPro: 1.079 ± 0.013
MetGln: 0.982 ± 0.012
MetArg: 1.066 ± 0.013
MetSer: 1.604 ± 0.014
MetThr: 1.041 ± 0.013
MetVal: 1.36 ± 0.014
MetTrp: 0.271 ± 0.006
MetTyr: 0.552 ± 0.01
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.139 ± 0.017
AsnCys: 0.87 ± 0.014
AsnAsp: 1.432 ± 0.015
AsnGlu: 2.215 ± 0.021
AsnPhe: 1.436 ± 0.014
AsnGly: 2.474 ± 0.025
AsnHis: 0.874 ± 0.011
AsnIle: 2.076 ± 0.02
AsnLys: 2.19 ± 0.019
AsnLeu: 3.579 ± 0.025
AsnMet: 0.894 ± 0.012
AsnAsn: 1.554 ± 0.018
AsnPro: 2.198 ± 0.021
AsnGln: 1.501 ± 0.017
AsnArg: 1.8 ± 0.015
AsnSer: 3.12 ± 0.026
AsnThr: 1.946 ± 0.018
AsnVal: 2.191 ± 0.019
AsnTrp: 0.454 ± 0.008
AsnTyr: 1.083 ± 0.012
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.444 ± 0.045
ProCys: 1.416 ± 0.021
ProAsp: 2.621 ± 0.02
ProGlu: 4.289 ± 0.03
ProPhe: 1.962 ± 0.018
ProGly: 5.475 ± 0.066
ProHis: 1.463 ± 0.018
ProIle: 1.822 ± 0.018
ProLys: 2.77 ± 0.029
ProLeu: 5.211 ± 0.036
ProMet: 0.978 ± 0.012
ProAsn: 1.75 ± 0.016
ProPro: 5.636 ± 0.065
ProGln: 2.768 ± 0.025
ProArg: 3.572 ± 0.032
ProSer: 5.582 ± 0.04
ProThr: 2.749 ± 0.022
ProVal: 3.965 ± 0.029
ProTrp: 0.823 ± 0.016
ProTyr: 1.345 ± 0.016
ProXaa: 0.001 ± 0.0

Gln
GlnAla: 3.267 ± 0.026
GlnCys: 0.982 ± 0.016
GlnAsp: 2.283 ± 0.018
GlnGlu: 3.798 ± 0.034
GlnPhe: 1.345 ± 0.015
GlnGly: 2.939 ± 0.026
GlnHis: 1.383 ± 0.014
GlnIle: 2.001 ± 0.018
GlnLys: 2.965 ± 0.024
GlnLeu: 4.729 ± 0.036
GlnMet: 1.088 ± 0.013
GlnAsn: 1.882 ± 0.018
GlnPro: 2.758 ± 0.024
GlnGln: 3.079 ± 0.043
GlnArg: 2.941 ± 0.022
GlnSer: 3.176 ± 0.024
GlnThr: 2.209 ± 0.017
GlnVal: 2.805 ± 0.022
GlnTrp: 0.556 ± 0.01
GlnTyr: 1.121 ± 0.012
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.235 ± 0.033
ArgCys: 1.384 ± 0.022
ArgAsp: 2.891 ± 0.025
ArgGlu: 4.025 ± 0.035
ArgPhe: 1.874 ± 0.018
ArgGly: 4.044 ± 0.041
ArgHis: 1.498 ± 0.016
ArgIle: 2.388 ± 0.018
ArgLys: 3.565 ± 0.024
ArgLeu: 5.187 ± 0.033
ArgMet: 1.233 ± 0.012
ArgAsn: 2.088 ± 0.015
ArgPro: 2.971 ± 0.027
ArgGln: 2.61 ± 0.022
ArgArg: 4.519 ± 0.042
ArgSer: 4.395 ± 0.034
ArgThr: 2.689 ± 0.018
ArgVal: 3.138 ± 0.021
ArgTrp: 0.761 ± 0.01
ArgTyr: 1.449 ± 0.013
ArgXaa: 0.001 ± 0.0

Ser
SerAla: 5.661 ± 0.032
SerCys: 2.08 ± 0.022
SerAsp: 3.768 ± 0.026
SerGlu: 4.947 ± 0.033
SerPhe: 2.983 ± 0.02
SerGly: 5.51 ± 0.035
SerHis: 2.108 ± 0.02
SerIle: 3.239 ± 0.023
SerLys: 4.089 ± 0.029
SerLeu: 8.162 ± 0.039
SerMet: 1.613 ± 0.016
SerAsn: 2.63 ± 0.022
SerPro: 6.157 ± 0.048
SerGln: 3.786 ± 0.028
SerArg: 4.627 ± 0.037
SerSer: 9.551 ± 0.074
SerThr: 4.352 ± 0.036
SerVal: 4.934 ± 0.027
SerTrp: 1.139 ± 0.014
SerTyr: 2.013 ± 0.019
SerXaa: 0.001 ± 0.0

Thr
ThrAla: 3.989 ± 0.027
ThrCys: 1.279 ± 0.022
ThrAsp: 2.477 ± 0.018
ThrGlu: 3.528 ± 0.028
ThrPhe: 2.014 ± 0.016
ThrGly: 3.577 ± 0.028
ThrHis: 1.125 ± 0.012
ThrIle: 2.251 ± 0.019
ThrLys: 2.58 ± 0.022
ThrLeu: 4.923 ± 0.026
ThrMet: 1.072 ± 0.013
ThrAsn: 1.686 ± 0.015
ThrPro: 3.338 ± 0.032
ThrGln: 2.069 ± 0.019
ThrArg: 2.386 ± 0.018
ThrSer: 4.367 ± 0.034
ThrThr: 2.742 ± 0.032
ThrVal: 3.738 ± 0.027
ThrTrp: 0.72 ± 0.012
ThrTyr: 1.334 ± 0.013
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.208 ± 0.03
ValCys: 1.532 ± 0.019
ValAsp: 2.722 ± 0.022
ValGlu: 3.785 ± 0.027
ValPhe: 2.422 ± 0.021
ValGly: 3.473 ± 0.03
ValHis: 1.511 ± 0.015
ValIle: 2.789 ± 0.022
ValLys: 3.383 ± 0.025
ValLeu: 6.447 ± 0.033
ValMet: 1.329 ± 0.013
ValAsn: 2.168 ± 0.02
ValPro: 4.13 ± 0.039
ValGln: 2.782 ± 0.023
ValArg: 3.04 ± 0.019
ValSer: 4.948 ± 0.029
ValThr: 3.811 ± 0.032
ValVal: 3.993 ± 0.031
ValTrp: 0.801 ± 0.012
ValTyr: 1.592 ± 0.015
ValXaa: 0.001 ± 0.0

Trp
TrpAla: 0.855 ± 0.013
TrpCys: 0.294 ± 0.007
TrpAsp: 0.789 ± 0.015
TrpGlu: 0.912 ± 0.012
TrpPhe: 0.466 ± 0.009
TrpGly: 1.008 ± 0.022
TrpHis: 0.356 ± 0.007
TrpIle: 0.6 ± 0.01
TrpLys: 0.853 ± 0.011
TrpLeu: 1.284 ± 0.016
TrpMet: 0.333 ± 0.007
TrpAsn: 0.558 ± 0.009
TrpPro: 0.56 ± 0.01
TrpGln: 0.6 ± 0.009
TrpArg: 0.78 ± 0.011
TrpSer: 0.953 ± 0.015
TrpThr: 0.649 ± 0.01
TrpVal: 0.763 ± 0.011
TrpTrp: 0.225 ± 0.006
TrpTyr: 0.345 ± 0.007
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.363 ± 0.013
TyrCys: 0.664 ± 0.01
TyrAsp: 1.217 ± 0.014
TyrGlu: 1.606 ± 0.016
TyrPhe: 1.215 ± 0.013
TyrGly: 1.65 ± 0.018
TyrHis: 0.707 ± 0.01
TyrIle: 1.341 ± 0.014
TyrLys: 1.43 ± 0.017
TyrLeu: 2.597 ± 0.022
TyrMet: 0.589 ± 0.009
TyrAsn: 1.081 ± 0.013
TyrPro: 1.261 ± 0.014
TyrGln: 1.159 ± 0.015
TyrArg: 1.564 ± 0.017
TyrSer: 2.176 ± 0.019
TyrThr: 1.407 ± 0.016
TyrVal: 1.496 ± 0.015
TyrTrp: 0.362 ± 0.01
TyrTyr: 0.918 ± 0.013
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.001 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.024 ± 0.003

Statistics based on 18274 proteins (8488823 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
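A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the database's own code: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The function names, the fixed seed and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the spread across replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not real finch proteins.
toy = ["MKLLA", "LLKLL", "AAKLM", "MLLLK"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residue pairs keeps the pairs within one protein together, so correlations inside a sequence (for example repeats) are reflected in the error.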

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski