Amino acid dipeptide frequency for Taeniopygia guttata (Zebra finch) (Poephila guttata)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
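As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping residue pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice of overlapping windows are all assumptions made for illustration; the page does not describe the exact counting procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille (hypothetical helper).

    Assumption: pairs are counted with overlapping windows of length 2,
    and a pair never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real zebra finch proteins), one-letter amino acid codes.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

With real data, the sequences would come from the proteome FASTA file, and the resulting values should sum to 1000 ‰ across all observed dipeptides.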

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
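The comparison against the uniform expectation can be sketched in a few lines of Python. The observed values are copied from the table on this page; the function name `classify` and the strict greater-than/less-than rule are assumptions, since the page does not state the exact criterion behind the red and blue marking.

```python
# Uniform expectation: 20 x 20 = 400 equally likely dipeptides.
N_AMINO_ACIDS = 20
EXPECTED_PER_MILLE = 1000.0 / N_AMINO_ACIDS ** 2  # 2.5 per mille = 0.25%

# A few observed values (per mille) taken from the table on this page.
observed = {
    "LeuLeu": 10.277,
    "SerSer": 9.402,
    "AspThr": 2.549,
    "MetMet": 0.556,
    "TrpTrp": 0.198,
}


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


labels = {pair: classify(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(f"{pair}: {observed[pair]} per mille, {label}")
```

A uniform baseline ignores the fact that single amino acids differ in abundance, so rare residues such as Trp will tend to appear underrepresented in every pair.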

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
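The unit conversion is simple arithmetic; a small Python sketch, using the AlaAla value from the table, may still help avoid off-by-ten mistakes between per mille, percent, and fraction. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 8.06  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"AlaAla: {ala_ala} per mille = {percent:.3f}% = fraction {fraction:.5f}")
```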

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.06 ± 0.047
AlaCys: 1.305 ± 0.01
AlaAsp: 2.997 ± 0.016
AlaGlu: 5.245 ± 0.025
AlaPhe: 2.444 ± 0.012
AlaGly: 5.285 ± 0.026
AlaHis: 1.434 ± 0.011
AlaIle: 2.728 ± 0.013
AlaLys: 3.53 ± 0.018
AlaLeu: 7.151 ± 0.03
AlaMet: 1.521 ± 0.009
AlaAsn: 2.088 ± 0.012
AlaPro: 4.255 ± 0.025
AlaGln: 3.143 ± 0.017
AlaArg: 3.848 ± 0.018
AlaSer: 5.31 ± 0.019
AlaThr: 3.39 ± 0.02
AlaVal: 5.102 ± 0.018
AlaTrp: 0.791 ± 0.007
AlaTyr: 1.433 ± 0.008
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.326 ± 0.01
CysCys: 0.661 ± 0.008
CysAsp: 1.016 ± 0.01
CysGlu: 1.263 ± 0.012
CysPhe: 0.83 ± 0.006
CysGly: 1.58 ± 0.015
CysHis: 0.72 ± 0.009
CysIle: 0.999 ± 0.01
CysLys: 1.15 ± 0.009
CysLeu: 2.092 ± 0.011
CysMet: 0.413 ± 0.004
CysAsn: 0.798 ± 0.007
CysPro: 1.652 ± 0.018
CysGln: 1.111 ± 0.01
CysArg: 1.37 ± 0.011
CysSer: 2.074 ± 0.013
CysThr: 1.12 ± 0.009
CysVal: 1.304 ± 0.011
CysTrp: 0.304 ± 0.004
CysTyr: 0.586 ± 0.005
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.909 ± 0.013
AspCys: 1.051 ± 0.01
AspAsp: 2.634 ± 0.016
AspGlu: 3.553 ± 0.018
AspPhe: 2.121 ± 0.013
AspGly: 3.403 ± 0.019
AspHis: 1.084 ± 0.008
AspIle: 2.682 ± 0.016
AspLys: 2.617 ± 0.013
AspLeu: 4.782 ± 0.017
AspMet: 1.105 ± 0.009
AspAsn: 1.756 ± 0.012
AspPro: 2.808 ± 0.015
AspGln: 1.726 ± 0.011
AspArg: 2.381 ± 0.014
AspSer: 4.155 ± 0.018
AspThr: 2.549 ± 0.014
AspVal: 3.105 ± 0.018
AspTrp: 0.623 ± 0.006
AspTyr: 1.468 ± 0.01
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.946 ± 0.024
GluCys: 1.348 ± 0.014
GluAsp: 4.332 ± 0.02
GluGlu: 8.179 ± 0.05
GluPhe: 2.097 ± 0.013
GluGly: 4.195 ± 0.02
GluHis: 1.561 ± 0.01
GluIle: 3.278 ± 0.016
GluLys: 5.413 ± 0.034
GluLeu: 6.647 ± 0.033
GluMet: 1.742 ± 0.011
GluAsn: 3.226 ± 0.018
GluPro: 3.145 ± 0.018
GluGln: 3.366 ± 0.022
GluArg: 4.194 ± 0.024
GluSer: 4.444 ± 0.019
GluThr: 3.387 ± 0.018
GluVal: 4.118 ± 0.019
GluTrp: 0.719 ± 0.006
GluTyr: 1.652 ± 0.011
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.007 ± 0.011
PheCys: 0.914 ± 0.007
PheAsp: 1.679 ± 0.01
PheGlu: 1.969 ± 0.011
PhePhe: 1.599 ± 0.014
PheGly: 2.332 ± 0.015
PheHis: 1.021 ± 0.008
PheIle: 1.775 ± 0.013
PheLys: 1.744 ± 0.011
PheLeu: 3.914 ± 0.022
PheMet: 0.746 ± 0.006
PheAsn: 1.315 ± 0.008
PhePro: 2.206 ± 0.024
PheGln: 1.76 ± 0.009
PheArg: 1.862 ± 0.01
PheSer: 3.272 ± 0.019
PheThr: 1.963 ± 0.012
PheVal: 2.145 ± 0.012
PheTrp: 0.535 ± 0.007
PheTyr: 1.14 ± 0.008
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.879 ± 0.025
GlyCys: 1.396 ± 0.011
GlyAsp: 3.332 ± 0.017
GlyGlu: 4.049 ± 0.021
GlyPhe: 2.463 ± 0.018
GlyGly: 5.525 ± 0.042
GlyHis: 1.817 ± 0.018
GlyIle: 2.912 ± 0.019
GlyLys: 3.797 ± 0.017
GlyLeu: 5.51 ± 0.026
GlyMet: 1.408 ± 0.011
GlyAsn: 2.479 ± 0.016
GlyPro: 3.711 ± 0.032
GlyGln: 2.782 ± 0.018
GlyArg: 4.084 ± 0.02
GlySer: 5.998 ± 0.03
GlyThr: 3.981 ± 0.023
GlyVal: 3.745 ± 0.02
GlyTrp: 0.913 ± 0.009
GlyTyr: 1.771 ± 0.012
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.337 ± 0.009
HisCys: 0.716 ± 0.008
HisAsp: 0.883 ± 0.007
HisGlu: 1.344 ± 0.008
HisPhe: 1.059 ± 0.01
HisGly: 1.674 ± 0.012
HisHis: 0.879 ± 0.01
HisIle: 1.241 ± 0.008
HisLys: 1.273 ± 0.009
HisLeu: 2.764 ± 0.019
HisMet: 0.552 ± 0.005
HisAsn: 0.9 ± 0.008
HisPro: 1.742 ± 0.02
HisGln: 1.198 ± 0.011
HisArg: 1.617 ± 0.012
HisSer: 2.263 ± 0.018
HisThr: 1.321 ± 0.012
HisVal: 1.46 ± 0.01
HisTrp: 0.359 ± 0.007
HisTyr: 0.788 ± 0.007
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.656 ± 0.015
IleCys: 1.057 ± 0.009
IleAsp: 2.087 ± 0.013
IleGlu: 2.61 ± 0.013
IlePhe: 1.874 ± 0.015
IleGly: 2.239 ± 0.014
IleHis: 1.267 ± 0.019
IleIle: 2.326 ± 0.017
IleLys: 2.668 ± 0.015
IleLeu: 4.414 ± 0.022
IleMet: 0.972 ± 0.007
IleAsn: 1.914 ± 0.011
IlePro: 2.878 ± 0.02
IleGln: 2.282 ± 0.013
IleArg: 2.368 ± 0.012
IleSer: 3.73 ± 0.017
IleThr: 2.552 ± 0.014
IleVal: 2.525 ± 0.016
IleTrp: 0.518 ± 0.007
IleTyr: 1.352 ± 0.011
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.914 ± 0.019
LysCys: 1.108 ± 0.01
LysAsp: 3.187 ± 0.017
LysGlu: 5.177 ± 0.03
LysPhe: 1.727 ± 0.013
LysGly: 3.228 ± 0.021
LysHis: 1.403 ± 0.01
LysIle: 2.865 ± 0.017
LysLys: 4.801 ± 0.031
LysLeu: 5.225 ± 0.023
LysMet: 1.464 ± 0.01
LysAsn: 2.499 ± 0.014
LysPro: 2.942 ± 0.016
LysGln: 2.767 ± 0.017
LysArg: 3.293 ± 0.016
LysSer: 4.045 ± 0.02
LysThr: 3.057 ± 0.016
LysVal: 3.445 ± 0.016
LysTrp: 0.627 ± 0.007
LysTyr: 1.617 ± 0.01
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.484 ± 0.025
LeuCys: 2.251 ± 0.014
LeuAsp: 4.573 ± 0.018
LeuGlu: 7.16 ± 0.032
LeuPhe: 3.251 ± 0.018
LeuGly: 5.806 ± 0.027
LeuHis: 2.65 ± 0.014
LeuIle: 3.705 ± 0.019
LeuLys: 5.801 ± 0.027
LeuLeu: 10.277 ± 0.039
LeuMet: 1.912 ± 0.011
LeuAsn: 3.488 ± 0.015
LeuPro: 5.921 ± 0.025
LeuGln: 5.747 ± 0.024
LeuArg: 5.887 ± 0.024
LeuSer: 7.788 ± 0.024
LeuThr: 4.607 ± 0.018
LeuVal: 5.151 ± 0.02
LeuTrp: 1.119 ± 0.009
LeuTyr: 2.491 ± 0.015
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.799 ± 0.01
MetCys: 0.411 ± 0.005
MetAsp: 1.263 ± 0.009
MetGlu: 1.891 ± 0.011
MetPhe: 0.743 ± 0.006
MetGly: 1.318 ± 0.01
MetHis: 0.472 ± 0.005
MetIle: 0.827 ± 0.007
MetLys: 1.477 ± 0.009
MetLeu: 1.881 ± 0.01
MetMet: 0.556 ± 0.005
MetAsn: 0.898 ± 0.007
MetPro: 1.013 ± 0.01
MetGln: 0.985 ± 0.009
MetArg: 1.041 ± 0.007
MetSer: 1.611 ± 0.014
MetThr: 1.058 ± 0.008
MetVal: 1.331 ± 0.008
MetTrp: 0.248 ± 0.004
MetTyr: 0.555 ± 0.005
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.201 ± 0.011
AsnCys: 0.864 ± 0.008
AsnAsp: 1.55 ± 0.012
AsnGlu: 2.357 ± 0.014
AsnPhe: 1.508 ± 0.011
AsnGly: 2.629 ± 0.016
AsnHis: 0.899 ± 0.007
AsnIle: 2.158 ± 0.013
AsnLys: 2.264 ± 0.012
AsnLeu: 3.697 ± 0.018
AsnMet: 0.912 ± 0.006
AsnAsn: 1.62 ± 0.01
AsnPro: 2.248 ± 0.014
AsnGln: 1.6 ± 0.011
AsnArg: 1.826 ± 0.011
AsnSer: 3.226 ± 0.017
AsnThr: 2.038 ± 0.012
AsnVal: 2.259 ± 0.012
AsnTrp: 0.456 ± 0.006
AsnTyr: 1.122 ± 0.008
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.309 ± 0.028
ProCys: 1.314 ± 0.014
ProAsp: 2.645 ± 0.015
ProGlu: 4.29 ± 0.022
ProPhe: 1.981 ± 0.018
ProGly: 5.444 ± 0.046
ProHis: 1.49 ± 0.013
ProIle: 1.89 ± 0.015
ProLys: 2.872 ± 0.023
ProLeu: 5.074 ± 0.025
ProMet: 0.988 ± 0.011
ProAsn: 1.903 ± 0.015
ProPro: 6.046 ± 0.054
ProGln: 2.864 ± 0.023
ProArg: 3.542 ± 0.024
ProSer: 5.599 ± 0.032
ProThr: 2.844 ± 0.022
ProVal: 4.023 ± 0.02
ProTrp: 0.695 ± 0.007
ProTyr: 1.36 ± 0.011
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.233 ± 0.018
GlnCys: 1.0 ± 0.01
GlnAsp: 2.244 ± 0.011
GlnGlu: 3.81 ± 0.025
GlnPhe: 1.412 ± 0.01
GlnGly: 2.766 ± 0.017
GlnHis: 1.36 ± 0.011
GlnIle: 2.052 ± 0.013
GlnLys: 2.986 ± 0.018
GlnLeu: 4.716 ± 0.019
GlnMet: 1.096 ± 0.008
GlnAsn: 1.972 ± 0.015
GlnPro: 2.743 ± 0.02
GlnGln: 3.183 ± 0.032
GlnArg: 2.949 ± 0.016
GlnSer: 3.219 ± 0.019
GlnThr: 2.256 ± 0.013
GlnVal: 2.74 ± 0.016
GlnTrp: 0.542 ± 0.006
GlnTyr: 1.183 ± 0.009
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.171 ± 0.021
ArgCys: 1.371 ± 0.012
ArgAsp: 2.899 ± 0.016
ArgGlu: 4.005 ± 0.023
ArgPhe: 1.874 ± 0.012
ArgGly: 4.001 ± 0.025
ArgHis: 1.537 ± 0.01
ArgIle: 2.423 ± 0.012
ArgLys: 3.551 ± 0.016
ArgLeu: 5.179 ± 0.02
ArgMet: 1.187 ± 0.008
ArgAsn: 2.137 ± 0.011
ArgPro: 3.144 ± 0.02
ArgGln: 2.605 ± 0.014
ArgArg: 4.649 ± 0.026
ArgSer: 4.438 ± 0.024
ArgThr: 2.732 ± 0.012
ArgVal: 3.108 ± 0.014
ArgTrp: 0.727 ± 0.007
ArgTyr: 1.474 ± 0.009
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.597 ± 0.021
SerCys: 1.93 ± 0.012
SerAsp: 3.871 ± 0.02
SerGlu: 5.064 ± 0.021
SerPhe: 2.953 ± 0.017
SerGly: 5.51 ± 0.027
SerHis: 2.068 ± 0.015
SerIle: 3.293 ± 0.021
SerLys: 4.118 ± 0.02
SerLeu: 7.953 ± 0.027
SerMet: 1.581 ± 0.011
SerAsn: 2.745 ± 0.013
SerPro: 6.257 ± 0.037
SerGln: 3.758 ± 0.018
SerArg: 4.545 ± 0.026
SerSer: 9.402 ± 0.053
SerThr: 4.394 ± 0.021
SerVal: 4.94 ± 0.02
SerTrp: 1.022 ± 0.009
SerTyr: 2.028 ± 0.013
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.974 ± 0.016
ThrCys: 1.267 ± 0.012
ThrAsp: 2.544 ± 0.012
ThrGlu: 3.649 ± 0.018
ThrPhe: 1.98 ± 0.011
ThrGly: 3.62 ± 0.023
ThrHis: 1.16 ± 0.011
ThrIle: 2.316 ± 0.015
ThrLys: 2.621 ± 0.015
ThrLeu: 4.9 ± 0.019
ThrMet: 1.055 ± 0.008
ThrAsn: 1.768 ± 0.012
ThrPro: 3.539 ± 0.031
ThrGln: 2.088 ± 0.012
ThrArg: 2.375 ± 0.013
ThrSer: 4.423 ± 0.024
ThrThr: 2.822 ± 0.028
ThrVal: 3.836 ± 0.02
ThrTrp: 0.695 ± 0.008
ThrTyr: 1.366 ± 0.008
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.132 ± 0.017
ValCys: 1.49 ± 0.011
ValAsp: 2.781 ± 0.017
ValGlu: 3.836 ± 0.017
ValPhe: 2.364 ± 0.014
ValGly: 3.362 ± 0.017
ValHis: 1.509 ± 0.013
ValIle: 2.841 ± 0.017
ValLys: 3.434 ± 0.012
ValLeu: 6.154 ± 0.02
ValMet: 1.272 ± 0.008
ValAsn: 2.258 ± 0.013
ValPro: 4.151 ± 0.025
ValGln: 2.762 ± 0.014
ValArg: 3.071 ± 0.015
ValSer: 4.938 ± 0.023
ValThr: 3.849 ± 0.024
ValVal: 3.882 ± 0.017
ValTrp: 0.725 ± 0.008
ValTyr: 1.586 ± 0.011
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.768 ± 0.007
TrpCys: 0.257 ± 0.004
TrpAsp: 0.707 ± 0.008
TrpGlu: 0.833 ± 0.008
TrpPhe: 0.435 ± 0.006
TrpGly: 0.949 ± 0.015
TrpHis: 0.322 ± 0.004
TrpIle: 0.579 ± 0.006
TrpLys: 0.783 ± 0.007
TrpLeu: 1.19 ± 0.008
TrpMet: 0.309 ± 0.004
TrpAsn: 0.545 ± 0.006
TrpPro: 0.512 ± 0.005
TrpGln: 0.536 ± 0.005
TrpArg: 0.721 ± 0.006
TrpSer: 0.904 ± 0.008
TrpThr: 0.63 ± 0.006
TrpVal: 0.686 ± 0.007
TrpTrp: 0.198 ± 0.003
TrpTyr: 0.332 ± 0.004
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.355 ± 0.008
TyrCys: 0.659 ± 0.006
TyrAsp: 1.289 ± 0.011
TyrGlu: 1.706 ± 0.01
TyrPhe: 1.182 ± 0.01
TyrGly: 1.686 ± 0.011
TyrHis: 0.717 ± 0.007
TyrIle: 1.399 ± 0.01
TyrLys: 1.493 ± 0.012
TyrLeu: 2.548 ± 0.014
TyrMet: 0.588 ± 0.005
TyrAsn: 1.124 ± 0.008
TyrPro: 1.262 ± 0.01
TyrGln: 1.199 ± 0.008
TyrArg: 1.587 ± 0.011
TyrSer: 2.19 ± 0.013
TyrThr: 1.447 ± 0.01
TyrVal: 1.494 ± 0.009
TyrTrp: 0.378 ± 0.008
TyrTyr: 0.907 ± 0.008
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.014 ± 0.005
Statistics based on 31342 proteins (21258392 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
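Protein-level bootstrapping can be illustrated with the following Python sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions; the page only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein (assumed scheme)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so residues
    from the same protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real zebra finch proteins), one-letter amino acid codes.
toy = ["MKKLA", "AKK", "LLKK", "GAVLK"]
mean, err = bootstrap_error(toy, "KK")
print(f"KK: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps correlated positions within a protein in the same replicate, which generally gives a more realistic error than treating every dipeptide as independent.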

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski