Amino acid dipeptide frequency for Cathartes aura (Turkey vulture) (Vultur aura)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
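As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, the input uses one-letter codes rather than the three-letter codes shown in the table, and mapping of non-standard residues to Xaa is left out. It is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein;
        # pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the Cathartes aura proteome.
toy = ["MAAL", "AAK"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

Counting per protein (rather than on one concatenated string) avoids creating artificial dipeptides at protein boundaries.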

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
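The baseline can be checked with a short sketch that compares a few values from the table against the uniform expectation of 1/400. The names `EXPECTED_PER_MILLE` and `classify` are hypothetical, and the plain greater-than/less-than comparison is an assumption; the exact rule used to colour the table cells is not stated here.

```python
# Uniform expectation: 1 of 400 dipeptides = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
sample = {
    "LeuLeu": 9.895,
    "SerSer": 9.093,
    "AlaAla": 5.36,
    "CysMet": 0.478,
    "TrpTrp": 0.207,
}

for pair, value in sample.items():
    ratio = value / EXPECTED_PER_MILLE
    print(f"{pair}: {value} per mille, {ratio:.2f}x expected, {classify(value)}")
```

Note that this baseline ignores the unequal abundance of the individual amino acids, so a rare residue such as Trp will appear underrepresented in almost every pair.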

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
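A minimal sketch of that unit conversion, using hypothetical helper names and the LeuLeu value from the table as the example:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 9.895  # LeuLeu, per mille, from the table
print(per_mille_to_fraction(leu_leu))  # roughly 0.0099
print(per_mille_to_percent(leu_leu))   # roughly 0.99 %
```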

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.36 ± 0.062
AlaCys: 1.324 ± 0.023
AlaAsp: 2.979 ± 0.031
AlaGlu: 4.482 ± 0.043
AlaPhe: 2.617 ± 0.028
AlaGly: 3.807 ± 0.049
AlaHis: 1.349 ± 0.023
AlaIle: 3.182 ± 0.036
AlaLys: 3.713 ± 0.038
AlaLeu: 6.363 ± 0.063
AlaMet: 1.5 ± 0.024
AlaAsn: 2.291 ± 0.034
AlaPro: 2.865 ± 0.04
AlaGln: 2.634 ± 0.034
AlaArg: 2.909 ± 0.035
AlaSer: 5.12 ± 0.052
AlaThr: 3.311 ± 0.037
AlaVal: 4.872 ± 0.045
AlaTrp: 0.683 ± 0.017
AlaTyr: 1.682 ± 0.027
AlaXaa: 0.001 ± 0.001
Cys
CysAla: 1.179 ± 0.023
CysCys: 0.699 ± 0.02
CysAsp: 1.095 ± 0.024
CysGlu: 1.352 ± 0.027
CysPhe: 0.977 ± 0.02
CysGly: 1.484 ± 0.031
CysHis: 0.658 ± 0.019
CysIle: 1.213 ± 0.026
CysLys: 1.385 ± 0.026
CysLeu: 2.169 ± 0.035
CysMet: 0.478 ± 0.014
CysAsn: 1.005 ± 0.022
CysPro: 1.23 ± 0.025
CysGln: 1.06 ± 0.023
CysArg: 1.219 ± 0.025
CysSer: 2.023 ± 0.032
CysThr: 1.242 ± 0.023
CysVal: 1.385 ± 0.027
CysTrp: 0.308 ± 0.012
CysTyr: 0.739 ± 0.019
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.905 ± 0.034
AspCys: 1.145 ± 0.023
AspAsp: 2.836 ± 0.04
AspGlu: 3.585 ± 0.041
AspPhe: 2.299 ± 0.03
AspGly: 3.323 ± 0.046
AspHis: 1.158 ± 0.021
AspIle: 3.06 ± 0.041
AspLys: 2.853 ± 0.034
AspLeu: 4.978 ± 0.044
AspMet: 1.177 ± 0.021
AspAsn: 2.032 ± 0.03
AspPro: 2.594 ± 0.036
AspGln: 1.845 ± 0.028
AspArg: 2.357 ± 0.038
AspSer: 4.097 ± 0.046
AspThr: 2.509 ± 0.028
AspVal: 3.239 ± 0.037
AspTrp: 0.658 ± 0.017
AspTyr: 1.7 ± 0.025
AspXaa: 0.001 ± 0.0
Glu
GluAla: 4.524 ± 0.05
GluCys: 1.352 ± 0.035
GluAsp: 4.466 ± 0.047
GluGlu: 7.663 ± 0.105
GluPhe: 2.166 ± 0.031
GluGly: 3.822 ± 0.044
GluHis: 1.554 ± 0.026
GluIle: 3.595 ± 0.04
GluLys: 5.738 ± 0.081
GluLeu: 6.257 ± 0.071
GluMet: 1.812 ± 0.027
GluAsn: 3.596 ± 0.041
GluPro: 2.477 ± 0.031
GluGln: 3.164 ± 0.044
GluArg: 3.813 ± 0.06
GluSer: 4.442 ± 0.043
GluThr: 3.577 ± 0.04
GluVal: 4.333 ± 0.044
GluTrp: 0.745 ± 0.017
GluTyr: 1.86 ± 0.029
GluXaa: 0.001 ± 0.001
Phe
PheAla: 2.199 ± 0.032
PheCys: 1.059 ± 0.021
PheAsp: 1.892 ± 0.033
PheGlu: 2.153 ± 0.028
PhePhe: 1.963 ± 0.038
PheGly: 2.332 ± 0.034
PheHis: 1.066 ± 0.018
PheIle: 2.242 ± 0.031
PheLys: 2.178 ± 0.029
PheLeu: 4.321 ± 0.047
PheMet: 0.825 ± 0.016
PheAsn: 1.591 ± 0.024
PhePro: 1.983 ± 0.028
PheGln: 1.855 ± 0.027
PheArg: 1.964 ± 0.026
PheSer: 3.525 ± 0.039
PheThr: 2.362 ± 0.029
PheVal: 2.436 ± 0.031
PheTrp: 0.542 ± 0.014
PheTyr: 1.383 ± 0.022
PheXaa: 0.001 ± 0.0
Gly
GlyAla: 3.405 ± 0.044
GlyCys: 1.232 ± 0.025
GlyAsp: 2.902 ± 0.031
GlyGlu: 3.7 ± 0.044
GlyPhe: 2.539 ± 0.039
GlyGly: 3.659 ± 0.049
GlyHis: 1.488 ± 0.026
GlyIle: 3.183 ± 0.038
GlyLys: 4.027 ± 0.047
GlyLeu: 5.1 ± 0.05
GlyMet: 1.361 ± 0.026
GlyAsn: 2.633 ± 0.035
GlyPro: 2.666 ± 0.07
GlyGln: 2.421 ± 0.033
GlyArg: 3.097 ± 0.038
GlySer: 4.865 ± 0.054
GlyThr: 3.409 ± 0.041
GlyVal: 3.432 ± 0.045
GlyTrp: 0.792 ± 0.019
GlyTyr: 1.881 ± 0.032
GlyXaa: 0.001 ± 0.001
His
HisAla: 1.329 ± 0.024
HisCys: 0.716 ± 0.018
HisAsp: 0.924 ± 0.018
HisGlu: 1.377 ± 0.021
HisPhe: 1.106 ± 0.021
HisGly: 1.46 ± 0.023
HisHis: 0.846 ± 0.023
HisIle: 1.379 ± 0.024
HisLys: 1.378 ± 0.023
HisLeu: 2.804 ± 0.039
HisMet: 0.58 ± 0.016
HisAsn: 1.001 ± 0.018
HisPro: 1.47 ± 0.027
HisGln: 1.187 ± 0.021
HisArg: 1.399 ± 0.022
HisSer: 2.192 ± 0.033
HisThr: 1.323 ± 0.024
HisVal: 1.532 ± 0.021
HisTrp: 0.419 ± 0.012
HisTyr: 0.885 ± 0.019
HisXaa: 0.001 ± 0.0
Ile
IleAla: 3.123 ± 0.038
IleCys: 1.306 ± 0.024
IleAsp: 2.425 ± 0.035
IleGlu: 2.998 ± 0.036
IlePhe: 2.324 ± 0.035
IleGly: 2.546 ± 0.037
IleHis: 1.432 ± 0.026
IleIle: 2.959 ± 0.039
IleLys: 3.163 ± 0.044
IleLeu: 5.159 ± 0.05
IleMet: 1.152 ± 0.022
IleAsn: 2.278 ± 0.026
IlePro: 2.943 ± 0.035
IleGln: 2.537 ± 0.031
IleArg: 2.554 ± 0.033
IleSer: 4.289 ± 0.043
IleThr: 2.971 ± 0.035
IleVal: 3.013 ± 0.036
IleTrp: 0.597 ± 0.016
IleTyr: 1.673 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.123 ± 0.042
LysCys: 1.293 ± 0.024
LysAsp: 3.496 ± 0.037
LysGlu: 5.751 ± 0.068
LysPhe: 1.983 ± 0.031
LysGly: 3.434 ± 0.044
LysHis: 1.635 ± 0.027
LysIle: 3.373 ± 0.037
LysLys: 5.5 ± 0.077
LysLeu: 5.792 ± 0.06
LysMet: 1.582 ± 0.022
LysAsn: 2.904 ± 0.034
LysPro: 3.059 ± 0.038
LysGln: 3.039 ± 0.042
LysArg: 3.537 ± 0.043
LysSer: 4.335 ± 0.05
LysThr: 3.51 ± 0.038
LysVal: 3.792 ± 0.042
LysTrp: 0.692 ± 0.016
LysTyr: 1.938 ± 0.03
LysXaa: 0.001 ± 0.001
Leu
LeuAla: 6.17 ± 0.058
LeuCys: 2.242 ± 0.034
LeuAsp: 4.887 ± 0.046
LeuGlu: 6.857 ± 0.075
LeuPhe: 3.652 ± 0.045
LeuGly: 5.145 ± 0.051
LeuHis: 2.715 ± 0.037
LeuIle: 4.458 ± 0.045
LeuLys: 6.544 ± 0.06
LeuLeu: 9.895 ± 0.091
LeuMet: 2.035 ± 0.025
LeuAsn: 3.93 ± 0.039
LeuPro: 5.355 ± 0.049
LeuGln: 5.554 ± 0.069
LeuArg: 5.084 ± 0.055
LeuSer: 7.792 ± 0.058
LeuThr: 4.997 ± 0.047
LeuVal: 5.518 ± 0.051
LeuTrp: 1.069 ± 0.021
LeuTyr: 2.853 ± 0.036
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.603 ± 0.021
MetCys: 0.473 ± 0.015
MetAsp: 1.296 ± 0.018
MetGlu: 1.894 ± 0.029
MetPhe: 0.882 ± 0.019
MetGly: 1.244 ± 0.022
MetHis: 0.538 ± 0.015
MetIle: 1.014 ± 0.018
MetLys: 1.659 ± 0.025
MetLeu: 2.097 ± 0.03
MetMet: 0.622 ± 0.015
MetAsn: 1.046 ± 0.021
MetPro: 1.037 ± 0.021
MetGln: 1.063 ± 0.022
MetArg: 1.081 ± 0.019
MetSer: 1.541 ± 0.028
MetThr: 1.188 ± 0.02
MetVal: 1.436 ± 0.025
MetTrp: 0.246 ± 0.008
MetTyr: 0.686 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.431 ± 0.031
AsnCys: 1.028 ± 0.021
AsnAsp: 1.807 ± 0.023
AsnGlu: 2.703 ± 0.035
AsnPhe: 1.707 ± 0.024
AsnGly: 2.865 ± 0.039
AsnHis: 1.008 ± 0.02
AsnIle: 2.598 ± 0.033
AsnLys: 2.668 ± 0.032
AsnLeu: 4.216 ± 0.043
AsnMet: 1.032 ± 0.019
AsnAsn: 1.96 ± 0.03
AsnPro: 2.318 ± 0.033
AsnGln: 1.789 ± 0.029
AsnArg: 2.095 ± 0.024
AsnSer: 3.555 ± 0.041
AsnThr: 2.358 ± 0.035
AsnVal: 2.589 ± 0.031
AsnTrp: 0.53 ± 0.013
AsnTyr: 1.358 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.582 ± 0.047
ProCys: 1.049 ± 0.024
ProAsp: 2.593 ± 0.03
ProGlu: 3.791 ± 0.042
ProPhe: 1.958 ± 0.032
ProGly: 3.458 ± 0.073
ProHis: 1.229 ± 0.018
ProIle: 1.998 ± 0.028
ProLys: 2.769 ± 0.037
ProLeu: 4.625 ± 0.051
ProMet: 0.96 ± 0.018
ProAsn: 1.96 ± 0.028
ProPro: 4.091 ± 0.077
ProGln: 2.262 ± 0.036
ProArg: 2.502 ± 0.036
ProSer: 4.912 ± 0.066
ProThr: 2.607 ± 0.034
ProVal: 3.663 ± 0.04
ProTrp: 0.575 ± 0.014
ProTyr: 1.473 ± 0.025
ProXaa: 0.002 ± 0.001
Gln
GlnAla: 2.969 ± 0.037
GlnCys: 0.985 ± 0.025
GlnAsp: 2.205 ± 0.029
GlnGlu: 3.643 ± 0.049
GlnPhe: 1.488 ± 0.022
GlnGly: 2.395 ± 0.037
GlnHis: 1.261 ± 0.024
GlnIle: 2.311 ± 0.03
GlnLys: 3.229 ± 0.043
GlnLeu: 4.567 ± 0.055
GlnMet: 1.116 ± 0.025
GlnAsn: 2.106 ± 0.03
GlnPro: 2.306 ± 0.037
GlnGln: 2.98 ± 0.068
GlnArg: 2.568 ± 0.034
GlnSer: 3.171 ± 0.042
GlnThr: 2.385 ± 0.032
GlnVal: 2.709 ± 0.03
GlnTrp: 0.515 ± 0.014
GlnTyr: 1.314 ± 0.024
GlnXaa: 0.002 ± 0.001
Arg
ArgAla: 2.963 ± 0.035
ArgCys: 1.13 ± 0.025
ArgAsp: 2.537 ± 0.036
ArgGlu: 3.683 ± 0.05
ArgPhe: 1.937 ± 0.031
ArgGly: 2.714 ± 0.039
ArgHis: 1.442 ± 0.024
ArgIle: 2.612 ± 0.031
ArgLys: 3.982 ± 0.047
ArgLeu: 4.849 ± 0.044
ArgMet: 1.181 ± 0.021
ArgAsn: 2.282 ± 0.029
ArgPro: 2.331 ± 0.037
ArgGln: 2.444 ± 0.03
ArgArg: 3.557 ± 0.047
ArgSer: 3.879 ± 0.064
ArgThr: 2.683 ± 0.032
ArgVal: 2.875 ± 0.032
ArgTrp: 0.607 ± 0.017
ArgTyr: 1.62 ± 0.025
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 5.051 ± 0.053
SerCys: 1.873 ± 0.031
SerAsp: 3.987 ± 0.043
SerGlu: 5.051 ± 0.055
SerPhe: 3.255 ± 0.037
SerGly: 4.866 ± 0.053
SerHis: 2.003 ± 0.031
SerIle: 3.702 ± 0.033
SerLys: 4.517 ± 0.053
SerLeu: 7.951 ± 0.057
SerMet: 1.656 ± 0.023
SerAsn: 3.266 ± 0.036
SerPro: 4.959 ± 0.069
SerGln: 3.602 ± 0.043
SerArg: 3.964 ± 0.056
SerSer: 9.093 ± 0.119
SerThr: 4.611 ± 0.05
SerVal: 5.212 ± 0.051
SerTrp: 0.985 ± 0.02
SerTyr: 2.321 ± 0.03
SerXaa: 0.003 ± 0.001
Thr
ThrAla: 3.843 ± 0.041
ThrCys: 1.342 ± 0.03
ThrAsp: 2.764 ± 0.028
ThrGlu: 3.754 ± 0.043
ThrPhe: 2.291 ± 0.027
ThrGly: 3.478 ± 0.037
ThrHis: 1.209 ± 0.024
ThrIle: 2.705 ± 0.031
ThrLys: 2.912 ± 0.031
ThrLeu: 5.17 ± 0.041
ThrMet: 1.163 ± 0.02
ThrAsn: 2.007 ± 0.03
ThrPro: 3.098 ± 0.043
ThrGln: 2.111 ± 0.028
ThrArg: 2.298 ± 0.032
ThrSer: 4.698 ± 0.056
ThrThr: 3.095 ± 0.047
ThrVal: 4.239 ± 0.038
ThrTrp: 0.672 ± 0.018
ThrTyr: 1.595 ± 0.024
ThrXaa: 0.001 ± 0.0
Val
ValAla: 4.017 ± 0.044
ValCys: 1.58 ± 0.031
ValAsp: 3.104 ± 0.037
ValGlu: 3.882 ± 0.039
ValPhe: 2.785 ± 0.036
ValGly: 3.304 ± 0.043
ValHis: 1.576 ± 0.025
ValIle: 3.499 ± 0.04
ValLys: 3.893 ± 0.037
ValLeu: 6.244 ± 0.054
ValMet: 1.416 ± 0.022
ValAsn: 2.672 ± 0.033
ValPro: 3.429 ± 0.04
ValGln: 2.771 ± 0.03
ValArg: 2.94 ± 0.035
ValSer: 5.039 ± 0.051
ValThr: 3.926 ± 0.046
ValVal: 4.294 ± 0.041
ValTrp: 0.738 ± 0.019
ValTyr: 1.949 ± 0.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.671 ± 0.016
TrpCys: 0.255 ± 0.01
TrpAsp: 0.673 ± 0.019
TrpGlu: 0.771 ± 0.018
TrpPhe: 0.464 ± 0.013
TrpGly: 0.646 ± 0.021
TrpHis: 0.315 ± 0.012
TrpIle: 0.647 ± 0.015
TrpLys: 0.914 ± 0.017
TrpLeu: 1.18 ± 0.023
TrpMet: 0.307 ± 0.009
TrpAsn: 0.698 ± 0.018
TrpPro: 0.446 ± 0.013
TrpGln: 0.532 ± 0.015
TrpArg: 0.658 ± 0.018
TrpSer: 0.875 ± 0.019
TrpThr: 0.666 ± 0.016
TrpVal: 0.664 ± 0.017
TrpTrp: 0.207 ± 0.008
TrpTyr: 0.39 ± 0.014
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.598 ± 0.027
TyrCys: 0.816 ± 0.019
TyrAsp: 1.481 ± 0.029
TyrGlu: 1.867 ± 0.023
TyrPhe: 1.482 ± 0.023
TyrGly: 1.811 ± 0.027
TyrHis: 0.826 ± 0.016
TyrIle: 1.709 ± 0.023
TyrLys: 1.747 ± 0.028
TyrLeu: 3.014 ± 0.035
TyrMet: 0.693 ± 0.015
TyrAsn: 1.355 ± 0.023
TyrPro: 1.401 ± 0.025
TyrGln: 1.376 ± 0.021
TyrArg: 1.712 ± 0.022
TyrSer: 2.479 ± 0.031
TyrThr: 1.713 ± 0.025
TyrVal: 1.827 ± 0.027
TyrTrp: 0.396 ± 0.011
TyrTyr: 1.12 ± 0.022
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.002 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.002 ± 0.001
XaaPhe: 0.0 ± 0.0
XaaGly: 0.002 ± 0.001
XaaHis: 0.001 ± 0.001
XaaIle: 0.002 ± 0.001
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.001 ± 0.001
XaaPro: 0.001 ± 0.001
XaaGln: 0.001 ± 0.001
XaaArg: 0.001 ± 0.001
XaaSer: 0.002 ± 0.001
XaaThr: 0.001 ± 0.001
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.106 ± 0.016
Statistics based on 7760 proteins (3012677 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
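To illustrate what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement, recomputes the frequency of one dipeptide for each replicate, and reports the spread of those estimates. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions; the exact procedure used by Proteome-pI is not described on this page.

```python
import random
from statistics import stdev


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-residue pair in one protein."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency."""
    rng = random.Random(seed)
    # Precompute (hits, number of dipeptides) once per protein.
    per_protein = [(count_pair(s, pair), max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(n for _, n in sample)
        if total:
            hits = sum(h for h, _ in sample)
            estimates.append(1000.0 * hits / total)
    return stdev(estimates) if len(estimates) > 1 else 0.0


# Hypothetical toy proteome (one-letter codes), not real data.
toy = ["MAAL", "AAK", "MKKL", "GAAG", "MLLK"]
print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the residues of each protein together, so the estimate reflects variation between proteins.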

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski