Amino acid dipepetide frequency for Charadrius vociferus (Killdeer) (Aegialitis vocifera)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.696AlaAla: 5.696 ± 0.05
1.339AlaCys: 1.339 ± 0.017
2.99AlaAsp: 2.99 ± 0.024
4.535AlaGlu: 4.535 ± 0.034
2.629AlaPhe: 2.629 ± 0.022
3.92AlaGly: 3.92 ± 0.035
1.366AlaHis: 1.366 ± 0.016
3.122AlaIle: 3.122 ± 0.023
3.709AlaLys: 3.709 ± 0.029
6.496AlaLeu: 6.496 ± 0.044
1.511AlaMet: 1.511 ± 0.017
2.269AlaAsn: 2.269 ± 0.024
3.015AlaPro: 3.015 ± 0.029
2.77AlaGln: 2.77 ± 0.022
3.091AlaArg: 3.091 ± 0.028
5.108AlaSer: 5.108 ± 0.037
3.346AlaThr: 3.346 ± 0.029
4.953AlaVal: 4.953 ± 0.032
0.711AlaTrp: 0.711 ± 0.013
1.662AlaTyr: 1.662 ± 0.018
0.0AlaXaa: 0.0 ± 0.0
Cys
1.205CysAla: 1.205 ± 0.016
0.654CysCys: 0.654 ± 0.014
1.074CysAsp: 1.074 ± 0.021
1.318CysGlu: 1.318 ± 0.023
0.91CysPhe: 0.91 ± 0.012
1.484CysGly: 1.484 ± 0.025
0.636CysHis: 0.636 ± 0.013
1.19CysIle: 1.19 ± 0.02
1.323CysLys: 1.323 ± 0.017
2.181CysLeu: 2.181 ± 0.025
0.466CysMet: 0.466 ± 0.01
0.917CysAsn: 0.917 ± 0.017
1.268CysPro: 1.268 ± 0.025
1.052CysGln: 1.052 ± 0.017
1.222CysArg: 1.222 ± 0.016
2.022CysSer: 2.022 ± 0.027
1.213CysThr: 1.213 ± 0.018
1.42CysVal: 1.42 ± 0.027
0.296CysTrp: 0.296 ± 0.007
0.674CysTyr: 0.674 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
2.933AspAla: 2.933 ± 0.027
1.098AspCys: 1.098 ± 0.02
2.843AspAsp: 2.843 ± 0.03
3.677AspGlu: 3.677 ± 0.034
2.27AspPhe: 2.27 ± 0.019
3.317AspGly: 3.317 ± 0.033
1.169AspHis: 1.169 ± 0.015
2.998AspIle: 2.998 ± 0.026
2.784AspLys: 2.784 ± 0.023
5.026AspLeu: 5.026 ± 0.032
1.164AspMet: 1.164 ± 0.014
1.96AspAsn: 1.96 ± 0.021
2.644AspPro: 2.644 ± 0.027
1.857AspGln: 1.857 ± 0.019
2.362AspArg: 2.362 ± 0.023
4.046AspSer: 4.046 ± 0.033
2.516AspThr: 2.516 ± 0.02
3.329AspVal: 3.329 ± 0.027
0.664AspTrp: 0.664 ± 0.013
1.66AspTyr: 1.66 ± 0.014
0.0AspXaa: 0.0 ± 0.0
Glu
4.744GluAla: 4.744 ± 0.035
1.329GluCys: 1.329 ± 0.028
4.521GluAsp: 4.521 ± 0.033
7.919GluGlu: 7.919 ± 0.072
2.214GluPhe: 2.214 ± 0.02
3.995GluGly: 3.995 ± 0.032
1.544GluHis: 1.544 ± 0.018
3.541GluIle: 3.541 ± 0.025
5.756GluLys: 5.756 ± 0.054
6.325GluLeu: 6.325 ± 0.048
1.776GluMet: 1.776 ± 0.018
3.446GluAsn: 3.446 ± 0.028
2.511GluPro: 2.511 ± 0.023
3.201GluGln: 3.201 ± 0.032
3.876GluArg: 3.876 ± 0.037
4.429GluSer: 4.429 ± 0.036
3.554GluThr: 3.554 ± 0.026
4.525GluVal: 4.525 ± 0.029
0.751GluTrp: 0.751 ± 0.013
1.835GluTyr: 1.835 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
2.168PheAla: 2.168 ± 0.022
1.006PheCys: 1.006 ± 0.014
1.836PheAsp: 1.836 ± 0.019
2.15PheGlu: 2.15 ± 0.022
1.973PhePhe: 1.973 ± 0.022
2.323PheGly: 2.323 ± 0.027
1.083PheHis: 1.083 ± 0.014
2.113PheIle: 2.113 ± 0.021
2.269PheLys: 2.269 ± 0.021
4.197PheLeu: 4.197 ± 0.034
0.803PheMet: 0.803 ± 0.011
1.539PheAsn: 1.539 ± 0.015
1.928PhePro: 1.928 ± 0.019
1.832PheGln: 1.832 ± 0.018
2.027PheArg: 2.027 ± 0.02
3.451PheSer: 3.451 ± 0.027
2.362PheThr: 2.362 ± 0.023
2.373PheVal: 2.373 ± 0.024
0.519PheTrp: 0.519 ± 0.009
1.331PheTyr: 1.331 ± 0.016
0.0PheXaa: 0.0 ± 0.0
Gly
3.521GlyAla: 3.521 ± 0.031
1.253GlyCys: 1.253 ± 0.018
2.967GlyAsp: 2.967 ± 0.029
3.801GlyGlu: 3.801 ± 0.039
2.563GlyPhe: 2.563 ± 0.029
3.918GlyGly: 3.918 ± 0.048
1.517GlyHis: 1.517 ± 0.02
3.115GlyIle: 3.115 ± 0.027
3.992GlyLys: 3.992 ± 0.036
5.152GlyLeu: 5.152 ± 0.037
1.389GlyMet: 1.389 ± 0.02
2.73GlyAsn: 2.73 ± 0.024
2.734GlyPro: 2.734 ± 0.058
2.46GlyGln: 2.46 ± 0.024
3.291GlyArg: 3.291 ± 0.031
4.966GlySer: 4.966 ± 0.039
3.42GlyThr: 3.42 ± 0.031
3.483GlyVal: 3.483 ± 0.026
0.773GlyTrp: 0.773 ± 0.015
1.954GlyTyr: 1.954 ± 0.026
0.0GlyXaa: 0.0 ± 0.0
His
1.333HisAla: 1.333 ± 0.016
0.71HisCys: 0.71 ± 0.012
0.932HisAsp: 0.932 ± 0.011
1.345HisGlu: 1.345 ± 0.016
1.088HisPhe: 1.088 ± 0.016
1.498HisGly: 1.498 ± 0.021
0.847HisHis: 0.847 ± 0.015
1.35HisIle: 1.35 ± 0.016
1.414HisLys: 1.414 ± 0.021
2.806HisLeu: 2.806 ± 0.026
0.588HisMet: 0.588 ± 0.01
0.973HisAsn: 0.973 ± 0.012
1.493HisPro: 1.493 ± 0.018
1.18HisGln: 1.18 ± 0.018
1.451HisArg: 1.451 ± 0.018
2.174HisSer: 2.174 ± 0.022
1.379HisThr: 1.379 ± 0.022
1.53HisVal: 1.53 ± 0.016
0.484HisTrp: 0.484 ± 0.008
0.856HisTyr: 0.856 ± 0.014
0.0HisXaa: 0.0 ± 0.0
Ile
3.054IleAla: 3.054 ± 0.025
1.244IleCys: 1.244 ± 0.014
2.373IleAsp: 2.373 ± 0.022
2.935IleGlu: 2.935 ± 0.024
2.19IlePhe: 2.19 ± 0.025
2.487IleGly: 2.487 ± 0.024
1.415IleHis: 1.415 ± 0.017
2.784IleIle: 2.784 ± 0.031
3.036IleLys: 3.036 ± 0.025
5.019IleLeu: 5.019 ± 0.038
1.076IleMet: 1.076 ± 0.015
2.171IleAsn: 2.171 ± 0.021
2.847IlePro: 2.847 ± 0.025
2.494IleGln: 2.494 ± 0.022
2.684IleArg: 2.684 ± 0.023
4.048IleSer: 4.048 ± 0.029
2.901IleThr: 2.901 ± 0.024
2.947IleVal: 2.947 ± 0.025
0.588IleTrp: 0.588 ± 0.012
1.625IleTyr: 1.625 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.262LysAla: 4.262 ± 0.032
1.245LysCys: 1.245 ± 0.019
3.485LysAsp: 3.485 ± 0.027
5.689LysGlu: 5.689 ± 0.051
2.046LysPhe: 2.046 ± 0.021
3.452LysGly: 3.452 ± 0.037
1.57LysHis: 1.57 ± 0.019
3.236LysIle: 3.236 ± 0.028
5.541LysLys: 5.541 ± 0.052
5.851LysLeu: 5.851 ± 0.042
1.567LysMet: 1.567 ± 0.015
2.792LysAsn: 2.792 ± 0.024
3.055LysPro: 3.055 ± 0.036
2.961LysGln: 2.961 ± 0.026
3.553LysArg: 3.553 ± 0.032
4.246LysSer: 4.246 ± 0.036
3.419LysThr: 3.419 ± 0.027
3.754LysVal: 3.754 ± 0.027
0.683LysTrp: 0.683 ± 0.01
1.888LysTyr: 1.888 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
6.22LeuAla: 6.22 ± 0.04
2.191LeuCys: 2.191 ± 0.023
4.927LeuAsp: 4.927 ± 0.034
7.24LeuGlu: 7.24 ± 0.054
3.574LeuPhe: 3.574 ± 0.033
5.268LeuGly: 5.268 ± 0.036
2.672LeuHis: 2.672 ± 0.026
4.262LeuIle: 4.262 ± 0.029
6.426LeuLys: 6.426 ± 0.04
10.052LeuLeu: 10.052 ± 0.072
2.029LeuMet: 2.029 ± 0.022
3.986LeuAsn: 3.986 ± 0.03
5.451LeuPro: 5.451 ± 0.037
5.608LeuGln: 5.608 ± 0.042
5.271LeuArg: 5.271 ± 0.039
7.755LeuSer: 7.755 ± 0.041
4.995LeuThr: 4.995 ± 0.036
5.595LeuVal: 5.595 ± 0.039
1.085LeuTrp: 1.085 ± 0.016
2.789LeuTyr: 2.789 ± 0.027
0.0LeuXaa: 0.0 ± 0.0
Met
1.639MetAla: 1.639 ± 0.015
0.462MetCys: 0.462 ± 0.01
1.28MetAsp: 1.28 ± 0.017
1.861MetGlu: 1.861 ± 0.019
0.841MetPhe: 0.841 ± 0.013
1.28MetGly: 1.28 ± 0.017
0.527MetHis: 0.527 ± 0.009
0.984MetIle: 0.984 ± 0.015
1.614MetLys: 1.614 ± 0.02
2.065MetLeu: 2.065 ± 0.021
0.608MetMet: 0.608 ± 0.01
1.018MetAsn: 1.018 ± 0.014
1.038MetPro: 1.038 ± 0.016
1.025MetGln: 1.025 ± 0.014
1.05MetArg: 1.05 ± 0.014
1.55MetSer: 1.55 ± 0.017
1.154MetThr: 1.154 ± 0.015
1.435MetVal: 1.435 ± 0.016
0.254MetTrp: 0.254 ± 0.007
0.672MetTyr: 0.672 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.356AsnAla: 2.356 ± 0.023
0.98AsnCys: 0.98 ± 0.018
1.737AsnAsp: 1.737 ± 0.023
2.577AsnGlu: 2.577 ± 0.026
1.622AsnPhe: 1.622 ± 0.017
2.882AsnGly: 2.882 ± 0.032
0.987AsnHis: 0.987 ± 0.013
2.592AsnIle: 2.592 ± 0.022
2.626AsnLys: 2.626 ± 0.025
4.196AsnLeu: 4.196 ± 0.029
1.009AsnMet: 1.009 ± 0.013
1.848AsnAsn: 1.848 ± 0.022
2.28AsnPro: 2.28 ± 0.021
1.722AsnGln: 1.722 ± 0.018
2.141AsnArg: 2.141 ± 0.019
3.415AsnSer: 3.415 ± 0.029
2.241AsnThr: 2.241 ± 0.021
2.495AsnVal: 2.495 ± 0.024
0.509AsnTrp: 0.509 ± 0.009
1.314AsnTyr: 1.314 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
3.657ProAla: 3.657 ± 0.032
1.062ProCys: 1.062 ± 0.018
2.605ProAsp: 2.605 ± 0.023
3.811ProGlu: 3.811 ± 0.029
1.902ProPhe: 1.902 ± 0.02
3.655ProGly: 3.655 ± 0.086
1.236ProHis: 1.236 ± 0.018
1.964ProIle: 1.964 ± 0.018
2.754ProLys: 2.754 ± 0.032
4.59ProLeu: 4.59 ± 0.034
0.933ProMet: 0.933 ± 0.013
1.91ProAsn: 1.91 ± 0.021
4.235ProPro: 4.235 ± 0.055
2.301ProGln: 2.301 ± 0.024
2.587ProArg: 2.587 ± 0.025
4.938ProSer: 4.938 ± 0.039
2.665ProThr: 2.665 ± 0.028
3.72ProVal: 3.72 ± 0.032
0.572ProTrp: 0.572 ± 0.012
1.473ProTyr: 1.473 ± 0.017
0.0ProXaa: 0.0 ± 0.0
Gln
3.02GlnAla: 3.02 ± 0.027
0.982GlnCys: 0.982 ± 0.018
2.206GlnAsp: 2.206 ± 0.02
3.621GlnGlu: 3.621 ± 0.031
1.475GlnPhe: 1.475 ± 0.015
2.434GlnGly: 2.434 ± 0.029
1.294GlnHis: 1.294 ± 0.018
2.255GlnIle: 2.255 ± 0.021
3.201GlnLys: 3.201 ± 0.029
4.584GlnLeu: 4.584 ± 0.03
1.108GlnMet: 1.108 ± 0.014
2.034GlnAsn: 2.034 ± 0.023
2.321GlnPro: 2.321 ± 0.028
3.009GlnGln: 3.009 ± 0.048
2.64GlnArg: 2.64 ± 0.025
3.158GlnSer: 3.158 ± 0.031
2.344GlnThr: 2.344 ± 0.023
2.762GlnVal: 2.762 ± 0.025
0.531GlnTrp: 0.531 ± 0.011
1.297GlnTyr: 1.297 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
3.111ArgAla: 3.111 ± 0.027
1.142ArgCys: 1.142 ± 0.022
2.589ArgAsp: 2.589 ± 0.023
3.88ArgGlu: 3.88 ± 0.036
2.001ArgPhe: 2.001 ± 0.019
2.877ArgGly: 2.877 ± 0.03
1.566ArgHis: 1.566 ± 0.018
2.601ArgIle: 2.601 ± 0.02
3.978ArgLys: 3.978 ± 0.035
5.25ArgLeu: 5.25 ± 0.031
1.177ArgMet: 1.177 ± 0.016
2.245ArgAsn: 2.245 ± 0.021
2.383ArgPro: 2.383 ± 0.022
2.454ArgGln: 2.454 ± 0.027
3.678ArgArg: 3.678 ± 0.039
3.907ArgSer: 3.907 ± 0.041
2.679ArgThr: 2.679 ± 0.027
3.041ArgVal: 3.041 ± 0.023
0.634ArgTrp: 0.634 ± 0.011
1.618ArgTyr: 1.618 ± 0.016
0.0ArgXaa: 0.0 ± 0.0
Ser
5.026SerAla: 5.026 ± 0.038
1.842SerCys: 1.842 ± 0.022
3.99SerAsp: 3.99 ± 0.037
5.055SerGlu: 5.055 ± 0.04
3.118SerPhe: 3.118 ± 0.025
4.909SerGly: 4.909 ± 0.035
2.021SerHis: 2.021 ± 0.021
3.592SerIle: 3.592 ± 0.027
4.464SerLys: 4.464 ± 0.038
7.977SerLeu: 7.977 ± 0.041
1.622SerMet: 1.622 ± 0.018
3.068SerAsn: 3.068 ± 0.028
5.013SerPro: 5.013 ± 0.047
3.573SerGln: 3.573 ± 0.034
4.039SerArg: 4.039 ± 0.037
8.954SerSer: 8.954 ± 0.083
4.502SerThr: 4.502 ± 0.035
5.057SerVal: 5.057 ± 0.033
0.968SerTrp: 0.968 ± 0.012
2.243SerTyr: 2.243 ± 0.023
0.0SerXaa: 0.0 ± 0.0
Thr
3.814ThrAla: 3.814 ± 0.028
1.359ThrCys: 1.359 ± 0.025
2.717ThrAsp: 2.717 ± 0.025
3.69ThrGlu: 3.69 ± 0.026
2.189ThrPhe: 2.189 ± 0.021
3.462ThrGly: 3.462 ± 0.03
1.249ThrHis: 1.249 ± 0.021
2.625ThrIle: 2.625 ± 0.022
2.885ThrLys: 2.885 ± 0.026
5.252ThrLeu: 5.252 ± 0.03
1.112ThrMet: 1.112 ± 0.013
1.973ThrAsn: 1.973 ± 0.019
3.086ThrPro: 3.086 ± 0.031
2.099ThrGln: 2.099 ± 0.021
2.32ThrArg: 2.32 ± 0.018
4.605ThrSer: 4.605 ± 0.039
2.989ThrThr: 2.989 ± 0.036
4.126ThrVal: 4.126 ± 0.03
0.68ThrTrp: 0.68 ± 0.011
1.589ThrTyr: 1.589 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
4.216ValAla: 4.216 ± 0.03
1.604ValCys: 1.604 ± 0.027
3.087ValAsp: 3.087 ± 0.024
4.078ValGlu: 4.078 ± 0.023
2.79ValPhe: 2.79 ± 0.025
3.359ValGly: 3.359 ± 0.03
1.617ValHis: 1.617 ± 0.018
3.309ValIle: 3.309 ± 0.029
3.861ValLys: 3.861 ± 0.029
6.254ValLeu: 6.254 ± 0.04
1.41ValMet: 1.41 ± 0.015
2.604ValAsn: 2.604 ± 0.024
3.456ValPro: 3.456 ± 0.022
2.819ValGln: 2.819 ± 0.024
3.127ValArg: 3.127 ± 0.024
4.953ValSer: 4.953 ± 0.033
3.831ValThr: 3.831 ± 0.027
4.458ValVal: 4.458 ± 0.032
0.737ValTrp: 0.737 ± 0.013
1.893ValTyr: 1.893 ± 0.023
0.0ValXaa: 0.0 ± 0.0
Trp
0.675TrpAla: 0.675 ± 0.012
0.249TrpCys: 0.249 ± 0.007
0.678TrpAsp: 0.678 ± 0.013
0.782TrpGlu: 0.782 ± 0.012
0.471TrpPhe: 0.471 ± 0.01
0.664TrpGly: 0.664 ± 0.015
0.313TrpHis: 0.313 ± 0.008
0.624TrpIle: 0.624 ± 0.011
0.902TrpLys: 0.902 ± 0.012
1.175TrpLeu: 1.175 ± 0.019
0.317TrpMet: 0.317 ± 0.008
0.74TrpAsn: 0.74 ± 0.013
0.448TrpPro: 0.448 ± 0.009
0.547TrpGln: 0.547 ± 0.01
0.664TrpArg: 0.664 ± 0.01
0.892TrpSer: 0.892 ± 0.014
0.647TrpThr: 0.647 ± 0.012
0.683TrpVal: 0.683 ± 0.011
0.197TrpTrp: 0.197 ± 0.007
0.379TrpTyr: 0.379 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.589TyrAla: 1.589 ± 0.018
0.775TyrCys: 0.775 ± 0.014
1.481TyrAsp: 1.481 ± 0.016
1.826TyrGlu: 1.826 ± 0.019
1.418TyrPhe: 1.418 ± 0.017
1.814TyrGly: 1.814 ± 0.021
0.799TyrHis: 0.799 ± 0.013
1.663TyrIle: 1.663 ± 0.018
1.759TyrLys: 1.759 ± 0.019
2.952TyrLeu: 2.952 ± 0.026
0.693TyrMet: 0.693 ± 0.012
1.306TyrAsn: 1.306 ± 0.015
1.363TyrPro: 1.363 ± 0.019
1.31TyrGln: 1.31 ± 0.014
1.737TyrArg: 1.737 ± 0.019
2.418TyrSer: 2.418 ± 0.025
1.664TyrThr: 1.664 ± 0.021
1.775TyrVal: 1.775 ± 0.019
0.411TyrTrp: 0.411 ± 0.01
1.106TyrTyr: 1.106 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.028XaaXaa: 0.028 ± 0.006
Statistics based on 14257 proteins (6061522 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski