Amino acid dipeptide frequency for Puccinia coronata f. sp. avenae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.
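As a rough illustration of how such frequencies could be computed, the following Python sketch counts adjacent residue pairs in a set of protein sequences. It assumes overlapping two-residue windows that never span two proteins and uses one-letter codes; neither detail is stated on this page, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a two-residue window along each protein separately,
        # so no pair is formed across a protein boundary.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy = ["AAS", "SSA"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, freqs[pair])
```

With real data the input would be all protein sequences of the proteome, and the one-letter pairs would be mapped to the three-letter names used in the table.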

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
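The arithmetic behind the expected value can be sketched in Python as follows. The sketch assumes the uniform expectation over 400 dipeptides is the threshold for "more than expected"; the page does not state which threshold drives the colouring, and the helper name classify is hypothetical. The observed values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids

# Uniform expectation in per mille: every ordered pair equally likely.
expected_400 = 1000.0 / N_STANDARD ** 2        # 2.5 per mille (0.25 %)
expected_441 = 1000.0 / (N_STANDARD + 1) ** 2  # about 2.27 per mille with Xaa


def classify(freq_permille, expected=expected_400):
    """Label a frequency relative to the assumed uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# A few observed values (per mille) copied from the table.
observed = {"AlaAla": 7.507, "CysCys": 0.292, "SerSer": 13.624, "TrpCys": 0.193}
labels = {pair: classify(value) for pair, value in observed.items()}
print(labels)
```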

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
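For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names in this small Python sketch are hypothetical.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10.0


# Example with the AlaAla value from the table (7.507 per mille).
print(permille_to_fraction(7.507), permille_to_percent(7.507))
```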

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.507 ± 0.051
AlaCys: 1.365 ± 0.017
AlaAsp: 3.52 ± 0.02
AlaGlu: 4.211 ± 0.029
AlaPhe: 2.5 ± 0.019
AlaGly: 4.988 ± 0.032
AlaHis: 2.254 ± 0.016
AlaIle: 3.79 ± 0.024
AlaLys: 3.84 ± 0.027
AlaLeu: 7.045 ± 0.033
AlaMet: 1.586 ± 0.017
AlaAsn: 3.154 ± 0.022
AlaPro: 5.21 ± 0.035
AlaGln: 3.518 ± 0.02
AlaArg: 4.469 ± 0.024
AlaSer: 7.619 ± 0.037
AlaThr: 4.792 ± 0.025
AlaVal: 4.018 ± 0.019
AlaTrp: 0.852 ± 0.01
AlaTyr: 1.602 ± 0.015
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.905 ± 0.009
CysCys: 0.292 ± 0.006
CysAsp: 0.634 ± 0.008
CysGlu: 0.576 ± 0.007
CysPhe: 0.603 ± 0.009
CysGly: 0.872 ± 0.011
CysHis: 0.434 ± 0.007
CysIle: 0.592 ± 0.007
CysLys: 0.701 ± 0.014
CysLeu: 1.516 ± 0.015
CysMet: 0.271 ± 0.006
CysAsn: 0.556 ± 0.009
CysPro: 1.032 ± 0.012
CysGln: 0.687 ± 0.009
CysArg: 0.873 ± 0.01
CysSer: 1.425 ± 0.017
CysThr: 0.84 ± 0.013
CysVal: 0.69 ± 0.008
CysTrp: 0.217 ± 0.005
CysTyr: 0.377 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.413 ± 0.02
AspCys: 0.708 ± 0.009
AspAsp: 3.716 ± 0.032
AspGlu: 3.572 ± 0.024
AspPhe: 1.83 ± 0.016
AspGly: 3.152 ± 0.02
AspHis: 1.691 ± 0.013
AspIle: 2.185 ± 0.017
AspLys: 2.183 ± 0.016
AspLeu: 5.01 ± 0.026
AspMet: 0.921 ± 0.01
AspAsn: 1.974 ± 0.02
AspPro: 3.706 ± 0.022
AspGln: 2.63 ± 0.019
AspArg: 2.735 ± 0.02
AspSer: 4.537 ± 0.028
AspThr: 2.402 ± 0.017
AspVal: 2.622 ± 0.018
AspTrp: 0.8 ± 0.012
AspTyr: 1.225 ± 0.012
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.294 ± 0.026
GluCys: 0.551 ± 0.007
GluAsp: 3.462 ± 0.022
GluGlu: 4.922 ± 0.039
GluPhe: 1.688 ± 0.014
GluGly: 2.794 ± 0.018
GluHis: 1.329 ± 0.013
GluIle: 2.866 ± 0.021
GluLys: 3.183 ± 0.021
GluLeu: 5.405 ± 0.032
GluMet: 1.179 ± 0.012
GluAsn: 2.154 ± 0.017
GluPro: 2.886 ± 0.023
GluGln: 2.597 ± 0.026
GluArg: 3.116 ± 0.022
GluSer: 4.299 ± 0.024
GluThr: 2.766 ± 0.017
GluVal: 2.754 ± 0.019
GluTrp: 0.672 ± 0.008
GluTyr: 1.258 ± 0.014
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.286 ± 0.019
PheCys: 0.594 ± 0.008
PheAsp: 1.978 ± 0.014
PheGlu: 1.86 ± 0.013
PhePhe: 1.402 ± 0.014
PheGly: 2.309 ± 0.023
PheHis: 1.029 ± 0.012
PheIle: 1.68 ± 0.014
PheLys: 1.774 ± 0.02
PheLeu: 3.443 ± 0.022
PheMet: 0.652 ± 0.008
PheAsn: 1.574 ± 0.014
PhePro: 1.922 ± 0.015
PheGln: 1.576 ± 0.014
PheArg: 1.699 ± 0.013
PheSer: 3.016 ± 0.019
PheThr: 1.833 ± 0.016
PheVal: 1.898 ± 0.019
PheTrp: 0.526 ± 0.013
PheTyr: 0.89 ± 0.01
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.923 ± 0.023
GlyCys: 0.894 ± 0.014
GlyAsp: 2.559 ± 0.018
GlyGlu: 2.69 ± 0.018
GlyPhe: 2.242 ± 0.017
GlyGly: 4.716 ± 0.049
GlyHis: 1.698 ± 0.016
GlyIle: 2.848 ± 0.022
GlyLys: 3.085 ± 0.021
GlyLeu: 5.428 ± 0.026
GlyMet: 1.133 ± 0.012
GlyAsn: 2.334 ± 0.02
GlyPro: 3.164 ± 0.022
GlyGln: 2.531 ± 0.02
GlyArg: 3.422 ± 0.021
GlySer: 5.464 ± 0.035
GlyThr: 4.021 ± 0.045
GlyVal: 3.166 ± 0.025
GlyTrp: 0.858 ± 0.011
GlyTyr: 1.637 ± 0.018
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.132 ± 0.018
HisCys: 0.438 ± 0.007
HisAsp: 1.351 ± 0.013
HisGlu: 1.341 ± 0.013
HisPhe: 1.061 ± 0.01
HisGly: 1.56 ± 0.014
HisHis: 2.165 ± 0.027
HisIle: 1.237 ± 0.011
HisLys: 1.158 ± 0.011
HisLeu: 3.166 ± 0.022
HisMet: 0.517 ± 0.009
HisAsn: 1.135 ± 0.01
HisPro: 2.925 ± 0.024
HisGln: 2.062 ± 0.02
HisArg: 1.778 ± 0.016
HisSer: 2.996 ± 0.027
HisThr: 1.597 ± 0.013
HisVal: 1.398 ± 0.014
HisTrp: 0.385 ± 0.007
HisTyr: 0.728 ± 0.01
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.307 ± 0.021
IleCys: 0.82 ± 0.01
IleAsp: 2.676 ± 0.018
IleGlu: 2.682 ± 0.019
IlePhe: 1.715 ± 0.014
IleGly: 2.685 ± 0.022
IleHis: 1.457 ± 0.012
IleIle: 2.412 ± 0.022
IleLys: 2.667 ± 0.017
IleLeu: 4.524 ± 0.028
IleMet: 0.915 ± 0.011
IleAsn: 2.25 ± 0.015
IlePro: 3.261 ± 0.018
IleGln: 2.305 ± 0.014
IleArg: 3.001 ± 0.031
IleSer: 4.355 ± 0.023
IleThr: 2.707 ± 0.017
IleVal: 2.468 ± 0.017
IleTrp: 0.593 ± 0.008
IleTyr: 1.175 ± 0.012
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.092 ± 0.03
LysCys: 0.559 ± 0.009
LysAsp: 2.549 ± 0.022
LysGlu: 3.116 ± 0.023
LysPhe: 1.636 ± 0.014
LysGly: 2.649 ± 0.034
LysHis: 1.327 ± 0.015
LysIle: 2.656 ± 0.021
LysLys: 4.007 ± 0.031
LysLeu: 5.016 ± 0.026
LysMet: 1.131 ± 0.013
LysAsn: 2.268 ± 0.018
LysPro: 3.305 ± 0.024
LysGln: 2.354 ± 0.021
LysArg: 3.298 ± 0.025
LysSer: 4.297 ± 0.022
LysThr: 3.093 ± 0.021
LysVal: 2.511 ± 0.016
LysTrp: 0.606 ± 0.008
LysTyr: 1.209 ± 0.012
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.162 ± 0.035
LeuCys: 1.311 ± 0.012
LeuAsp: 5.27 ± 0.024
LeuGlu: 5.333 ± 0.03
LeuPhe: 3.212 ± 0.021
LeuGly: 5.41 ± 0.025
LeuHis: 2.657 ± 0.014
LeuIle: 4.92 ± 0.033
LeuLys: 5.132 ± 0.03
LeuLeu: 9.129 ± 0.049
LeuMet: 1.91 ± 0.012
LeuAsn: 4.009 ± 0.023
LeuPro: 6.709 ± 0.034
LeuGln: 4.083 ± 0.025
LeuArg: 5.145 ± 0.027
LeuSer: 8.664 ± 0.039
LeuThr: 5.393 ± 0.027
LeuVal: 5.489 ± 0.029
LeuTrp: 1.034 ± 0.01
LeuTyr: 1.989 ± 0.013
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.868 ± 0.015
MetCys: 0.234 ± 0.005
MetAsp: 1.142 ± 0.012
MetGlu: 1.127 ± 0.01
MetPhe: 0.617 ± 0.009
MetGly: 1.163 ± 0.012
MetHis: 0.442 ± 0.007
MetIle: 1.056 ± 0.011
MetLys: 1.126 ± 0.012
MetLeu: 1.616 ± 0.015
MetMet: 0.573 ± 0.009
MetAsn: 0.901 ± 0.012
MetPro: 1.091 ± 0.017
MetGln: 0.7 ± 0.009
MetArg: 1.048 ± 0.011
MetSer: 1.781 ± 0.015
MetThr: 1.196 ± 0.015
MetVal: 1.088 ± 0.011
MetTrp: 0.202 ± 0.005
MetTyr: 0.422 ± 0.006
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.763 ± 0.02
AsnCys: 0.632 ± 0.01
AsnAsp: 2.002 ± 0.015
AsnGlu: 2.036 ± 0.016
AsnPhe: 1.453 ± 0.014
AsnGly: 2.603 ± 0.026
AsnHis: 1.602 ± 0.015
AsnIle: 1.846 ± 0.014
AsnLys: 2.087 ± 0.017
AsnLeu: 4.16 ± 0.026
AsnMet: 0.787 ± 0.01
AsnAsn: 2.607 ± 0.035
AsnPro: 3.523 ± 0.022
AsnGln: 2.552 ± 0.016
AsnArg: 2.303 ± 0.015
AsnSer: 4.129 ± 0.026
AsnThr: 2.489 ± 0.018
AsnVal: 1.965 ± 0.016
AsnTrp: 0.532 ± 0.008
AsnTyr: 1.081 ± 0.011
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.583 ± 0.042
ProCys: 0.801 ± 0.011
ProAsp: 3.22 ± 0.022
ProGlu: 3.401 ± 0.027
ProPhe: 2.194 ± 0.018
ProGly: 3.657 ± 0.025
ProHis: 2.199 ± 0.019
ProIle: 3.158 ± 0.021
ProLys: 3.093 ± 0.026
ProLeu: 5.889 ± 0.034
ProMet: 1.143 ± 0.011
ProAsn: 3.113 ± 0.021
ProPro: 7.412 ± 0.062
ProGln: 2.955 ± 0.023
ProArg: 3.421 ± 0.02
ProSer: 8.667 ± 0.044
ProThr: 5.185 ± 0.029
ProVal: 3.692 ± 0.026
ProTrp: 0.596 ± 0.009
ProTyr: 1.476 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.158 ± 0.031
GlnCys: 0.49 ± 0.007
GlnAsp: 2.237 ± 0.016
GlnGlu: 2.668 ± 0.022
GlnPhe: 1.472 ± 0.013
GlnGly: 2.024 ± 0.019
GlnHis: 1.618 ± 0.016
GlnIle: 2.267 ± 0.014
GlnLys: 2.283 ± 0.017
GlnLeu: 4.827 ± 0.031
GlnMet: 0.952 ± 0.011
GlnAsn: 1.94 ± 0.021
GlnPro: 3.769 ± 0.028
GlnGln: 3.997 ± 0.072
GlnArg: 2.57 ± 0.017
GlnSer: 4.295 ± 0.026
GlnThr: 2.766 ± 0.019
GlnVal: 2.556 ± 0.022
GlnTrp: 0.527 ± 0.009
GlnTyr: 0.982 ± 0.012
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.386 ± 0.021
ArgCys: 0.801 ± 0.011
ArgAsp: 2.499 ± 0.019
ArgGlu: 2.847 ± 0.022
ArgPhe: 1.996 ± 0.015
ArgGly: 2.894 ± 0.023
ArgHis: 1.591 ± 0.014
ArgIle: 2.703 ± 0.016
ArgLys: 3.251 ± 0.02
ArgLeu: 5.708 ± 0.03
ArgMet: 1.155 ± 0.011
ArgAsn: 2.144 ± 0.014
ArgPro: 3.916 ± 0.024
ArgGln: 2.615 ± 0.018
ArgArg: 4.296 ± 0.027
ArgSer: 5.181 ± 0.029
ArgThr: 3.422 ± 0.022
ArgVal: 3.182 ± 0.034
ArgTrp: 0.74 ± 0.01
ArgTyr: 1.425 ± 0.02
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.081 ± 0.04
SerCys: 1.286 ± 0.013
SerAsp: 4.712 ± 0.032
SerGlu: 4.211 ± 0.025
SerPhe: 3.15 ± 0.021
SerGly: 5.386 ± 0.032
SerHis: 3.171 ± 0.022
SerIle: 4.443 ± 0.023
SerLys: 4.59 ± 0.025
SerLeu: 8.699 ± 0.037
SerMet: 1.739 ± 0.013
SerAsn: 4.559 ± 0.025
SerPro: 7.125 ± 0.047
SerGln: 4.508 ± 0.026
SerArg: 5.424 ± 0.033
SerSer: 13.624 ± 0.096
SerThr: 6.826 ± 0.039
SerVal: 4.361 ± 0.024
SerTrp: 1.048 ± 0.014
SerTyr: 1.949 ± 0.014
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.592 ± 0.024
ThrCys: 0.961 ± 0.014
ThrAsp: 2.644 ± 0.017
ThrGlu: 2.736 ± 0.018
ThrPhe: 1.966 ± 0.015
ThrGly: 3.826 ± 0.024
ThrHis: 1.885 ± 0.015
ThrIle: 2.936 ± 0.02
ThrLys: 2.847 ± 0.023
ThrLeu: 5.739 ± 0.035
ThrMet: 1.06 ± 0.011
ThrAsn: 2.646 ± 0.02
ThrPro: 4.885 ± 0.033
ThrGln: 2.662 ± 0.02
ThrArg: 3.422 ± 0.02
ThrSer: 6.452 ± 0.033
ThrThr: 4.931 ± 0.036
ThrVal: 2.871 ± 0.019
ThrTrp: 0.712 ± 0.009
ThrTyr: 1.281 ± 0.013
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.081 ± 0.026
ValCys: 0.772 ± 0.012
ValAsp: 2.978 ± 0.019
ValGlu: 3.074 ± 0.023
ValPhe: 1.887 ± 0.022
ValGly: 3.077 ± 0.022
ValHis: 1.414 ± 0.016
ValIle: 2.682 ± 0.02
ValLys: 2.656 ± 0.018
ValLeu: 4.845 ± 0.026
ValMet: 1.045 ± 0.01
ValAsn: 2.177 ± 0.018
ValPro: 3.722 ± 0.036
ValGln: 2.253 ± 0.017
ValArg: 2.7 ± 0.018
ValSer: 4.231 ± 0.021
ValThr: 2.858 ± 0.018
ValVal: 3.147 ± 0.026
ValTrp: 0.708 ± 0.01
ValTyr: 1.323 ± 0.013
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.903 ± 0.011
TrpCys: 0.193 ± 0.006
TrpAsp: 0.676 ± 0.009
TrpGlu: 0.671 ± 0.009
TrpPhe: 0.402 ± 0.007
TrpGly: 0.624 ± 0.008
TrpHis: 0.296 ± 0.006
TrpIle: 0.674 ± 0.008
TrpLys: 0.759 ± 0.009
TrpLeu: 1.236 ± 0.013
TrpMet: 0.286 ± 0.006
TrpAsn: 0.61 ± 0.008
TrpPro: 0.583 ± 0.009
TrpGln: 0.494 ± 0.01
TrpArg: 0.745 ± 0.012
TrpSer: 0.994 ± 0.011
TrpThr: 0.754 ± 0.009
TrpVal: 0.616 ± 0.008
TrpTrp: 0.194 ± 0.005
TrpTyr: 0.418 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.403 ± 0.011
TyrCys: 0.388 ± 0.007
TyrAsp: 1.187 ± 0.011
TyrGlu: 1.098 ± 0.012
TyrPhe: 0.915 ± 0.011
TyrGly: 1.423 ± 0.016
TyrHis: 0.87 ± 0.01
TyrIle: 1.026 ± 0.01
TyrLys: 1.26 ± 0.028
TyrLeu: 2.561 ± 0.021
TyrMet: 0.439 ± 0.007
TyrAsn: 1.048 ± 0.011
TyrPro: 1.71 ± 0.029
TyrGln: 1.321 ± 0.014
TyrArg: 1.285 ± 0.01
TyrSer: 1.868 ± 0.015
TyrThr: 1.181 ± 0.012
TyrVal: 1.138 ± 0.012
TyrTrp: 0.326 ± 0.006
TyrTyr: 0.683 ± 0.009
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 25533 proteins (9799942 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
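A protein-level bootstrap can be sketched in Python as shown below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, 100 replicates are drawn, and the error is taken as the standard deviation across replicates. The page does not say which spread measure was used, and the function names, seed, and toy sequences are hypothetical.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) over windows within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy = ["AASSA", "SSSAA", "ASASAS", "AAAA"]
err = bootstrap_error(toy, "SS")
print(pair_frequency(toy, "SS"), err)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.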

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski