Amino acid dipeptide frequency for Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
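The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: it assumes sequences are given as one-letter strings, that overlapping adjacent pairs are counted within each protein (never across protein boundaries), and that frequencies are normalised by the total number of pairs. The function name `dipeptide_per_mille` is hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumption: input uses one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard dipeptides.

    Overlapping adjacent pairs are counted inside each sequence; pairs are
    never formed across two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1

    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}

    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


# Uniform expectation used as the baseline in the text: 1/400 of all pairs.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400
```

For example, `dipeptide_per_mille(["AAC", "CA"])` sees three pairs (AA, AC, CA), so each of them gets one third of the total, far above the uniform baseline of 2.5‰.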

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
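As a small worked example of the unit conversion, the sketch below converts a few values copied from the table into fractions and compares them with the uniform expectation of 1/400. The helper names are hypothetical, and the uniform baseline is the one stated in the text, not a composition-corrected expectation.

```python
PER_MILLE = 1e-3                      # 1 per mille = 0.001
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400   # 2.5 per mille for each of 400 dipeptides

# A few observed values taken from the table (in per mille).
observed = {
    "AlaAla": 8.177,
    "LeuLeu": 8.837,
    "CysCys": 0.274,
    "TrpTrp": 0.299,
}


def to_fraction(per_mille_value):
    """Convert a per mille value into a plain fraction."""
    return per_mille_value * PER_MILLE


def fold_vs_uniform(per_mille_value):
    """Ratio of an observed frequency to the uniform 1/400 expectation."""
    return per_mille_value / UNIFORM_EXPECTED_PER_MILLE


for name, value in observed.items():
    status = "over" if value > UNIFORM_EXPECTED_PER_MILLE else "under"
    print(f"{name}: fraction={to_fraction(value):.6f}, "
          f"fold={fold_vs_uniform(value):.2f} ({status}represented)")
```

Under this baseline, AlaAla and LeuLeu come out more than three times as frequent as expected, while CysCys and TrpTrp are roughly ten times rarer.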

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.177AlaAla: 8.177 ± 0.047
1.118AlaCys: 1.118 ± 0.014
4.063AlaAsp: 4.063 ± 0.029
4.844AlaGlu: 4.844 ± 0.046
3.189AlaPhe: 3.189 ± 0.025
5.658AlaGly: 5.658 ± 0.038
1.769AlaHis: 1.769 ± 0.019
4.406AlaIle: 4.406 ± 0.028
3.742AlaLys: 3.742 ± 0.029
7.769AlaLeu: 7.769 ± 0.043
1.997AlaMet: 1.997 ± 0.021
2.893AlaAsn: 2.893 ± 0.025
4.347AlaPro: 4.347 ± 0.039
3.235AlaGln: 3.235 ± 0.03
4.669AlaArg: 4.669 ± 0.029
6.847AlaSer: 6.847 ± 0.041
5.09AlaThr: 5.09 ± 0.037
5.404AlaVal: 5.404 ± 0.033
1.186AlaTrp: 1.186 ± 0.018
2.262AlaTyr: 2.262 ± 0.021
0.001AlaXaa: 0.001 ± 0.0
Cys
1.017CysAla: 1.017 ± 0.014
0.274CysCys: 0.274 ± 0.008
0.699CysAsp: 0.699 ± 0.012
0.626CysGlu: 0.626 ± 0.011
0.58CysPhe: 0.58 ± 0.011
0.968CysGly: 0.968 ± 0.014
0.36CysHis: 0.36 ± 0.008
0.77CysIle: 0.77 ± 0.013
0.517CysLys: 0.517 ± 0.011
1.392CysLeu: 1.392 ± 0.018
0.313CysMet: 0.313 ± 0.007
0.467CysAsn: 0.467 ± 0.01
0.703CysPro: 0.703 ± 0.014
0.504CysGln: 0.504 ± 0.01
0.809CysArg: 0.809 ± 0.013
1.01CysSer: 1.01 ± 0.015
0.745CysThr: 0.745 ± 0.013
0.86CysVal: 0.86 ± 0.015
0.219CysTrp: 0.219 ± 0.006
0.397CysTyr: 0.397 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
4.425AspAla: 4.425 ± 0.029
0.635AspCys: 0.635 ± 0.011
3.761AspAsp: 3.761 ± 0.036
4.1AspGlu: 4.1 ± 0.04
2.149AspPhe: 2.149 ± 0.021
3.938AspGly: 3.938 ± 0.028
1.265AspHis: 1.265 ± 0.013
3.235AspIle: 3.235 ± 0.028
2.276AspLys: 2.276 ± 0.023
5.102AspLeu: 5.102 ± 0.032
1.284AspMet: 1.284 ± 0.015
1.949AspAsn: 1.949 ± 0.019
3.365AspPro: 3.365 ± 0.026
1.945AspGln: 1.945 ± 0.021
3.065AspArg: 3.065 ± 0.023
4.094AspSer: 4.094 ± 0.031
3.015AspThr: 3.015 ± 0.024
3.663AspVal: 3.663 ± 0.023
0.891AspTrp: 0.891 ± 0.013
1.69AspTyr: 1.69 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
5.126GluAla: 5.126 ± 0.042
0.672GluCys: 0.672 ± 0.012
3.977GluAsp: 3.977 ± 0.033
5.209GluGlu: 5.209 ± 0.059
1.942GluPhe: 1.942 ± 0.017
3.704GluGly: 3.704 ± 0.031
1.383GluHis: 1.383 ± 0.016
3.072GluIle: 3.072 ± 0.022
3.635GluLys: 3.635 ± 0.033
5.194GluLeu: 5.194 ± 0.04
1.419GluMet: 1.419 ± 0.017
2.369GluAsn: 2.369 ± 0.02
2.708GluPro: 2.708 ± 0.033
2.478GluGln: 2.478 ± 0.026
3.826GluArg: 3.826 ± 0.03
4.318GluSer: 4.318 ± 0.036
3.511GluThr: 3.511 ± 0.027
3.562GluVal: 3.562 ± 0.025
0.912GluTrp: 0.912 ± 0.011
1.764GluTyr: 1.764 ± 0.02
0.0GluXaa: 0.0 ± 0.0
Phe
3.028PheAla: 3.028 ± 0.028
0.603PheCys: 0.603 ± 0.011
2.273PheAsp: 2.273 ± 0.021
2.097PheGlu: 2.097 ± 0.023
1.747PhePhe: 1.747 ± 0.02
2.873PheGly: 2.873 ± 0.028
0.977PheHis: 0.977 ± 0.012
1.958PheIle: 1.958 ± 0.022
1.474PheLys: 1.474 ± 0.018
3.731PheLeu: 3.731 ± 0.031
0.826PheMet: 0.826 ± 0.012
1.49PheAsn: 1.49 ± 0.017
2.046PhePro: 2.046 ± 0.021
1.464PheGln: 1.464 ± 0.017
1.996PheArg: 1.996 ± 0.019
3.066PheSer: 3.066 ± 0.027
2.229PheThr: 2.229 ± 0.018
2.473PheVal: 2.473 ± 0.022
0.679PheTrp: 0.679 ± 0.013
1.234PheTyr: 1.234 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
5.209GlyAla: 5.209 ± 0.038
0.963GlyCys: 0.963 ± 0.015
3.593GlyAsp: 3.593 ± 0.028
3.583GlyGlu: 3.583 ± 0.025
2.923GlyPhe: 2.923 ± 0.023
5.394GlyGly: 5.394 ± 0.044
1.73GlyHis: 1.73 ± 0.02
3.775GlyIle: 3.775 ± 0.03
3.361GlyLys: 3.361 ± 0.029
6.317GlyLeu: 6.317 ± 0.037
1.609GlyMet: 1.609 ± 0.019
2.593GlyAsn: 2.593 ± 0.02
3.267GlyPro: 3.267 ± 0.032
2.592GlyGln: 2.592 ± 0.026
4.003GlyArg: 4.003 ± 0.031
5.686GlySer: 5.686 ± 0.034
4.002GlyThr: 4.002 ± 0.029
4.584GlyVal: 4.584 ± 0.035
1.251GlyTrp: 1.251 ± 0.016
2.302GlyTyr: 2.302 ± 0.026
0.0GlyXaa: 0.0 ± 0.0
His
1.81HisAla: 1.81 ± 0.019
0.367HisCys: 0.367 ± 0.008
1.328HisAsp: 1.328 ± 0.017
1.336HisGlu: 1.336 ± 0.018
0.955HisPhe: 0.955 ± 0.014
1.745HisGly: 1.745 ± 0.022
0.832HisHis: 0.832 ± 0.015
1.296HisIle: 1.296 ± 0.012
0.869HisLys: 0.869 ± 0.013
2.289HisLeu: 2.289 ± 0.022
0.524HisMet: 0.524 ± 0.009
0.868HisAsn: 0.868 ± 0.013
1.675HisPro: 1.675 ± 0.02
0.952HisGln: 0.952 ± 0.015
1.568HisArg: 1.568 ± 0.019
1.914HisSer: 1.914 ± 0.02
1.313HisThr: 1.313 ± 0.015
1.444HisVal: 1.444 ± 0.018
0.385HisTrp: 0.385 ± 0.008
0.733HisTyr: 0.733 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.337IleAla: 4.337 ± 0.032
0.824IleCys: 0.824 ± 0.012
2.895IleAsp: 2.895 ± 0.023
2.898IleGlu: 2.898 ± 0.024
2.13IlePhe: 2.13 ± 0.024
3.429IleGly: 3.429 ± 0.027
1.305IleHis: 1.305 ± 0.015
2.696IleIle: 2.696 ± 0.022
2.076IleLys: 2.076 ± 0.02
4.892IleLeu: 4.892 ± 0.036
1.088IleMet: 1.088 ± 0.016
1.844IleAsn: 1.844 ± 0.019
3.204IlePro: 3.204 ± 0.025
2.004IleGln: 2.004 ± 0.019
2.878IleArg: 2.878 ± 0.023
4.056IleSer: 4.056 ± 0.025
2.977IleThr: 2.977 ± 0.026
3.399IleVal: 3.399 ± 0.026
0.773IleTrp: 0.773 ± 0.012
1.574IleTyr: 1.574 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
3.987LysAla: 3.987 ± 0.032
0.546LysCys: 0.546 ± 0.012
2.743LysAsp: 2.743 ± 0.024
3.297LysGlu: 3.297 ± 0.034
1.411LysPhe: 1.411 ± 0.017
2.966LysGly: 2.966 ± 0.023
1.085LysHis: 1.085 ± 0.014
2.175LysIle: 2.175 ± 0.023
2.952LysLys: 2.952 ± 0.041
3.978LysLeu: 3.978 ± 0.032
0.943LysMet: 0.943 ± 0.013
1.726LysAsn: 1.726 ± 0.018
2.569LysPro: 2.569 ± 0.026
1.819LysGln: 1.819 ± 0.019
3.15LysArg: 3.15 ± 0.027
3.338LysSer: 3.338 ± 0.025
2.655LysThr: 2.655 ± 0.021
2.85LysVal: 2.85 ± 0.024
0.676LysTrp: 0.676 ± 0.013
1.403LysTyr: 1.403 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
7.76LeuAla: 7.76 ± 0.043
1.316LeuCys: 1.316 ± 0.017
5.195LeuAsp: 5.195 ± 0.028
5.547LeuGlu: 5.547 ± 0.04
3.526LeuPhe: 3.526 ± 0.031
6.2LeuGly: 6.2 ± 0.046
2.347LeuHis: 2.347 ± 0.023
4.25LeuIle: 4.25 ± 0.035
4.086LeuLys: 4.086 ± 0.03
8.837LeuLeu: 8.837 ± 0.062
1.932LeuMet: 1.932 ± 0.021
3.337LeuAsn: 3.337 ± 0.025
5.463LeuPro: 5.463 ± 0.037
3.921LeuGln: 3.921 ± 0.029
5.75LeuArg: 5.75 ± 0.036
7.634LeuSer: 7.634 ± 0.046
5.031LeuThr: 5.031 ± 0.031
5.719LeuVal: 5.719 ± 0.039
1.288LeuTrp: 1.288 ± 0.017
2.578LeuTyr: 2.578 ± 0.026
0.0LeuXaa: 0.0 ± 0.0
Met
2.138MetAla: 2.138 ± 0.021
0.28MetCys: 0.28 ± 0.007
1.239MetAsp: 1.239 ± 0.014
1.319MetGlu: 1.319 ± 0.016
0.771MetPhe: 0.771 ± 0.011
1.522MetGly: 1.522 ± 0.016
0.489MetHis: 0.489 ± 0.01
1.084MetIle: 1.084 ± 0.014
1.038MetLys: 1.038 ± 0.013
1.93MetLeu: 1.93 ± 0.021
0.573MetMet: 0.573 ± 0.011
0.855MetAsn: 0.855 ± 0.012
1.192MetPro: 1.192 ± 0.014
0.877MetGln: 0.877 ± 0.013
1.257MetArg: 1.257 ± 0.016
1.875MetSer: 1.875 ± 0.019
1.335MetThr: 1.335 ± 0.016
1.441MetVal: 1.441 ± 0.017
0.274MetTrp: 0.274 ± 0.007
0.563MetTyr: 0.563 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
3.126AsnAla: 3.126 ± 0.027
0.499AsnCys: 0.499 ± 0.01
1.969AsnAsp: 1.969 ± 0.02
2.071AsnGlu: 2.071 ± 0.02
1.38AsnPhe: 1.38 ± 0.016
2.992AsnGly: 2.992 ± 0.027
0.888AsnHis: 0.888 ± 0.013
2.189AsnIle: 2.189 ± 0.018
1.532AsnLys: 1.532 ± 0.019
3.328AsnLeu: 3.328 ± 0.023
0.879AsnMet: 0.879 ± 0.014
1.483AsnAsn: 1.483 ± 0.02
2.492AsnPro: 2.492 ± 0.021
1.351AsnGln: 1.351 ± 0.018
1.95AsnArg: 1.95 ± 0.017
2.715AsnSer: 2.715 ± 0.022
2.237AsnThr: 2.237 ± 0.02
2.435AsnVal: 2.435 ± 0.023
0.602AsnTrp: 0.602 ± 0.012
1.124AsnTyr: 1.124 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
4.675ProAla: 4.675 ± 0.041
0.572ProCys: 0.572 ± 0.012
3.169ProAsp: 3.169 ± 0.026
3.873ProGlu: 3.873 ± 0.033
2.135ProPhe: 2.135 ± 0.021
3.873ProGly: 3.873 ± 0.034
1.285ProHis: 1.285 ± 0.017
2.569ProIle: 2.569 ± 0.022
2.507ProLys: 2.507 ± 0.026
4.752ProLeu: 4.752 ± 0.032
1.057ProMet: 1.057 ± 0.016
2.174ProAsn: 2.174 ± 0.019
4.39ProPro: 4.39 ± 0.058
2.375ProGln: 2.375 ± 0.028
3.293ProArg: 3.293 ± 0.027
5.812ProSer: 5.812 ± 0.049
3.862ProThr: 3.862 ± 0.034
3.635ProVal: 3.635 ± 0.031
0.796ProTrp: 0.796 ± 0.013
1.585ProTyr: 1.585 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.287GlnAla: 3.287 ± 0.026
0.482GlnCys: 0.482 ± 0.01
2.043GlnAsp: 2.043 ± 0.02
2.478GlnGlu: 2.478 ± 0.021
1.327GlnPhe: 1.327 ± 0.015
2.524GlnGly: 2.524 ± 0.022
1.027GlnHis: 1.027 ± 0.015
1.942GlnIle: 1.942 ± 0.016
1.978GlnLys: 1.978 ± 0.019
3.566GlnLeu: 3.566 ± 0.025
0.899GlnMet: 0.899 ± 0.013
1.57GlnAsn: 1.57 ± 0.017
2.449GlnPro: 2.449 ± 0.03
2.216GlnGln: 2.216 ± 0.036
2.577GlnArg: 2.577 ± 0.025
3.138GlnSer: 3.138 ± 0.027
2.349GlnThr: 2.349 ± 0.022
2.256GlnVal: 2.256 ± 0.019
0.614GlnTrp: 0.614 ± 0.01
1.229GlnTyr: 1.229 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
4.469ArgAla: 4.469 ± 0.033
0.756ArgCys: 0.756 ± 0.014
3.269ArgAsp: 3.269 ± 0.027
3.722ArgGlu: 3.722 ± 0.034
2.211ArgPhe: 2.211 ± 0.021
3.615ArgGly: 3.615 ± 0.029
1.543ArgHis: 1.543 ± 0.017
2.932ArgIle: 2.932 ± 0.024
3.279ArgLys: 3.279 ± 0.028
5.609ArgLeu: 5.609 ± 0.034
1.307ArgMet: 1.307 ± 0.016
2.239ArgAsn: 2.239 ± 0.02
3.279ArgPro: 3.279 ± 0.027
2.559ArgGln: 2.559 ± 0.024
4.834ArgArg: 4.834 ± 0.045
4.69ArgSer: 4.69 ± 0.037
3.229ArgThr: 3.229 ± 0.024
3.465ArgVal: 3.465 ± 0.024
1.006ArgTrp: 1.006 ± 0.013
1.771ArgTyr: 1.771 ± 0.02
0.0ArgXaa: 0.0 ± 0.0
Ser
6.408SerAla: 6.408 ± 0.042
0.973SerCys: 0.973 ± 0.015
4.227SerAsp: 4.227 ± 0.029
4.217SerGlu: 4.217 ± 0.034
3.15SerPhe: 3.15 ± 0.025
5.551SerGly: 5.551 ± 0.037
1.966SerHis: 1.966 ± 0.021
4.132SerIle: 4.132 ± 0.032
3.591SerLys: 3.591 ± 0.029
7.501SerLeu: 7.501 ± 0.044
1.722SerMet: 1.722 ± 0.017
3.028SerAsn: 3.028 ± 0.023
5.202SerPro: 5.202 ± 0.044
3.32SerGln: 3.32 ± 0.029
4.916SerArg: 4.916 ± 0.038
8.426SerSer: 8.426 ± 0.066
5.449SerThr: 5.449 ± 0.04
4.801SerVal: 4.801 ± 0.034
1.205SerTrp: 1.205 ± 0.016
2.201SerTyr: 2.201 ± 0.024
0.0SerXaa: 0.0 ± 0.0
Thr
5.053ThrAla: 5.053 ± 0.03
0.783ThrCys: 0.783 ± 0.014
2.944ThrAsp: 2.944 ± 0.022
3.221ThrGlu: 3.221 ± 0.028
2.297ThrPhe: 2.297 ± 0.022
4.323ThrGly: 4.323 ± 0.031
1.311ThrHis: 1.311 ± 0.016
3.193ThrIle: 3.193 ± 0.023
2.543ThrLys: 2.543 ± 0.025
5.372ThrLeu: 5.372 ± 0.035
1.236ThrMet: 1.236 ± 0.014
2.128ThrAsn: 2.128 ± 0.019
4.143ThrPro: 4.143 ± 0.038
2.105ThrGln: 2.105 ± 0.018
3.016ThrArg: 3.016 ± 0.023
5.16ThrSer: 5.16 ± 0.034
4.119ThrThr: 4.119 ± 0.037
3.997ThrVal: 3.997 ± 0.028
0.907ThrTrp: 0.907 ± 0.016
1.708ThrTyr: 1.708 ± 0.022
0.0ThrXaa: 0.0 ± 0.0
Val
5.212ValAla: 5.212 ± 0.034
0.918ValCys: 0.918 ± 0.014
3.819ValAsp: 3.819 ± 0.028
3.846ValGlu: 3.846 ± 0.029
2.589ValPhe: 2.589 ± 0.024
4.212ValGly: 4.212 ± 0.029
1.469ValHis: 1.469 ± 0.018
3.197ValIle: 3.197 ± 0.025
2.824ValLys: 2.824 ± 0.024
5.821ValLeu: 5.821 ± 0.035
1.386ValMet: 1.386 ± 0.016
2.35ValAsn: 2.35 ± 0.021
3.635ValPro: 3.635 ± 0.028
2.463ValGln: 2.463 ± 0.023
3.499ValArg: 3.499 ± 0.029
4.944ValSer: 4.944 ± 0.031
3.696ValThr: 3.696 ± 0.029
4.419ValVal: 4.419 ± 0.033
0.921ValTrp: 0.921 ± 0.013
1.904ValTyr: 1.904 ± 0.021
0.0ValXaa: 0.0 ± 0.0
Trp
1.161TrpAla: 1.161 ± 0.016
0.212TrpCys: 0.212 ± 0.006
0.946TrpAsp: 0.946 ± 0.015
0.885TrpGlu: 0.885 ± 0.012
0.58TrpPhe: 0.58 ± 0.011
1.005TrpGly: 1.005 ± 0.014
0.368TrpHis: 0.368 ± 0.008
0.83TrpIle: 0.83 ± 0.014
0.859TrpLys: 0.859 ± 0.013
1.464TrpLeu: 1.464 ± 0.018
0.393TrpMet: 0.393 ± 0.008
0.674TrpAsn: 0.674 ± 0.012
0.639TrpPro: 0.639 ± 0.011
0.585TrpGln: 0.585 ± 0.012
0.983TrpArg: 0.983 ± 0.01
1.105TrpSer: 1.105 ± 0.014
0.959TrpThr: 0.959 ± 0.014
0.951TrpVal: 0.951 ± 0.016
0.299TrpTrp: 0.299 ± 0.008
0.479TrpTyr: 0.479 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.259TyrAla: 2.259 ± 0.02
0.438TyrCys: 0.438 ± 0.009
1.695TyrAsp: 1.695 ± 0.019
1.612TyrGlu: 1.612 ± 0.018
1.272TyrPhe: 1.272 ± 0.016
2.243TyrGly: 2.243 ± 0.026
0.805TyrHis: 0.805 ± 0.012
1.579TyrIle: 1.579 ± 0.018
1.132TyrLys: 1.132 ± 0.016
2.871TyrLeu: 2.871 ± 0.025
0.664TyrMet: 0.664 ± 0.011
1.205TyrAsn: 1.205 ± 0.016
1.622TyrPro: 1.622 ± 0.021
1.185TyrGln: 1.185 ± 0.015
1.744TyrArg: 1.744 ± 0.018
2.159TyrSer: 2.159 ± 0.021
1.758TyrThr: 1.758 ± 0.019
1.769TyrVal: 1.769 ± 0.018
0.489TyrTrp: 0.489 ± 0.01
1.042TyrTyr: 1.042 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.001XaaXaa: 0.001 ± 0.0
Statistics based on 13500 proteins (5679895 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
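A protein-level bootstrap of this kind might look like the sketch below. It is an assumption-laden illustration: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates (the source does not say which spread measure is used). The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue pair.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the error is reported as the sample standard deviation.
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual pairs keeps the dependence between neighbouring residues within a protein intact, which is presumably why the estimate is made at the protein level.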

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski