Amino acid dipepetide frequency for Hypocrea virens (strain Gv29-8 / FGSC 10586) (Gliocladium virens) (Trichoderma virens)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.556AlaAla: 9.556 ± 0.063
1.125AlaCys: 1.125 ± 0.016
4.31AlaAsp: 4.31 ± 0.03
5.086AlaGlu: 5.086 ± 0.039
3.233AlaPhe: 3.233 ± 0.028
5.618AlaGly: 5.618 ± 0.041
1.741AlaHis: 1.741 ± 0.017
4.74AlaIle: 4.74 ± 0.035
4.243AlaLys: 4.243 ± 0.03
7.847AlaLeu: 7.847 ± 0.046
2.028AlaMet: 2.028 ± 0.02
3.148AlaAsn: 3.148 ± 0.026
4.484AlaPro: 4.484 ± 0.039
3.429AlaGln: 3.429 ± 0.033
4.674AlaArg: 4.674 ± 0.03
7.322AlaSer: 7.322 ± 0.044
5.254AlaThr: 5.254 ± 0.032
5.471AlaVal: 5.471 ± 0.035
1.226AlaTrp: 1.226 ± 0.018
2.237AlaTyr: 2.237 ± 0.02
0.0AlaXaa: 0.0 ± 0.0
Cys
0.962CysAla: 0.962 ± 0.013
0.271CysCys: 0.271 ± 0.008
0.713CysAsp: 0.713 ± 0.012
0.611CysGlu: 0.611 ± 0.011
0.598CysPhe: 0.598 ± 0.011
0.996CysGly: 0.996 ± 0.016
0.344CysHis: 0.344 ± 0.008
0.793CysIle: 0.793 ± 0.012
0.542CysLys: 0.542 ± 0.012
1.318CysLeu: 1.318 ± 0.018
0.271CysMet: 0.271 ± 0.006
0.482CysAsn: 0.482 ± 0.01
0.662CysPro: 0.662 ± 0.015
0.507CysGln: 0.507 ± 0.01
0.768CysArg: 0.768 ± 0.014
0.978CysSer: 0.978 ± 0.014
0.709CysThr: 0.709 ± 0.014
0.812CysVal: 0.812 ± 0.013
0.218CysTrp: 0.218 ± 0.006
0.39CysTyr: 0.39 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
4.908AspAla: 4.908 ± 0.029
0.671AspCys: 0.671 ± 0.012
4.386AspAsp: 4.386 ± 0.035
4.564AspGlu: 4.564 ± 0.039
2.279AspPhe: 2.279 ± 0.022
4.285AspGly: 4.285 ± 0.034
1.206AspHis: 1.206 ± 0.016
3.362AspIle: 3.362 ± 0.025
2.616AspLys: 2.616 ± 0.023
5.006AspLeu: 5.006 ± 0.036
1.332AspMet: 1.332 ± 0.016
1.964AspAsn: 1.964 ± 0.021
3.063AspPro: 3.063 ± 0.023
1.851AspGln: 1.851 ± 0.017
2.863AspArg: 2.863 ± 0.024
4.082AspSer: 4.082 ± 0.031
2.847AspThr: 2.847 ± 0.023
3.772AspVal: 3.772 ± 0.032
0.905AspTrp: 0.905 ± 0.014
1.63AspTyr: 1.63 ± 0.021
0.0AspXaa: 0.0 ± 0.0
Glu
5.629GluAla: 5.629 ± 0.045
0.655GluCys: 0.655 ± 0.011
4.058GluAsp: 4.058 ± 0.035
5.268GluGlu: 5.268 ± 0.054
2.02GluPhe: 2.02 ± 0.02
3.472GluGly: 3.472 ± 0.029
1.351GluHis: 1.351 ± 0.015
3.213GluIle: 3.213 ± 0.026
3.636GluLys: 3.636 ± 0.033
5.323GluLeu: 5.323 ± 0.038
1.536GluMet: 1.536 ± 0.016
2.261GluAsn: 2.261 ± 0.023
2.682GluPro: 2.682 ± 0.028
2.436GluGln: 2.436 ± 0.027
3.673GluArg: 3.673 ± 0.035
4.225GluSer: 4.225 ± 0.029
3.426GluThr: 3.426 ± 0.027
3.414GluVal: 3.414 ± 0.03
0.917GluTrp: 0.917 ± 0.015
1.714GluTyr: 1.714 ± 0.02
0.0GluXaa: 0.0 ± 0.0
Phe
3.13PheAla: 3.13 ± 0.025
0.602PheCys: 0.602 ± 0.01
2.362PheAsp: 2.362 ± 0.022
2.175PheGlu: 2.175 ± 0.02
1.714PhePhe: 1.714 ± 0.021
2.919PheGly: 2.919 ± 0.027
0.915PheHis: 0.915 ± 0.013
2.051PheIle: 2.051 ± 0.022
1.574PheLys: 1.574 ± 0.017
3.528PheLeu: 3.528 ± 0.031
0.819PheMet: 0.819 ± 0.011
1.511PheAsn: 1.511 ± 0.015
1.926PhePro: 1.926 ± 0.02
1.479PheGln: 1.479 ± 0.018
1.944PheArg: 1.944 ± 0.018
3.104PheSer: 3.104 ± 0.026
2.198PheThr: 2.198 ± 0.019
2.43PheVal: 2.43 ± 0.023
0.678PheTrp: 0.678 ± 0.011
1.165PheTyr: 1.165 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
5.251GlyAla: 5.251 ± 0.038
0.919GlyCys: 0.919 ± 0.013
3.711GlyAsp: 3.711 ± 0.03
3.567GlyGlu: 3.567 ± 0.029
2.818GlyPhe: 2.818 ± 0.026
5.516GlyGly: 5.516 ± 0.055
1.711GlyHis: 1.711 ± 0.022
3.824GlyIle: 3.824 ± 0.026
3.509GlyLys: 3.509 ± 0.032
6.077GlyLeu: 6.077 ± 0.039
1.563GlyMet: 1.563 ± 0.016
2.642GlyAsn: 2.642 ± 0.024
3.168GlyPro: 3.168 ± 0.031
2.557GlyGln: 2.557 ± 0.024
4.003GlyArg: 4.003 ± 0.032
5.656GlySer: 5.656 ± 0.044
3.856GlyThr: 3.856 ± 0.035
4.256GlyVal: 4.256 ± 0.03
1.193GlyTrp: 1.193 ± 0.014
2.153GlyTyr: 2.153 ± 0.022
0.0GlyXaa: 0.0 ± 0.0
His
1.758HisAla: 1.758 ± 0.018
0.34HisCys: 0.34 ± 0.008
1.346HisAsp: 1.346 ± 0.019
1.316HisGlu: 1.316 ± 0.018
0.935HisPhe: 0.935 ± 0.013
1.713HisGly: 1.713 ± 0.02
0.888HisHis: 0.888 ± 0.017
1.316HisIle: 1.316 ± 0.016
0.963HisLys: 0.963 ± 0.013
2.273HisLeu: 2.273 ± 0.022
0.522HisMet: 0.522 ± 0.008
0.868HisAsn: 0.868 ± 0.013
1.513HisPro: 1.513 ± 0.018
1.024HisGln: 1.024 ± 0.017
1.447HisArg: 1.447 ± 0.019
1.743HisSer: 1.743 ± 0.019
1.183HisThr: 1.183 ± 0.015
1.428HisVal: 1.428 ± 0.018
0.375HisTrp: 0.375 ± 0.007
0.697HisTyr: 0.697 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
4.473IleAla: 4.473 ± 0.03
0.848IleCys: 0.848 ± 0.012
3.088IleAsp: 3.088 ± 0.024
3.071IleGlu: 3.071 ± 0.028
2.202IlePhe: 2.202 ± 0.021
3.417IleGly: 3.417 ± 0.029
1.314IleHis: 1.314 ± 0.015
2.86IleIle: 2.86 ± 0.031
2.448IleLys: 2.448 ± 0.021
4.852IleLeu: 4.852 ± 0.034
1.102IleMet: 1.102 ± 0.014
2.035IleAsn: 2.035 ± 0.02
3.14IlePro: 3.14 ± 0.025
2.152IleGln: 2.152 ± 0.024
2.984IleArg: 2.984 ± 0.025
4.182IleSer: 4.182 ± 0.033
3.0IleThr: 3.0 ± 0.028
3.342IleVal: 3.342 ± 0.029
0.827IleTrp: 0.827 ± 0.013
1.529IleTyr: 1.529 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
4.412LysAla: 4.412 ± 0.036
0.533LysCys: 0.533 ± 0.012
2.876LysAsp: 2.876 ± 0.027
3.345LysGlu: 3.345 ± 0.032
1.589LysPhe: 1.589 ± 0.018
2.981LysGly: 2.981 ± 0.028
1.14LysHis: 1.14 ± 0.015
2.436LysIle: 2.436 ± 0.023
3.278LysLys: 3.278 ± 0.038
4.362LysLeu: 4.362 ± 0.032
1.104LysMet: 1.104 ± 0.015
1.787LysAsn: 1.787 ± 0.016
2.653LysPro: 2.653 ± 0.026
1.885LysGln: 1.885 ± 0.018
3.247LysArg: 3.247 ± 0.029
3.543LysSer: 3.543 ± 0.025
2.898LysThr: 2.898 ± 0.025
2.814LysVal: 2.814 ± 0.026
0.727LysTrp: 0.727 ± 0.014
1.446LysTyr: 1.446 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
7.994LeuAla: 7.994 ± 0.044
1.279LeuCys: 1.279 ± 0.018
5.376LeuAsp: 5.376 ± 0.033
5.582LeuGlu: 5.582 ± 0.044
3.427LeuPhe: 3.427 ± 0.029
6.044LeuGly: 6.044 ± 0.039
2.237LeuHis: 2.237 ± 0.021
4.309LeuIle: 4.309 ± 0.034
4.305LeuLys: 4.305 ± 0.033
8.689LeuLeu: 8.689 ± 0.058
1.872LeuMet: 1.872 ± 0.018
3.279LeuAsn: 3.279 ± 0.024
5.351LeuPro: 5.351 ± 0.036
3.994LeuGln: 3.994 ± 0.033
5.633LeuArg: 5.633 ± 0.036
7.424LeuSer: 7.424 ± 0.041
4.789LeuThr: 4.789 ± 0.033
5.552LeuVal: 5.552 ± 0.038
1.297LeuTrp: 1.297 ± 0.014
2.432LeuTyr: 2.432 ± 0.022
0.0LeuXaa: 0.0 ± 0.0
Met
2.386MetAla: 2.386 ± 0.02
0.25MetCys: 0.25 ± 0.007
1.306MetAsp: 1.306 ± 0.016
1.329MetGlu: 1.329 ± 0.016
0.74MetPhe: 0.74 ± 0.012
1.468MetGly: 1.468 ± 0.018
0.483MetHis: 0.483 ± 0.01
1.067MetIle: 1.067 ± 0.014
1.042MetLys: 1.042 ± 0.013
1.944MetLeu: 1.944 ± 0.015
0.609MetMet: 0.609 ± 0.013
0.832MetAsn: 0.832 ± 0.013
1.324MetPro: 1.324 ± 0.017
0.888MetGln: 0.888 ± 0.014
1.286MetArg: 1.286 ± 0.017
1.844MetSer: 1.844 ± 0.02
1.311MetThr: 1.311 ± 0.016
1.332MetVal: 1.332 ± 0.017
0.28MetTrp: 0.28 ± 0.007
0.552MetTyr: 0.552 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
3.173AsnAla: 3.173 ± 0.029
0.487AsnCys: 0.487 ± 0.011
2.034AsnAsp: 2.034 ± 0.019
2.024AsnGlu: 2.024 ± 0.022
1.46AsnPhe: 1.46 ± 0.019
3.119AsnGly: 3.119 ± 0.032
0.868AsnHis: 0.868 ± 0.012
2.239AsnIle: 2.239 ± 0.02
1.675AsnLys: 1.675 ± 0.015
3.373AsnLeu: 3.373 ± 0.025
0.9AsnMet: 0.9 ± 0.014
1.579AsnAsn: 1.579 ± 0.021
2.396AsnPro: 2.396 ± 0.022
1.383AsnGln: 1.383 ± 0.018
1.932AsnArg: 1.932 ± 0.019
2.824AsnSer: 2.824 ± 0.023
2.179AsnThr: 2.179 ± 0.023
2.326AsnVal: 2.326 ± 0.019
0.615AsnTrp: 0.615 ± 0.012
1.113AsnTyr: 1.113 ± 0.013
0.0AsnXaa: 0.0 ± 0.0
Pro
4.786ProAla: 4.786 ± 0.043
0.55ProCys: 0.55 ± 0.012
3.134ProAsp: 3.134 ± 0.028
3.631ProGlu: 3.631 ± 0.028
2.05ProPhe: 2.05 ± 0.019
3.733ProGly: 3.733 ± 0.035
1.223ProHis: 1.223 ± 0.016
2.573ProIle: 2.573 ± 0.024
2.634ProLys: 2.634 ± 0.022
4.764ProLeu: 4.764 ± 0.032
1.106ProMet: 1.106 ± 0.015
2.194ProAsn: 2.194 ± 0.022
4.613ProPro: 4.613 ± 0.066
2.327ProGln: 2.327 ± 0.03
3.12ProArg: 3.12 ± 0.029
5.472ProSer: 5.472 ± 0.042
3.616ProThr: 3.616 ± 0.029
3.377ProVal: 3.377 ± 0.03
0.758ProTrp: 0.758 ± 0.014
1.455ProTyr: 1.455 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.504GlnAla: 3.504 ± 0.034
0.468GlnCys: 0.468 ± 0.008
2.151GlnAsp: 2.151 ± 0.019
2.408GlnGlu: 2.408 ± 0.023
1.388GlnPhe: 1.388 ± 0.016
2.46GlnGly: 2.46 ± 0.023
1.053GlnHis: 1.053 ± 0.017
1.994GlnIle: 1.994 ± 0.022
1.971GlnLys: 1.971 ± 0.02
3.684GlnLeu: 3.684 ± 0.027
0.912GlnMet: 0.912 ± 0.014
1.526GlnAsn: 1.526 ± 0.019
2.456GlnPro: 2.456 ± 0.029
2.481GlnGln: 2.481 ± 0.046
2.593GlnArg: 2.593 ± 0.024
3.052GlnSer: 3.052 ± 0.03
2.279GlnThr: 2.279 ± 0.021
2.237GlnVal: 2.237 ± 0.025
0.609GlnTrp: 0.609 ± 0.01
1.189GlnTyr: 1.189 ± 0.014
0.0GlnXaa: 0.0 ± 0.0
Arg
4.408ArgAla: 4.408 ± 0.033
0.737ArgCys: 0.737 ± 0.013
3.342ArgAsp: 3.342 ± 0.026
3.638ArgGlu: 3.638 ± 0.035
2.138ArgPhe: 2.138 ± 0.02
3.578ArgGly: 3.578 ± 0.03
1.528ArgHis: 1.528 ± 0.017
3.017ArgIle: 3.017 ± 0.024
3.231ArgLys: 3.231 ± 0.025
5.538ArgLeu: 5.538 ± 0.036
1.286ArgMet: 1.286 ± 0.017
2.22ArgAsn: 2.22 ± 0.02
3.276ArgPro: 3.276 ± 0.03
2.596ArgGln: 2.596 ± 0.021
4.745ArgArg: 4.745 ± 0.041
4.422ArgSer: 4.422 ± 0.039
3.024ArgThr: 3.024 ± 0.025
3.254ArgVal: 3.254 ± 0.02
0.917ArgTrp: 0.917 ± 0.013
1.672ArgTyr: 1.672 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
6.462SerAla: 6.462 ± 0.043
0.938SerCys: 0.938 ± 0.014
4.177SerAsp: 4.177 ± 0.03
4.006SerGlu: 4.006 ± 0.03
3.095SerPhe: 3.095 ± 0.025
5.544SerGly: 5.544 ± 0.041
1.933SerHis: 1.933 ± 0.017
4.316SerIle: 4.316 ± 0.035
3.772SerLys: 3.772 ± 0.03
7.351SerLeu: 7.351 ± 0.041
1.738SerMet: 1.738 ± 0.021
3.057SerAsn: 3.057 ± 0.025
4.958SerPro: 4.958 ± 0.048
3.339SerGln: 3.339 ± 0.031
4.826SerArg: 4.826 ± 0.042
8.296SerSer: 8.296 ± 0.067
5.158SerThr: 5.158 ± 0.041
4.551SerVal: 4.551 ± 0.029
1.208SerTrp: 1.208 ± 0.014
2.128SerTyr: 2.128 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.186ThrAla: 5.186 ± 0.031
0.766ThrCys: 0.766 ± 0.013
2.78ThrAsp: 2.78 ± 0.024
3.055ThrGlu: 3.055 ± 0.024
2.226ThrPhe: 2.226 ± 0.019
4.057ThrGly: 4.057 ± 0.029
1.216ThrHis: 1.216 ± 0.015
3.268ThrIle: 3.268 ± 0.028
2.671ThrLys: 2.671 ± 0.02
5.19ThrLeu: 5.19 ± 0.029
1.216ThrMet: 1.216 ± 0.016
2.128ThrAsn: 2.128 ± 0.02
3.894ThrPro: 3.894 ± 0.035
2.011ThrGln: 2.011 ± 0.02
2.969ThrArg: 2.969 ± 0.024
4.933ThrSer: 4.933 ± 0.036
4.002ThrThr: 4.002 ± 0.046
3.672ThrVal: 3.672 ± 0.032
0.893ThrTrp: 0.893 ± 0.012
1.621ThrTyr: 1.621 ± 0.018
0.0ThrXaa: 0.0 ± 0.0
Val
5.356ValAla: 5.356 ± 0.037
0.845ValCys: 0.845 ± 0.014
3.749ValAsp: 3.749 ± 0.028
3.779ValGlu: 3.779 ± 0.032
2.499ValPhe: 2.499 ± 0.023
3.922ValGly: 3.922 ± 0.034
1.361ValHis: 1.361 ± 0.017
3.143ValIle: 3.143 ± 0.028
2.937ValLys: 2.937 ± 0.025
5.604ValLeu: 5.604 ± 0.037
1.322ValMet: 1.322 ± 0.016
2.323ValAsn: 2.323 ± 0.023
3.438ValPro: 3.438 ± 0.025
2.356ValGln: 2.356 ± 0.02
3.276ValArg: 3.276 ± 0.029
4.592ValSer: 4.592 ± 0.026
3.506ValThr: 3.506 ± 0.026
4.324ValVal: 4.324 ± 0.035
0.879ValTrp: 0.879 ± 0.013
1.732ValTyr: 1.732 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
1.245TrpAla: 1.245 ± 0.017
0.206TrpCys: 0.206 ± 0.006
0.987TrpAsp: 0.987 ± 0.015
0.878TrpGlu: 0.878 ± 0.013
0.575TrpPhe: 0.575 ± 0.011
0.966TrpGly: 0.966 ± 0.013
0.373TrpHis: 0.373 ± 0.009
0.851TrpIle: 0.851 ± 0.013
0.828TrpLys: 0.828 ± 0.011
1.424TrpLeu: 1.424 ± 0.019
0.378TrpMet: 0.378 ± 0.008
0.684TrpAsn: 0.684 ± 0.012
0.635TrpPro: 0.635 ± 0.013
0.611TrpGln: 0.611 ± 0.011
0.975TrpArg: 0.975 ± 0.013
1.084TrpSer: 1.084 ± 0.014
0.945TrpThr: 0.945 ± 0.012
0.923TrpVal: 0.923 ± 0.013
0.298TrpTrp: 0.298 ± 0.008
0.44TrpTyr: 0.44 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.196TyrAla: 2.196 ± 0.022
0.457TyrCys: 0.457 ± 0.01
1.711TyrAsp: 1.711 ± 0.018
1.578TyrGlu: 1.578 ± 0.02
1.239TyrPhe: 1.239 ± 0.016
2.139TyrGly: 2.139 ± 0.022
0.763TyrHis: 0.763 ± 0.011
1.495TyrIle: 1.495 ± 0.018
1.177TyrLys: 1.177 ± 0.016
2.717TyrLeu: 2.717 ± 0.022
0.656TyrMet: 0.656 ± 0.011
1.179TyrAsn: 1.179 ± 0.015
1.482TyrPro: 1.482 ± 0.02
1.109TyrGln: 1.109 ± 0.014
1.61TyrArg: 1.61 ± 0.016
2.073TyrSer: 2.073 ± 0.02
1.575TyrThr: 1.575 ± 0.016
1.655TyrVal: 1.655 ± 0.019
0.485TyrTrp: 0.485 ± 0.01
0.956TyrTyr: 0.956 ± 0.013
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12389 proteins (5835099 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski