Amino acid dipeptide frequency for Colletotrichum fructicola chrysovirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
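As a minimal sketch of how such frequencies could be computed (not necessarily how Proteome-pI computes them), the Python snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalize by the total number of counted dipeptides are assumptions made for illustration only.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs are counted within each protein only, never across protein
    boundaries. Any non-standard letter is mapped to 'X' (Xaa).
    Normalizing by the total pair count is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the proteome.
freqs = dipeptide_frequencies(["AALA", "ALB"])
```

Here the two toy sequences yield five dipeptides in total (AA, AL, LA, AL, LX), so `freqs["AL"]` is 400‰ and the others are 200‰ each.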

There are 400 possibilities (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
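The uniform expectation and the over/under-representation labels can be illustrated with a short Python sketch. The helper names `uniform_expectation` and `classify` are hypothetical, and the rule used here (a plain comparison against the uniform expectation) is an assumption; the page does not state the exact criterion behind the red and blue marking. The three example values are taken from the table (AlaAla, AlaCys, CysCys).

```python
def uniform_expectation(n_symbols=20):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / (n_symbols ** 2)

def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed simple rule)."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expectation(20)  # 1000 / 400 = 2.5 per mille, i.e. 0.25%

# Values in per mille, copied from the table.
observed = {"AlaAla": 11.13, "AlaCys": 2.605, "CysCys": 0.237}
labels = {name: classify(value, expected) for name, value in observed.items()}
```

Note that a uniform expectation ignores the amino acid composition of the proteome: a dipeptide built from two abundant residues, such as AlaAla, may exceed 2.5‰ simply because alanine is common.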

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.13AlaAla: 11.13 ± 1.44
2.605AlaCys: 2.605 ± 0.635
4.499AlaAsp: 4.499 ± 1.139
7.104AlaGlu: 7.104 ± 1.282
1.894AlaPhe: 1.894 ± 0.774
7.814AlaGly: 7.814 ± 0.78
1.421AlaHis: 1.421 ± 1.025
3.789AlaIle: 3.789 ± 0.916
3.078AlaLys: 3.078 ± 0.752
11.13AlaLeu: 11.13 ± 1.616
3.789AlaMet: 3.789 ± 1.37
2.605AlaAsn: 2.605 ± 0.681
5.683AlaPro: 5.683 ± 1.626
5.21AlaGln: 5.21 ± 0.984
8.525AlaArg: 8.525 ± 0.988
8.051AlaSer: 8.051 ± 1.999
5.446AlaThr: 5.446 ± 0.644
7.578AlaVal: 7.578 ± 2.112
1.421AlaTrp: 1.421 ± 0.498
2.842AlaTyr: 2.842 ± 0.64
0.0AlaXaa: 0.0 ± 0.0
Cys
1.894CysAla: 1.894 ± 0.698
0.237CysCys: 0.237 ± 0.217
0.947CysAsp: 0.947 ± 0.358
0.947CysGlu: 0.947 ± 0.408
0.947CysPhe: 0.947 ± 0.508
2.605CysGly: 2.605 ± 1.219
0.71CysHis: 0.71 ± 0.378
0.474CysIle: 0.474 ± 0.323
0.0CysLys: 0.0 ± 0.0
0.947CysLeu: 0.947 ± 0.425
0.71CysMet: 0.71 ± 0.631
0.71CysAsn: 0.71 ± 0.539
0.947CysPro: 0.947 ± 0.702
0.474CysGln: 0.474 ± 0.359
1.421CysArg: 1.421 ± 0.463
1.658CysSer: 1.658 ± 0.874
0.237CysThr: 0.237 ± 0.217
1.421CysVal: 1.421 ± 0.78
0.237CysTrp: 0.237 ± 0.327
0.71CysTyr: 0.71 ± 0.491
0.0CysXaa: 0.0 ± 0.0
Asp
5.446AspAla: 5.446 ± 1.181
0.474AspCys: 0.474 ± 0.248
4.499AspAsp: 4.499 ± 0.967
2.131AspGlu: 2.131 ± 0.963
2.131AspPhe: 2.131 ± 0.821
5.21AspGly: 5.21 ± 0.941
0.947AspHis: 0.947 ± 0.484
0.71AspIle: 0.71 ± 0.357
1.894AspLys: 1.894 ± 1.142
2.842AspLeu: 2.842 ± 0.582
0.474AspMet: 0.474 ± 0.259
1.658AspAsn: 1.658 ± 1.023
1.658AspPro: 1.658 ± 0.725
2.368AspGln: 2.368 ± 1.286
1.894AspArg: 1.894 ± 0.678
3.315AspSer: 3.315 ± 0.866
2.131AspThr: 2.131 ± 0.33
7.341AspVal: 7.341 ± 1.659
0.0AspTrp: 0.0 ± 0.0
2.368AspTyr: 2.368 ± 0.506
0.0AspXaa: 0.0 ± 0.0
Glu
5.446GluAla: 5.446 ± 1.806
0.71GluCys: 0.71 ± 0.491
2.842GluAsp: 2.842 ± 0.845
4.026GluGlu: 4.026 ± 1.472
2.368GluPhe: 2.368 ± 0.99
3.552GluGly: 3.552 ± 1.097
1.658GluHis: 1.658 ± 0.644
1.421GluIle: 1.421 ± 0.46
1.421GluLys: 1.421 ± 0.731
8.288GluLeu: 8.288 ± 1.414
0.947GluMet: 0.947 ± 0.509
1.184GluAsn: 1.184 ± 0.535
3.078GluPro: 3.078 ± 0.854
2.605GluGln: 2.605 ± 0.569
3.789GluArg: 3.789 ± 0.737
3.552GluSer: 3.552 ± 0.616
2.605GluThr: 2.605 ± 0.328
5.446GluVal: 5.446 ± 0.873
1.184GluTrp: 1.184 ± 0.36
1.184GluTyr: 1.184 ± 0.535
0.0GluXaa: 0.0 ± 0.0
Phe
3.789PheAla: 3.789 ± 1.396
0.474PheCys: 0.474 ± 0.336
1.894PheAsp: 1.894 ± 0.499
1.421PheGlu: 1.421 ± 0.574
2.131PhePhe: 2.131 ± 0.986
1.658PheGly: 1.658 ± 0.75
0.237PheHis: 0.237 ± 0.242
1.421PheIle: 1.421 ± 0.749
1.184PheLys: 1.184 ± 0.551
2.605PheLeu: 2.605 ± 0.809
1.421PheMet: 1.421 ± 0.827
0.947PheAsn: 0.947 ± 0.439
1.184PhePro: 1.184 ± 0.622
0.237PheGln: 0.237 ± 0.18
2.842PheArg: 2.842 ± 0.997
2.605PheSer: 2.605 ± 1.091
2.368PheThr: 2.368 ± 0.779
3.078PheVal: 3.078 ± 1.44
0.947PheTrp: 0.947 ± 0.532
0.474PheTyr: 0.474 ± 0.351
0.0PheXaa: 0.0 ± 0.0
Gly
9.472GlyAla: 9.472 ± 2.244
1.894GlyCys: 1.894 ± 0.615
4.736GlyAsp: 4.736 ± 1.007
3.789GlyGlu: 3.789 ± 1.149
2.842GlyPhe: 2.842 ± 0.863
8.525GlyGly: 8.525 ± 1.392
2.131GlyHis: 2.131 ± 0.631
1.894GlyIle: 1.894 ± 0.433
2.368GlyLys: 2.368 ± 0.594
7.814GlyLeu: 7.814 ± 0.822
2.131GlyMet: 2.131 ± 0.907
0.947GlyAsn: 0.947 ± 0.599
5.21GlyPro: 5.21 ± 1.431
3.315GlyGln: 3.315 ± 1.229
5.21GlyArg: 5.21 ± 0.683
6.63GlySer: 6.63 ± 1.44
5.21GlyThr: 5.21 ± 1.032
6.157GlyVal: 6.157 ± 1.263
0.947GlyTrp: 0.947 ± 0.417
1.894GlyTyr: 1.894 ± 0.445
0.0GlyXaa: 0.0 ± 0.0
His
2.368HisAla: 2.368 ± 0.581
0.947HisCys: 0.947 ± 0.35
0.947HisAsp: 0.947 ± 0.35
1.421HisGlu: 1.421 ± 0.659
0.947HisPhe: 0.947 ± 0.274
1.658HisGly: 1.658 ± 0.627
1.421HisHis: 1.421 ± 0.437
1.184HisIle: 1.184 ± 0.535
0.474HisLys: 0.474 ± 0.286
1.421HisLeu: 1.421 ± 0.364
0.947HisMet: 0.947 ± 0.358
0.474HisAsn: 0.474 ± 0.248
1.184HisPro: 1.184 ± 0.754
0.947HisGln: 0.947 ± 0.282
1.421HisArg: 1.421 ± 0.397
2.131HisSer: 2.131 ± 0.888
2.368HisThr: 2.368 ± 0.615
2.842HisVal: 2.842 ± 0.557
0.947HisTrp: 0.947 ± 0.282
0.71HisTyr: 0.71 ± 0.378
0.0HisXaa: 0.0 ± 0.0
Ile
2.131IleAla: 2.131 ± 0.716
0.71IleCys: 0.71 ± 0.35
1.184IleAsp: 1.184 ± 0.36
1.894IleGlu: 1.894 ± 0.445
0.71IlePhe: 0.71 ± 0.361
2.605IleGly: 2.605 ± 0.967
1.658IleHis: 1.658 ± 0.594
0.947IleIle: 0.947 ± 0.305
0.947IleLys: 0.947 ± 0.305
1.894IleLeu: 1.894 ± 0.919
0.474IleMet: 0.474 ± 0.304
0.947IleAsn: 0.947 ± 0.383
1.894IlePro: 1.894 ± 0.561
0.474IleGln: 0.474 ± 0.359
1.421IleArg: 1.421 ± 0.437
1.894IleSer: 1.894 ± 0.743
2.842IleThr: 2.842 ± 0.725
3.078IleVal: 3.078 ± 1.159
0.0IleTrp: 0.0 ± 0.0
0.71IleTyr: 0.71 ± 0.282
0.0IleXaa: 0.0 ± 0.0
Lys
3.315LysAla: 3.315 ± 1.068
0.71LysCys: 0.71 ± 0.383
1.421LysAsp: 1.421 ± 0.583
1.894LysGlu: 1.894 ± 0.454
1.894LysPhe: 1.894 ± 0.7
0.474LysGly: 0.474 ± 0.259
1.421LysHis: 1.421 ± 0.493
1.184LysIle: 1.184 ± 0.667
1.184LysLys: 1.184 ± 0.474
2.368LysLeu: 2.368 ± 0.665
0.474LysMet: 0.474 ± 0.359
0.237LysAsn: 0.237 ± 0.242
2.368LysPro: 2.368 ± 1.084
0.71LysGln: 0.71 ± 0.335
1.421LysArg: 1.421 ± 0.614
1.894LysSer: 1.894 ± 0.772
2.368LysThr: 2.368 ± 0.71
2.605LysVal: 2.605 ± 0.754
0.71LysTrp: 0.71 ± 0.532
0.71LysTyr: 0.71 ± 0.327
0.0LysXaa: 0.0 ± 0.0
Leu
9.472LeuAla: 9.472 ± 1.476
2.131LeuCys: 2.131 ± 0.903
5.446LeuAsp: 5.446 ± 1.22
5.92LeuGlu: 5.92 ± 1.346
1.184LeuPhe: 1.184 ± 0.488
10.893LeuGly: 10.893 ± 1.269
2.368LeuHis: 2.368 ± 1.043
2.842LeuIle: 2.842 ± 1.054
3.315LeuLys: 3.315 ± 0.995
8.051LeuLeu: 8.051 ± 1.247
0.947LeuMet: 0.947 ± 0.635
4.026LeuAsn: 4.026 ± 1.004
5.92LeuPro: 5.92 ± 0.809
4.262LeuGln: 4.262 ± 0.679
6.867LeuArg: 6.867 ± 1.041
7.814LeuSer: 7.814 ± 1.042
4.262LeuThr: 4.262 ± 1.059
5.446LeuVal: 5.446 ± 1.237
0.71LeuTrp: 0.71 ± 0.282
2.605LeuTyr: 2.605 ± 0.896
0.0LeuXaa: 0.0 ± 0.0
Met
3.078MetAla: 3.078 ± 1.015
0.0MetCys: 0.0 ± 0.0
1.421MetAsp: 1.421 ± 0.425
0.947MetGlu: 0.947 ± 0.495
0.0MetPhe: 0.0 ± 0.0
1.658MetGly: 1.658 ± 0.86
0.947MetHis: 0.947 ± 0.416
0.237MetIle: 0.237 ± 0.18
0.474MetLys: 0.474 ± 0.359
4.026MetLeu: 4.026 ± 1.486
0.237MetMet: 0.237 ± 0.21
0.237MetAsn: 0.237 ± 0.242
0.0MetPro: 0.0 ± 0.0
1.658MetGln: 1.658 ± 0.469
1.658MetArg: 1.658 ± 0.59
1.894MetSer: 1.894 ± 1.07
1.421MetThr: 1.421 ± 0.479
1.184MetVal: 1.184 ± 0.36
0.237MetTrp: 0.237 ± 0.21
1.894MetTyr: 1.894 ± 0.358
0.0MetXaa: 0.0 ± 0.0
Asn
1.894AsnAla: 1.894 ± 0.347
0.947AsnCys: 0.947 ± 0.37
0.474AsnAsp: 0.474 ± 0.279
1.184AsnGlu: 1.184 ± 0.301
1.894AsnPhe: 1.894 ± 0.784
1.184AsnGly: 1.184 ± 0.444
0.947AsnHis: 0.947 ± 0.27
1.184AsnIle: 1.184 ± 0.632
0.947AsnLys: 0.947 ± 0.492
1.658AsnLeu: 1.658 ± 0.545
0.71AsnMet: 0.71 ± 0.374
0.71AsnAsn: 0.71 ± 0.219
0.947AsnPro: 0.947 ± 0.358
0.71AsnGln: 0.71 ± 0.282
2.131AsnArg: 2.131 ± 0.749
2.605AsnSer: 2.605 ± 0.743
1.894AsnThr: 1.894 ± 0.411
2.131AsnVal: 2.131 ± 0.518
0.237AsnTrp: 0.237 ± 0.21
0.71AsnTyr: 0.71 ± 0.361
0.0AsnXaa: 0.0 ± 0.0
Pro
4.499ProAla: 4.499 ± 1.568
1.421ProCys: 1.421 ± 0.983
3.552ProAsp: 3.552 ± 0.631
3.552ProGlu: 3.552 ± 0.609
0.947ProPhe: 0.947 ± 0.274
5.21ProGly: 5.21 ± 0.794
1.421ProHis: 1.421 ± 0.625
1.421ProIle: 1.421 ± 0.777
2.605ProLys: 2.605 ± 0.787
3.552ProLeu: 3.552 ± 0.824
0.237ProMet: 0.237 ± 0.21
1.894ProAsn: 1.894 ± 0.214
2.131ProPro: 2.131 ± 0.745
2.605ProGln: 2.605 ± 1.147
2.842ProArg: 2.842 ± 0.885
4.026ProSer: 4.026 ± 1.069
4.262ProThr: 4.262 ± 1.166
3.789ProVal: 3.789 ± 0.905
0.71ProTrp: 0.71 ± 0.382
1.421ProTyr: 1.421 ± 0.614
0.0ProXaa: 0.0 ± 0.0
Gln
4.499GlnAla: 4.499 ± 0.81
0.71GlnCys: 0.71 ± 0.219
0.71GlnAsp: 0.71 ± 0.357
2.605GlnGlu: 2.605 ± 0.404
1.421GlnPhe: 1.421 ± 0.405
1.421GlnGly: 1.421 ± 0.314
1.421GlnHis: 1.421 ± 0.475
0.474GlnIle: 0.474 ± 0.483
0.474GlnLys: 0.474 ± 0.338
5.446GlnLeu: 5.446 ± 1.018
0.474GlnMet: 0.474 ± 0.206
0.474GlnAsn: 0.474 ± 0.279
2.605GlnPro: 2.605 ± 0.837
2.842GlnGln: 2.842 ± 0.767
2.368GlnArg: 2.368 ± 0.581
2.842GlnSer: 2.842 ± 0.866
2.368GlnThr: 2.368 ± 0.624
1.894GlnVal: 1.894 ± 0.544
0.947GlnTrp: 0.947 ± 0.35
1.421GlnTyr: 1.421 ± 0.459
0.0GlnXaa: 0.0 ± 0.0
Arg
7.814ArgAla: 7.814 ± 2.765
1.894ArgCys: 1.894 ± 0.665
2.605ArgAsp: 2.605 ± 1.132
4.973ArgGlu: 4.973 ± 1.307
3.078ArgPhe: 3.078 ± 0.775
6.157ArgGly: 6.157 ± 1.555
2.131ArgHis: 2.131 ± 0.602
2.131ArgIle: 2.131 ± 0.938
1.894ArgLys: 1.894 ± 0.749
7.578ArgLeu: 7.578 ± 1.004
1.658ArgMet: 1.658 ± 0.431
2.131ArgAsn: 2.131 ± 1.016
2.605ArgPro: 2.605 ± 0.824
1.658ArgGln: 1.658 ± 0.238
6.867ArgArg: 6.867 ± 2.763
3.789ArgSer: 3.789 ± 1.257
4.973ArgThr: 4.973 ± 1.503
7.578ArgVal: 7.578 ± 1.49
0.71ArgTrp: 0.71 ± 0.42
2.131ArgTyr: 2.131 ± 0.631
0.0ArgXaa: 0.0 ± 0.0
Ser
7.578SerAla: 7.578 ± 0.81
0.947SerCys: 0.947 ± 0.733
2.605SerAsp: 2.605 ± 1.081
6.157SerGlu: 6.157 ± 1.597
2.842SerPhe: 2.842 ± 1.361
8.998SerGly: 8.998 ± 1.669
1.658SerHis: 1.658 ± 0.681
1.658SerIle: 1.658 ± 0.893
2.131SerLys: 2.131 ± 0.797
8.998SerLeu: 8.998 ± 1.356
2.368SerMet: 2.368 ± 0.871
1.421SerAsn: 1.421 ± 0.312
4.736SerPro: 4.736 ± 1.162
1.658SerGln: 1.658 ± 0.742
5.21SerArg: 5.21 ± 1.314
6.867SerSer: 6.867 ± 1.977
1.658SerThr: 1.658 ± 0.316
6.63SerVal: 6.63 ± 1.64
0.71SerTrp: 0.71 ± 0.349
1.658SerTyr: 1.658 ± 0.619
0.0SerXaa: 0.0 ± 0.0
Thr
5.21ThrAla: 5.21 ± 0.659
0.237ThrCys: 0.237 ± 0.217
2.842ThrAsp: 2.842 ± 0.838
1.894ThrGlu: 1.894 ± 0.592
3.078ThrPhe: 3.078 ± 0.837
2.368ThrGly: 2.368 ± 0.429
1.658ThrHis: 1.658 ± 0.725
1.894ThrIle: 1.894 ± 0.819
1.894ThrLys: 1.894 ± 0.597
4.499ThrLeu: 4.499 ± 1.705
1.894ThrMet: 1.894 ± 0.683
1.421ThrAsn: 1.421 ± 0.551
2.842ThrPro: 2.842 ± 0.613
2.131ThrGln: 2.131 ± 0.775
5.92ThrArg: 5.92 ± 1.029
4.973ThrSer: 4.973 ± 0.633
3.078ThrThr: 3.078 ± 1.287
7.578ThrVal: 7.578 ± 0.973
0.237ThrTrp: 0.237 ± 0.18
1.658ThrTyr: 1.658 ± 0.527
0.0ThrXaa: 0.0 ± 0.0
Val
11.366ValAla: 11.366 ± 0.917
0.947ValCys: 0.947 ± 0.455
5.21ValAsp: 5.21 ± 1.433
4.499ValGlu: 4.499 ± 0.603
1.184ValPhe: 1.184 ± 0.481
6.867ValGly: 6.867 ± 1.224
1.658ValHis: 1.658 ± 0.684
2.605ValIle: 2.605 ± 0.735
2.131ValLys: 2.131 ± 0.618
6.63ValLeu: 6.63 ± 1.336
1.184ValMet: 1.184 ± 0.47
2.131ValAsn: 2.131 ± 0.57
5.683ValPro: 5.683 ± 0.825
1.894ValGln: 1.894 ± 0.569
9.235ValArg: 9.235 ± 2.262
6.394ValSer: 6.394 ± 1.28
5.446ValThr: 5.446 ± 1.34
6.394ValVal: 6.394 ± 0.946
0.947ValTrp: 0.947 ± 0.508
3.315ValTyr: 3.315 ± 0.67
0.0ValXaa: 0.0 ± 0.0
Trp
1.894TrpAla: 1.894 ± 0.55
0.237TrpCys: 0.237 ± 0.298
0.0TrpAsp: 0.0 ± 0.0
0.71TrpGlu: 0.71 ± 0.3
0.474TrpPhe: 0.474 ± 0.266
1.421TrpGly: 1.421 ± 0.456
0.474TrpHis: 0.474 ± 0.391
0.474TrpIle: 0.474 ± 0.371
0.237TrpLys: 0.237 ± 0.18
1.658TrpLeu: 1.658 ± 0.475
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.474TrpGln: 0.474 ± 0.421
1.184TrpArg: 1.184 ± 0.312
0.947TrpSer: 0.947 ± 0.355
0.71TrpThr: 0.71 ± 0.374
0.474TrpVal: 0.474 ± 0.359
0.474TrpTrp: 0.474 ± 0.279
0.474TrpTyr: 0.474 ± 0.279
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.315TyrAla: 3.315 ± 0.579
0.0TyrCys: 0.0 ± 0.0
1.658TyrAsp: 1.658 ± 0.86
0.237TyrGlu: 0.237 ± 0.18
0.947TyrPhe: 0.947 ± 0.305
3.078TyrGly: 3.078 ± 1.065
0.474TyrHis: 0.474 ± 0.246
0.474TyrIle: 0.474 ± 0.359
0.71TyrLys: 0.71 ± 0.374
3.078TyrLeu: 3.078 ± 0.599
1.894TyrMet: 1.894 ± 0.817
0.947TyrAsn: 0.947 ± 0.703
1.421TyrPro: 1.421 ± 0.731
1.184TyrGln: 1.184 ± 0.45
2.131TyrArg: 2.131 ± 1.201
2.605TyrSer: 2.605 ± 0.65
1.658TyrThr: 1.658 ± 0.345
3.078TyrVal: 3.078 ± 0.522
0.0TyrTrp: 0.0 ± 0.0
1.421TyrTyr: 1.421 ± 0.368
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4224 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
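A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is only an illustration under stated assumptions: the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are not taken from the source, which specifies only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Std. dev. of a dipeptide's per-mille frequency over protein resamples.

    Each round draws len(proteins) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from the proteome.
toy = ["AAAA", "ACAC", "CCCA"]
err = bootstrap_error(toy, "AA")
```

With only 7 proteins in this proteome, each resample is dominated by a few sequences, which is consistent with the relatively large errors listed next to many of the frequencies.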

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski