Amino acid dipeptide frequency for Yerba mate chlorosis-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
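The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function names (`dipeptide_counts`, `dipeptide_permille`, `classify`) are hypothetical, dipeptides are counted as overlapping pairs within each protein, any non-standard letter is mapped to X (Xaa), and frequencies are normalised by the total number of dipeptides counted. The database may normalise differently.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein (never across proteins)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_permille(proteins):
    """Return observed dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"    # would be marked red in the table
    if freq_permille < expected:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy = ["AAAC", "CA"]  # toy sequences, not taken from the proteome
    for dp, f in sorted(dipeptide_permille(toy).items()):
        print(dp, f, classify(f))
```

With the table values, `classify(13.702)` (LeuLeu) gives "over" and `classify(0.249)` (CysThr) gives "under".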

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
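As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1% equals 10 per mille)."""
    return value_permille / 10.0


if __name__ == "__main__":
    leu_leu = 13.702  # LeuLeu value from the table, in per mille
    print(permille_to_fraction(leu_leu), permille_to_percent(leu_leu))
```

The uniform expectation of 0.25% mentioned above corresponds to 2.5‰ on this scale.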

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.993 ± 0.962
AlaCys: 0.997 ± 0.382
AlaAsp: 2.242 ± 0.588
AlaGlu: 1.744 ± 0.233
AlaPhe: 1.246 ± 0.735
AlaGly: 2.491 ± 1.042
AlaHis: 0.747 ± 0.791
AlaIle: 2.74 ± 1.074
AlaLys: 2.74 ± 1.08
AlaLeu: 5.979 ± 1.343
AlaMet: 0.498 ± 0.3
AlaAsn: 0.997 ± 0.468
AlaPro: 1.744 ± 1.371
AlaGln: 1.246 ± 0.413
AlaArg: 2.491 ± 0.629
AlaSer: 5.232 ± 2.384
AlaThr: 1.495 ± 0.472
AlaVal: 2.491 ± 0.892
AlaTrp: 1.744 ± 0.565
AlaTyr: 2.74 ± 0.534
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.246 ± 0.447
CysGlu: 1.246 ± 0.284
CysPhe: 1.246 ± 0.298
CysGly: 0.997 ± 0.376
CysHis: 0.747 ± 0.456
CysIle: 0.997 ± 0.525
CysLys: 1.246 ± 0.484
CysLeu: 1.246 ± 0.67
CysMet: 0.997 ± 0.468
CysAsn: 1.495 ± 1.081
CysPro: 0.997 ± 0.582
CysGln: 0.997 ± 0.371
CysArg: 0.498 ± 0.234
CysSer: 1.744 ± 0.513
CysThr: 0.249 ± 0.337
CysVal: 0.747 ± 0.456
CysTrp: 0.498 ± 0.234
CysTyr: 1.744 ± 0.434
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.498 ± 0.3
AspCys: 0.997 ± 0.559
AspAsp: 3.737 ± 1.23
AspGlu: 4.235 ± 1.678
AspPhe: 2.491 ± 0.822
AspGly: 2.491 ± 0.412
AspHis: 2.491 ± 0.664
AspIle: 4.983 ± 1.473
AspLys: 4.235 ± 0.625
AspLeu: 4.733 ± 0.849
AspMet: 1.246 ± 0.718
AspAsn: 2.74 ± 0.721
AspPro: 4.235 ± 0.578
AspGln: 1.246 ± 0.524
AspArg: 1.993 ± 1.199
AspSer: 4.484 ± 0.664
AspThr: 2.491 ± 0.52
AspVal: 1.993 ± 0.43
AspTrp: 0.997 ± 0.468
AspTyr: 1.246 ± 0.499
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.744 ± 1.282
GluCys: 1.495 ± 0.795
GluAsp: 3.986 ± 1.002
GluGlu: 5.979 ± 2.827
GluPhe: 1.744 ± 0.399
GluGly: 2.99 ± 0.655
GluHis: 1.495 ± 0.63
GluIle: 5.73 ± 1.112
GluLys: 5.979 ± 1.375
GluLeu: 6.726 ± 1.468
GluMet: 2.491 ± 0.486
GluAsn: 3.986 ± 1.552
GluPro: 1.495 ± 0.64
GluGln: 1.744 ± 0.554
GluArg: 1.993 ± 0.325
GluSer: 4.733 ± 1.581
GluThr: 3.737 ± 1.512
GluVal: 3.239 ± 0.756
GluTrp: 0.747 ± 0.585
GluTyr: 2.242 ± 0.589
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.242 ± 0.902
PheCys: 1.495 ± 0.565
PheAsp: 2.99 ± 1.179
PheGlu: 2.242 ± 0.822
PhePhe: 1.744 ± 0.975
PheGly: 1.495 ± 0.565
PheHis: 0.997 ± 0.425
PheIle: 2.491 ± 0.863
PheLys: 3.986 ± 1.085
PheLeu: 6.228 ± 1.187
PheMet: 0.498 ± 0.306
PheAsn: 2.242 ± 0.589
PhePro: 2.74 ± 0.482
PheGln: 1.744 ± 0.79
PheArg: 1.246 ± 0.577
PheSer: 6.726 ± 1.003
PheThr: 0.498 ± 0.3
PheVal: 1.495 ± 0.422
PheTrp: 0.498 ± 0.3
PheTyr: 2.491 ± 0.66
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.242 ± 0.876
GlyCys: 0.498 ± 0.431
GlyAsp: 3.986 ± 0.684
GlyGlu: 2.99 ± 0.655
GlyPhe: 1.993 ± 0.304
GlyGly: 3.737 ± 0.811
GlyHis: 1.246 ± 0.602
GlyIle: 4.484 ± 1.483
GlyLys: 3.986 ± 0.666
GlyLeu: 5.979 ± 1.266
GlyMet: 0.498 ± 0.283
GlyAsn: 2.74 ± 0.612
GlyPro: 0.747 ± 0.321
GlyGln: 0.997 ± 0.423
GlyArg: 2.99 ± 0.62
GlySer: 4.733 ± 0.422
GlyThr: 1.495 ± 0.22
GlyVal: 2.74 ± 0.389
GlyTrp: 0.747 ± 0.45
GlyTyr: 1.744 ± 0.484
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.997 ± 0.271
HisCys: 0.498 ± 0.301
HisAsp: 1.744 ± 0.576
HisGlu: 0.747 ± 0.297
HisPhe: 1.495 ± 0.685
HisGly: 0.747 ± 0.297
HisHis: 0.997 ± 0.376
HisIle: 0.997 ± 0.468
HisLys: 1.495 ± 0.652
HisLeu: 3.488 ± 0.729
HisMet: 0.747 ± 0.326
HisAsn: 1.993 ± 0.757
HisPro: 0.997 ± 0.423
HisGln: 1.744 ± 0.773
HisArg: 1.246 ± 0.411
HisSer: 1.744 ± 0.565
HisThr: 1.246 ± 0.532
HisVal: 0.249 ± 0.284
HisTrp: 0.0 ± 0.0
HisTyr: 0.747 ± 0.272
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.491 ± 0.832
IleCys: 0.747 ± 0.499
IleAsp: 5.232 ± 1.436
IleGlu: 3.488 ± 0.875
IlePhe: 3.986 ± 1.179
IleGly: 4.484 ± 1.32
IleHis: 0.997 ± 0.448
IleIle: 6.228 ± 1.513
IleLys: 7.723 ± 1.609
IleLeu: 4.733 ± 0.828
IleMet: 2.491 ± 1.029
IleAsn: 3.986 ± 0.869
IlePro: 3.737 ± 1.458
IleGln: 2.99 ± 1.017
IleArg: 3.239 ± 0.689
IleSer: 7.972 ± 1.096
IleThr: 4.235 ± 1.859
IleVal: 4.733 ± 0.634
IleTrp: 0.249 ± 0.15
IleTyr: 1.744 ± 0.563
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.488 ± 1.547
LysCys: 1.246 ± 0.577
LysAsp: 2.99 ± 1.009
LysGlu: 7.225 ± 1.952
LysPhe: 3.488 ± 0.804
LysGly: 4.484 ± 0.343
LysHis: 1.744 ± 0.407
LysIle: 7.474 ± 0.717
LysLys: 7.474 ± 1.306
LysLeu: 7.474 ± 1.306
LysMet: 1.744 ± 0.584
LysAsn: 4.733 ± 0.98
LysPro: 2.242 ± 0.567
LysGln: 2.74 ± 0.73
LysArg: 2.242 ± 0.518
LysSer: 6.477 ± 1.698
LysThr: 6.228 ± 0.752
LysVal: 2.99 ± 0.553
LysTrp: 2.242 ± 0.811
LysTyr: 1.495 ± 0.421
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.73 ± 1.812
LeuCys: 2.242 ± 0.481
LeuAsp: 3.239 ± 1.057
LeuGlu: 6.228 ± 1.498
LeuPhe: 4.733 ± 1.535
LeuGly: 6.228 ± 0.663
LeuHis: 2.491 ± 0.703
LeuIle: 8.719 ± 1.323
LeuLys: 7.225 ± 1.324
LeuLeu: 13.702 ± 1.626
LeuMet: 4.484 ± 0.988
LeuAsn: 5.481 ± 1.594
LeuPro: 4.733 ± 1.017
LeuGln: 3.239 ± 1.03
LeuArg: 5.73 ± 1.234
LeuSer: 7.225 ± 0.748
LeuThr: 5.232 ± 1.64
LeuVal: 4.733 ± 0.944
LeuTrp: 1.744 ± 0.633
LeuTyr: 3.488 ± 1.133
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.495 ± 0.422
MetCys: 0.747 ± 0.463
MetAsp: 0.997 ± 0.425
MetGlu: 1.246 ± 0.431
MetPhe: 1.993 ± 0.796
MetGly: 0.997 ± 0.371
MetHis: 0.249 ± 0.326
MetIle: 1.495 ± 0.484
MetLys: 2.74 ± 0.844
MetLeu: 2.491 ± 1.21
MetMet: 0.747 ± 0.463
MetAsn: 0.498 ± 0.234
MetPro: 1.246 ± 0.532
MetGln: 0.249 ± 0.15
MetArg: 1.744 ± 0.565
MetSer: 1.993 ± 1.06
MetThr: 1.993 ± 0.636
MetVal: 1.993 ± 0.405
MetTrp: 0.498 ± 0.234
MetTyr: 0.997 ± 0.371
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.744 ± 0.565
AsnCys: 1.246 ± 0.298
AsnAsp: 1.246 ± 0.58
AsnGlu: 2.242 ± 0.39
AsnPhe: 2.74 ± 0.548
AsnGly: 1.993 ± 0.89
AsnHis: 1.495 ± 0.22
AsnIle: 4.484 ± 0.663
AsnLys: 4.733 ± 1.002
AsnLeu: 8.47 ± 0.511
AsnMet: 1.744 ± 0.751
AsnAsn: 3.737 ± 0.82
AsnPro: 2.74 ± 0.556
AsnGln: 2.491 ± 0.486
AsnArg: 1.246 ± 0.619
AsnSer: 1.744 ± 0.78
AsnThr: 2.99 ± 1.025
AsnVal: 2.491 ± 1.163
AsnTrp: 0.747 ± 0.45
AsnTyr: 2.74 ± 0.782
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.744 ± 0.731
ProCys: 0.249 ± 0.15
ProAsp: 2.242 ± 0.815
ProGlu: 3.737 ± 0.853
ProPhe: 2.99 ± 0.71
ProGly: 1.495 ± 0.659
ProHis: 0.997 ± 0.6
ProIle: 2.74 ± 0.514
ProLys: 3.488 ± 0.728
ProLeu: 6.726 ± 1.532
ProMet: 0.747 ± 0.548
ProAsn: 1.246 ± 0.368
ProPro: 1.744 ± 1.049
ProGln: 1.495 ± 0.484
ProArg: 1.993 ± 0.517
ProSer: 2.74 ± 0.804
ProThr: 1.993 ± 0.646
ProVal: 2.491 ± 0.993
ProTrp: 0.249 ± 0.337
ProTyr: 0.997 ± 0.423
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.498 ± 0.3
GlnCys: 0.747 ± 0.321
GlnAsp: 2.242 ± 0.62
GlnGlu: 2.491 ± 0.913
GlnPhe: 2.242 ± 1.064
GlnGly: 1.744 ± 0.368
GlnHis: 0.747 ± 0.444
GlnIle: 2.491 ± 0.496
GlnLys: 2.242 ± 0.822
GlnLeu: 3.986 ± 1.151
GlnMet: 0.498 ± 0.3
GlnAsn: 1.993 ± 0.979
GlnPro: 0.747 ± 0.272
GlnGln: 1.246 ± 0.449
GlnArg: 0.747 ± 0.45
GlnSer: 2.74 ± 1.06
GlnThr: 1.246 ± 1.237
GlnVal: 3.488 ± 0.968
GlnTrp: 0.498 ± 0.3
GlnTyr: 0.997 ± 0.371
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.242 ± 0.736
ArgCys: 0.997 ± 0.371
ArgAsp: 3.239 ± 0.744
ArgGlu: 3.239 ± 0.784
ArgPhe: 0.498 ± 0.357
ArgGly: 3.239 ± 0.857
ArgHis: 0.0 ± 0.0
ArgIle: 1.993 ± 0.918
ArgLys: 1.993 ± 0.485
ArgLeu: 2.74 ± 0.957
ArgMet: 0.997 ± 0.382
ArgAsn: 1.993 ± 0.631
ArgPro: 0.997 ± 0.371
ArgGln: 2.242 ± 0.909
ArgArg: 1.993 ± 1.199
ArgSer: 2.74 ± 0.877
ArgThr: 1.744 ± 1.049
ArgVal: 3.488 ± 1.273
ArgTrp: 0.747 ± 0.272
ArgTyr: 2.74 ± 0.638
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.235 ± 1.413
SerCys: 0.997 ± 0.898
SerAsp: 4.733 ± 1.373
SerGlu: 5.232 ± 1.405
SerPhe: 4.983 ± 1.233
SerGly: 2.74 ± 1.089
SerHis: 2.242 ± 1.003
SerIle: 6.228 ± 1.296
SerLys: 6.228 ± 1.589
SerLeu: 7.972 ± 1.253
SerMet: 2.74 ± 0.707
SerAsn: 4.733 ± 2.005
SerPro: 3.986 ± 0.462
SerGln: 2.242 ± 0.65
SerArg: 2.74 ± 0.722
SerSer: 7.474 ± 2.473
SerThr: 4.484 ± 0.665
SerVal: 2.74 ± 0.753
SerTrp: 1.744 ± 0.86
SerTyr: 4.983 ± 0.999
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.239 ± 0.874
ThrCys: 0.747 ± 0.363
ThrAsp: 2.74 ± 0.683
ThrGlu: 4.484 ± 1.608
ThrPhe: 1.993 ± 0.673
ThrGly: 3.737 ± 1.07
ThrHis: 0.747 ± 0.41
ThrIle: 3.737 ± 1.187
ThrLys: 4.484 ± 1.0
ThrLeu: 5.73 ± 1.077
ThrMet: 0.997 ± 0.375
ThrAsn: 2.491 ± 0.814
ThrPro: 1.744 ± 0.543
ThrGln: 1.993 ± 1.314
ThrArg: 1.246 ± 0.356
ThrSer: 3.737 ± 1.371
ThrThr: 2.99 ± 1.072
ThrVal: 2.491 ± 1.004
ThrTrp: 1.744 ± 0.602
ThrTyr: 1.744 ± 0.481
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.737 ± 0.597
ValCys: 1.495 ± 0.543
ValAsp: 3.239 ± 0.475
ValGlu: 1.744 ± 0.67
ValPhe: 1.495 ± 0.56
ValGly: 1.246 ± 0.58
ValHis: 1.993 ± 0.696
ValIle: 3.488 ± 0.626
ValLys: 4.733 ± 0.644
ValLeu: 3.986 ± 0.64
ValMet: 0.249 ± 0.15
ValAsn: 2.99 ± 0.726
ValPro: 2.242 ± 0.694
ValGln: 1.744 ± 0.484
ValArg: 2.491 ± 0.969
ValSer: 4.484 ± 0.951
ValThr: 5.232 ± 2.094
ValVal: 2.242 ± 0.73
ValTrp: 0.498 ± 0.431
ValTyr: 0.747 ± 0.463
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.997 ± 0.778
TrpCys: 0.249 ± 0.15
TrpAsp: 0.997 ± 0.371
TrpGlu: 0.747 ± 0.413
TrpPhe: 0.997 ± 0.403
TrpGly: 0.747 ± 0.456
TrpHis: 0.0 ± 0.0
TrpIle: 1.993 ± 0.91
TrpLys: 1.246 ± 0.524
TrpLeu: 1.246 ± 0.296
TrpMet: 0.498 ± 0.3
TrpAsn: 1.246 ± 0.496
TrpPro: 0.249 ± 0.326
TrpGln: 0.0 ± 0.0
TrpArg: 1.495 ± 0.669
TrpSer: 0.249 ± 0.15
TrpThr: 1.246 ± 0.496
TrpVal: 1.744 ± 1.634
TrpTrp: 0.249 ± 0.284
TrpTyr: 0.747 ± 0.387
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.242 ± 0.514
TyrCys: 1.246 ± 0.539
TyrAsp: 0.498 ± 0.3
TyrGlu: 2.99 ± 0.826
TyrPhe: 2.242 ± 0.584
TyrGly: 2.491 ± 0.916
TyrHis: 1.495 ± 0.643
TyrIle: 1.993 ± 0.523
TyrLys: 2.242 ± 0.774
TyrLeu: 2.99 ± 0.982
TyrMet: 1.246 ± 0.496
TyrAsn: 1.744 ± 0.58
TyrPro: 2.74 ± 1.358
TyrGln: 1.246 ± 0.464
TyrArg: 0.498 ± 0.234
TyrSer: 4.484 ± 0.561
TyrThr: 1.993 ± 0.984
TyrVal: 1.495 ± 0.55
TyrTrp: 0.498 ± 0.3
TyrTyr: 1.993 ± 0.775
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4015 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
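A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration only, under several assumptions that the note does not confirm: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 resamples, and the reported error is taken to be the sample standard deviation of those estimates. The function names `permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    # Assumption: the error is the standard deviation across resamples.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKLLSA", "MLLKKE", "MSSAILL"]  # toy sequences, not the real proteome
    print(permille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only 7 proteins the estimated errors can be large relative to the values themselves.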

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski