Amino acid dipeptide frequency for Mulberry vein banding virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
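The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_frequencies`, the use of one-letter codes, the pooling of counts over all proteins, and the use of overlapping windows are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 441 dipeptides (hypothetical helper).

    Assumptions: counts are pooled over all proteins, windows overlap,
    and every non-standard symbol is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not real viral sequences.
toy = ["MKKLLS", "MSKLX"]
freqs = dipeptide_frequencies(toy)

# Under a uniform model each of the 400 standard dipeptides is expected at 2.5 per mille.
uniform_expected = 1000.0 / 400
```

With real data, the dictionary would be filled from the proteome's sequences and each value compared against `uniform_expected`.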

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
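As a small worked example of the unit conversion, the sketch below turns per-mille values into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and using the uniform value as the threshold for over- and underrepresentation is an assumption; the page does not state the exact rule behind the colouring. The two sample values are taken from the table.

```python
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values from the table: LeuSer (11.741) and AlaArg (0.398).
examples = {"LeuSer": 11.741, "AlaArg": 0.398}
labels = {name: classify(value) for name, value in examples.items()}
fractions = {name: permille_to_fraction(value) for name, value in examples.items()}
```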

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.791 ± 1.226
AlaCys: 1.592 ± 0.726
AlaAsp: 3.383 ± 0.312
AlaGlu: 2.786 ± 1.256
AlaPhe: 1.791 ± 0.651
AlaGly: 2.189 ± 0.642
AlaHis: 0.995 ± 0.381
AlaIle: 2.985 ± 0.38
AlaLys: 2.985 ± 1.764
AlaLeu: 4.577 ± 1.337
AlaMet: 1.791 ± 0.731
AlaAsn: 1.791 ± 0.401
AlaPro: 1.393 ± 0.714
AlaGln: 0.995 ± 0.278
AlaArg: 0.398 ± 0.793
AlaSer: 4.179 ± 1.467
AlaThr: 1.592 ± 1.124
AlaVal: 3.781 ± 0.9
AlaTrp: 0.398 ± 0.236
AlaTyr: 1.393 ± 0.584
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.393 ± 0.337
CysCys: 0.398 ± 0.236
CysAsp: 0.995 ± 0.82
CysGlu: 1.194 ± 0.422
CysPhe: 1.194 ± 0.575
CysGly: 1.393 ± 1.215
CysHis: 0.398 ± 0.357
CysIle: 2.189 ± 0.867
CysLys: 2.388 ± 0.912
CysLeu: 2.189 ± 0.518
CysMet: 0.597 ± 0.366
CysAsn: 0.796 ± 0.648
CysPro: 0.597 ± 0.354
CysGln: 0.0 ± 0.0
CysArg: 0.995 ± 0.381
CysSer: 1.592 ± 1.014
CysThr: 1.194 ± 1.121
CysVal: 1.592 ± 1.321
CysTrp: 0.199 ± 0.118
CysTyr: 0.597 ± 0.558
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.592 ± 0.322
AspCys: 2.189 ± 0.822
AspAsp: 3.781 ± 0.712
AspGlu: 2.388 ± 0.579
AspPhe: 4.179 ± 0.993
AspGly: 1.592 ± 0.712
AspHis: 0.398 ± 0.475
AspIle: 6.169 ± 1.472
AspLys: 5.373 ± 1.242
AspLeu: 5.572 ± 1.19
AspMet: 2.786 ± 0.598
AspAsn: 2.786 ± 0.364
AspPro: 1.592 ± 0.507
AspGln: 2.786 ± 0.733
AspArg: 2.388 ± 0.972
AspSer: 3.582 ± 1.038
AspThr: 1.99 ± 1.196
AspVal: 4.776 ± 1.339
AspTrp: 0.796 ± 0.487
AspTyr: 2.985 ± 0.625
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.985 ± 0.821
GluCys: 1.592 ± 0.322
GluAsp: 3.383 ± 0.554
GluGlu: 5.771 ± 0.411
GluPhe: 3.781 ± 0.542
GluGly: 2.388 ± 0.842
GluHis: 0.597 ± 0.211
GluIle: 5.572 ± 0.869
GluLys: 6.169 ± 1.45
GluLeu: 6.567 ± 1.962
GluMet: 2.189 ± 0.561
GluAsn: 4.378 ± 0.365
GluPro: 0.796 ± 0.382
GluGln: 1.393 ± 0.819
GluArg: 1.791 ± 0.769
GluSer: 6.368 ± 0.892
GluThr: 3.184 ± 0.836
GluVal: 3.184 ± 0.682
GluTrp: 0.398 ± 0.357
GluTyr: 2.786 ± 0.881
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.388 ± 0.997
PheCys: 1.592 ± 0.898
PheAsp: 2.388 ± 0.522
PheGlu: 2.587 ± 0.652
PhePhe: 2.189 ± 0.867
PheGly: 2.189 ± 0.588
PheHis: 0.796 ± 0.485
PheIle: 2.189 ± 0.562
PheLys: 5.97 ± 0.953
PheLeu: 5.771 ± 1.38
PheMet: 1.194 ± 0.34
PheAsn: 2.587 ± 0.689
PhePro: 2.388 ± 0.265
PheGln: 2.189 ± 0.63
PheArg: 1.791 ± 0.824
PheSer: 4.776 ± 1.201
PheThr: 1.592 ± 0.522
PheVal: 2.388 ± 0.59
PheTrp: 0.199 ± 0.238
PheTyr: 1.791 ± 0.535
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.592 ± 0.598
GlyCys: 1.592 ± 1.01
GlyAsp: 2.587 ± 0.618
GlyGlu: 2.786 ± 0.74
GlyPhe: 1.791 ± 0.805
GlyGly: 1.393 ± 0.819
GlyHis: 1.393 ± 0.4
GlyIle: 3.184 ± 1.003
GlyLys: 2.786 ± 1.01
GlyLeu: 4.378 ± 1.353
GlyMet: 0.796 ± 0.473
GlyAsn: 3.582 ± 0.615
GlyPro: 1.393 ± 0.35
GlyGln: 0.796 ± 0.39
GlyArg: 1.194 ± 0.493
GlySer: 2.786 ± 0.691
GlyThr: 1.592 ± 0.898
GlyVal: 2.388 ± 0.73
GlyTrp: 0.199 ± 0.118
GlyTyr: 1.99 ± 0.693
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.398 ± 0.357
HisCys: 0.398 ± 0.191
HisAsp: 1.99 ± 0.27
HisGlu: 0.597 ± 0.338
HisPhe: 2.189 ± 0.864
HisGly: 0.398 ± 0.236
HisHis: 0.398 ± 0.191
HisIle: 0.398 ± 0.357
HisLys: 0.597 ± 0.354
HisLeu: 1.791 ± 1.034
HisMet: 0.199 ± 0.118
HisAsn: 1.592 ± 0.357
HisPro: 0.796 ± 0.284
HisGln: 0.199 ± 0.118
HisArg: 0.199 ± 0.118
HisSer: 1.99 ± 0.435
HisThr: 1.791 ± 0.382
HisVal: 1.393 ± 0.374
HisTrp: 0.199 ± 0.118
HisTyr: 0.796 ± 0.463
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.184 ± 0.816
IleCys: 0.995 ± 0.884
IleAsp: 6.169 ± 1.003
IleGlu: 3.582 ± 0.822
IlePhe: 2.786 ± 0.639
IleGly: 2.189 ± 0.538
IleHis: 1.592 ± 0.71
IleIle: 2.786 ± 0.428
IleLys: 7.761 ± 1.194
IleLeu: 5.97 ± 0.597
IleMet: 2.786 ± 0.937
IleAsn: 5.771 ± 0.78
IlePro: 3.383 ± 0.597
IleGln: 3.383 ± 0.757
IleArg: 2.786 ± 0.473
IleSer: 5.97 ± 0.852
IleThr: 4.776 ± 1.24
IleVal: 4.776 ± 0.891
IleTrp: 0.597 ± 0.211
IleTyr: 3.383 ± 1.556
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.776 ± 1.16
LysCys: 1.592 ± 0.712
LysAsp: 3.98 ± 0.775
LysGlu: 6.567 ± 1.779
LysPhe: 4.577 ± 0.784
LysGly: 2.786 ± 0.743
LysHis: 1.393 ± 0.486
LysIle: 6.368 ± 0.698
LysLys: 8.358 ± 1.313
LysLeu: 8.159 ± 0.852
LysMet: 3.383 ± 0.58
LysAsn: 5.97 ± 1.665
LysPro: 1.791 ± 0.925
LysGln: 1.592 ± 1.375
LysArg: 3.383 ± 0.558
LysSer: 7.96 ± 1.131
LysThr: 7.96 ± 2.003
LysVal: 4.577 ± 1.814
LysTrp: 0.796 ± 0.284
LysTyr: 2.985 ± 0.952
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.975 ± 1.151
LeuCys: 0.796 ± 0.487
LeuAsp: 4.776 ± 0.959
LeuGlu: 6.368 ± 1.395
LeuPhe: 3.582 ± 0.641
LeuGly: 3.781 ± 1.133
LeuHis: 1.592 ± 0.71
LeuIle: 6.965 ± 0.861
LeuLys: 8.756 ± 1.888
LeuLeu: 6.965 ± 1.48
LeuMet: 4.179 ± 1.115
LeuAsn: 7.164 ± 0.685
LeuPro: 1.592 ± 0.357
LeuGln: 2.786 ± 0.777
LeuArg: 2.587 ± 0.335
LeuSer: 11.741 ± 2.162
LeuThr: 5.771 ± 1.505
LeuVal: 4.776 ± 2.123
LeuTrp: 0.398 ± 0.475
LeuTyr: 3.184 ± 1.158
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.398 ± 0.236
MetCys: 0.398 ± 0.374
MetAsp: 1.791 ± 0.522
MetGlu: 2.388 ± 0.487
MetPhe: 0.597 ± 0.211
MetGly: 1.393 ± 0.631
MetHis: 0.796 ± 0.849
MetIle: 2.786 ± 0.703
MetLys: 3.184 ± 1.158
MetLeu: 1.592 ± 0.75
MetMet: 1.791 ± 0.824
MetAsn: 2.587 ± 1.536
MetPro: 0.796 ± 0.491
MetGln: 0.796 ± 0.424
MetArg: 1.194 ± 0.709
MetSer: 4.776 ± 0.927
MetThr: 2.388 ± 0.809
MetVal: 2.786 ± 0.811
MetTrp: 0.0 ± 0.0
MetTyr: 1.592 ± 0.628
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.184 ± 1.067
AsnCys: 0.796 ± 0.382
AsnAsp: 3.98 ± 0.614
AsnGlu: 4.776 ± 0.792
AsnPhe: 4.577 ± 0.7
AsnGly: 2.587 ± 0.816
AsnHis: 0.796 ± 0.525
AsnIle: 5.771 ± 0.57
AsnLys: 3.383 ± 0.853
AsnLeu: 7.562 ± 1.649
AsnMet: 0.995 ± 0.591
AsnAsn: 2.388 ± 0.681
AsnPro: 1.791 ± 0.401
AsnGln: 3.184 ± 0.479
AsnArg: 1.791 ± 0.856
AsnSer: 5.97 ± 0.829
AsnThr: 2.587 ± 0.992
AsnVal: 3.383 ± 0.479
AsnTrp: 0.995 ± 0.796
AsnTyr: 2.786 ± 1.055
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.194 ± 0.726
ProCys: 0.199 ± 0.396
ProAsp: 1.393 ± 0.372
ProGlu: 1.791 ± 0.443
ProPhe: 1.393 ± 0.486
ProGly: 1.791 ± 1.347
ProHis: 0.0 ± 0.0
ProIle: 2.189 ± 0.732
ProLys: 3.383 ± 1.085
ProLeu: 2.189 ± 0.918
ProMet: 1.194 ± 0.365
ProAsn: 2.388 ± 0.969
ProPro: 0.597 ± 0.578
ProGln: 0.597 ± 0.211
ProArg: 0.398 ± 0.191
ProSer: 2.985 ± 0.362
ProThr: 1.791 ± 1.161
ProVal: 1.791 ± 0.652
ProTrp: 0.0 ± 0.0
ProTyr: 1.393 ± 0.486
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.796 ± 0.325
GlnCys: 0.597 ± 0.575
GlnAsp: 1.393 ± 0.596
GlnGlu: 1.99 ± 0.465
GlnPhe: 0.597 ± 0.578
GlnGly: 1.194 ± 0.527
GlnHis: 0.597 ± 0.355
GlnIle: 2.189 ± 0.695
GlnLys: 2.189 ± 0.731
GlnLeu: 2.587 ± 0.182
GlnMet: 1.592 ± 0.444
GlnAsn: 2.189 ± 0.282
GlnPro: 1.194 ± 0.449
GlnGln: 0.597 ± 0.713
GlnArg: 1.592 ± 0.71
GlnSer: 3.781 ± 0.941
GlnThr: 1.393 ± 0.486
GlnVal: 2.388 ± 1.04
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.597 ± 0.338
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.791 ± 0.988
ArgCys: 0.0 ± 0.0
ArgAsp: 1.99 ± 0.465
ArgGlu: 2.388 ± 0.534
ArgPhe: 1.194 ± 0.573
ArgGly: 1.194 ± 0.486
ArgHis: 1.194 ± 0.709
ArgIle: 2.388 ± 0.59
ArgLys: 2.985 ± 0.781
ArgLeu: 3.184 ± 0.951
ArgMet: 0.995 ± 0.444
ArgAsn: 2.388 ± 1.036
ArgPro: 0.597 ± 0.354
ArgGln: 1.791 ± 0.611
ArgArg: 0.796 ± 0.284
ArgSer: 1.592 ± 0.567
ArgThr: 2.388 ± 0.953
ArgVal: 1.393 ± 0.543
ArgTrp: 0.597 ± 0.354
ArgTyr: 1.592 ± 0.765
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.378 ± 1.259
SerCys: 1.393 ± 0.785
SerAsp: 6.169 ± 0.945
SerGlu: 6.766 ± 0.726
SerPhe: 4.378 ± 1.527
SerGly: 3.781 ± 0.814
SerHis: 1.592 ± 0.591
SerIle: 7.761 ± 1.24
SerLys: 8.159 ± 0.815
SerLeu: 9.552 ± 1.376
SerMet: 2.587 ± 0.771
SerAsn: 5.97 ± 1.592
SerPro: 2.587 ± 0.999
SerGln: 1.791 ± 0.805
SerArg: 4.378 ± 0.737
SerSer: 8.159 ± 1.114
SerThr: 5.373 ± 2.164
SerVal: 5.771 ± 1.307
SerTrp: 0.796 ± 0.765
SerTyr: 3.383 ± 0.503
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.791 ± 1.115
ThrCys: 2.587 ± 0.74
ThrAsp: 2.388 ± 0.851
ThrGlu: 4.577 ± 0.77
ThrPhe: 2.587 ± 0.533
ThrGly: 2.587 ± 0.687
ThrHis: 1.393 ± 0.435
ThrIle: 5.771 ± 1.035
ThrLys: 3.383 ± 1.131
ThrLeu: 3.98 ± 1.336
ThrMet: 0.995 ± 0.591
ThrAsn: 3.781 ± 1.133
ThrPro: 1.791 ± 0.845
ThrGln: 0.995 ± 0.601
ThrArg: 1.393 ± 0.59
ThrSer: 6.169 ± 0.734
ThrThr: 3.383 ± 0.46
ThrVal: 3.383 ± 1.577
ThrTrp: 0.398 ± 0.374
ThrTyr: 2.587 ± 0.663
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.786 ± 0.436
ValCys: 1.592 ± 0.472
ValAsp: 3.98 ± 1.458
ValGlu: 3.98 ± 0.934
ValPhe: 2.786 ± 0.473
ValGly: 2.985 ± 1.829
ValHis: 1.99 ± 0.672
ValIle: 3.383 ± 0.61
ValLys: 5.373 ± 2.48
ValLeu: 5.771 ± 1.215
ValMet: 1.791 ± 1.269
ValAsn: 2.189 ± 0.541
ValPro: 2.985 ± 1.288
ValGln: 1.592 ± 0.567
ValArg: 2.189 ± 0.583
ValSer: 5.97 ± 0.895
ValThr: 2.786 ± 1.358
ValVal: 2.189 ± 1.372
ValTrp: 0.597 ± 0.366
ValTyr: 2.388 ± 0.844
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.398 ± 0.191
TrpGlu: 0.199 ± 0.118
TrpPhe: 0.398 ± 0.191
TrpGly: 0.199 ± 0.238
TrpHis: 0.0 ± 0.0
TrpIle: 0.398 ± 0.475
TrpLys: 1.393 ± 0.316
TrpLeu: 0.796 ± 0.284
TrpMet: 0.398 ± 0.236
TrpAsn: 0.597 ± 0.366
TrpPro: 0.199 ± 0.238
TrpGln: 0.0 ± 0.0
TrpArg: 0.199 ± 0.396
TrpSer: 1.194 ± 0.365
TrpThr: 0.398 ± 0.34
TrpVal: 0.796 ± 0.872
TrpTrp: 0.199 ± 0.238
TrpTyr: 0.398 ± 0.422
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.393 ± 0.538
TyrCys: 1.592 ± 0.591
TyrAsp: 2.786 ± 0.454
TyrGlu: 1.791 ± 0.272
TyrPhe: 2.388 ± 0.849
TyrGly: 2.189 ± 0.993
TyrHis: 0.398 ± 0.236
TyrIle: 3.184 ± 0.971
TyrLys: 4.378 ± 1.728
TyrLeu: 3.582 ± 0.852
TyrMet: 1.592 ± 0.492
TyrAsn: 2.388 ± 0.976
TyrPro: 0.398 ± 0.191
TyrGln: 1.99 ± 1.184
TyrArg: 0.995 ± 0.381
TyrSer: 3.184 ± 0.855
TyrThr: 1.99 ± 0.435
TyrVal: 1.99 ± 0.465
TyrTrp: 0.398 ± 0.793
TyrTyr: 1.99 ± 0.482
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (5026 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
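A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The helper names, the pooled counting, and the use of the sample standard deviation as the reported error are assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, pooled over proteins (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not the real proteome.
toy_proteins = ["MKKLLSA", "MSKLLE", "MLSKKA", "MAKLS", "MKLSLL"]
error = bootstrap_error(toy_proteins, "KL")
```

With only five proteins, as in this proteome, each replicate can differ substantially from the original set, which is consistent with the relatively large errors reported in the table.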

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski