Amino acid dipeptide frequency for Cowpea golden mosaic virus-[Nigeria]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
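As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the one-letter codes, and the toy sequences are assumptions made for this example; the page does not describe its exact counting procedure (for instance, how protein boundaries are handled).

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption of this sketch: pairs are counted within each sequence only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two hypothetical sequences in one-letter code.
freqs = dipeptide_permille(["MKVLA", "MKRR"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (AlaAla, AlaCys, ...); mapping one-letter to three-letter codes is left out to keep the sketch short.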

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
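The expected value can be worked out directly. The sketch below assumes a uniform expectation over all ordered pairs, as described above; the names `expected_permille` and `classify` are hypothetical, and the exact rule the page uses for its red and blue marking is not stated beyond "more than expected" and "underrepresented".

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


EXPECTED_PERMILLE = expected_permille(20)  # 2.5 per mille, i.e. 0.25 %


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Values taken from the table on this page.
print(classify(12.015))  # ArgArg
print(classify(0.0))     # AlaMet
print(round(expected_permille(21), 3))  # expectation if Xaa is counted too
```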

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
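For readers who prefer fractions or percentages, the conversion is a single multiplication. This is a minimal sketch; the helper names are made up for the example.

```python
import math


def permille_to_fraction(value_permille):
    """Per mille to a plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0


# AlaAla is listed at 4.621 per mille in the table.
fraction = permille_to_fraction(4.621)
percent = permille_to_percent(4.621)
print(fraction, percent)
assert math.isclose(fraction, 0.004621)
assert math.isclose(percent, 0.4621)
```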

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency (‰) ± error, grouped by first amino acid in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.621 ± 2.649
AlaCys: 1.848 ± 1.115
AlaAsp: 0.924 ± 0.831
AlaGlu: 2.773 ± 1.488
AlaPhe: 2.773 ± 1.282
AlaGly: 0.924 ± 1.024
AlaHis: 2.773 ± 1.933
AlaIle: 3.697 ± 1.539
AlaLys: 3.697 ± 1.485
AlaLeu: 7.394 ± 1.817
AlaMet: 0.0 ± 0.0
AlaAsn: 2.773 ± 1.409
AlaPro: 1.848 ± 1.276
AlaGln: 2.773 ± 1.928
AlaArg: 5.545 ± 1.921
AlaSer: 4.621 ± 3.107
AlaThr: 3.697 ± 2.299
AlaVal: 1.848 ± 1.381
AlaTrp: 1.848 ± 0.835
AlaTyr: 1.848 ± 1.041
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.924 ± 0.951
CysCys: 0.924 ± 0.831
CysAsp: 2.773 ± 2.095
CysGlu: 0.924 ± 0.831
CysPhe: 0.924 ± 1.315
CysGly: 1.848 ± 0.988
CysHis: 0.0 ± 0.0
CysIle: 0.924 ± 1.006
CysLys: 0.924 ± 0.831
CysLeu: 0.0 ± 0.0
CysMet: 0.924 ± 1.024
CysAsn: 1.848 ± 0.988
CysPro: 0.924 ± 0.69
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 5.545 ± 1.76
CysThr: 0.924 ± 1.315
CysVal: 0.924 ± 1.024
CysTrp: 0.924 ± 1.006
CysTyr: 0.924 ± 1.315
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.848 ± 1.381
AspCys: 0.924 ± 1.315
AspAsp: 2.773 ± 0.853
AspGlu: 2.773 ± 1.282
AspPhe: 0.924 ± 0.69
AspGly: 2.773 ± 2.071
AspHis: 0.924 ± 1.006
AspIle: 1.848 ± 0.988
AspLys: 0.924 ± 0.69
AspLeu: 7.394 ± 2.954
AspMet: 0.0 ± 0.0
AspAsn: 3.697 ± 2.079
AspPro: 1.848 ± 1.041
AspGln: 0.0 ± 0.0
AspArg: 4.621 ± 1.439
AspSer: 3.697 ± 2.117
AspThr: 2.773 ± 1.517
AspVal: 8.318 ± 2.255
AspTrp: 1.848 ± 1.381
AspTyr: 0.924 ± 0.951
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.773 ± 1.087
GluCys: 0.924 ± 0.951
GluAsp: 3.697 ± 2.552
GluGlu: 2.773 ± 1.477
GluPhe: 2.773 ± 1.423
GluGly: 3.697 ± 1.67
GluHis: 0.0 ± 0.0
GluIle: 1.848 ± 2.012
GluLys: 1.848 ± 1.029
GluLeu: 3.697 ± 1.985
GluMet: 0.0 ± 0.0
GluAsn: 1.848 ± 1.662
GluPro: 7.394 ± 2.873
GluGln: 1.848 ± 1.387
GluArg: 0.0 ± 0.0
GluSer: 5.545 ± 2.987
GluThr: 1.848 ± 2.629
GluVal: 1.848 ± 1.657
GluTrp: 1.848 ± 1.442
GluTyr: 1.848 ± 1.662
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.924 ± 0.69
PheCys: 0.0 ± 0.0
PheAsp: 3.697 ± 1.409
PheGlu: 2.773 ± 1.087
PhePhe: 1.848 ± 1.662
PheGly: 0.924 ± 0.831
PheHis: 3.697 ± 0.96
PheIle: 1.848 ± 1.276
PheLys: 2.773 ± 1.928
PheLeu: 4.621 ± 1.089
PheMet: 0.924 ± 0.69
PheAsn: 4.621 ± 1.639
PhePro: 0.924 ± 0.69
PheGln: 5.545 ± 2.213
PheArg: 3.697 ± 1.318
PheSer: 0.924 ± 0.69
PheThr: 1.848 ± 1.06
PheVal: 1.848 ± 1.381
PheTrp: 0.0 ± 0.0
PheTyr: 0.924 ± 0.831
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.47 ± 2.101
GlyCys: 1.848 ± 1.06
GlyAsp: 2.773 ± 1.812
GlyGlu: 5.545 ± 1.433
GlyPhe: 0.924 ± 0.951
GlyGly: 2.773 ± 1.287
GlyHis: 0.924 ± 0.69
GlyIle: 3.697 ± 1.018
GlyLys: 4.621 ± 2.057
GlyLeu: 3.697 ± 2.103
GlyMet: 0.924 ± 0.831
GlyAsn: 2.773 ± 1.409
GlyPro: 1.848 ± 0.835
GlyGln: 1.848 ± 0.835
GlyArg: 1.848 ± 0.835
GlySer: 2.773 ± 1.087
GlyThr: 1.848 ± 1.115
GlyVal: 1.848 ± 1.442
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.848 ± 1.662
HisCys: 0.924 ± 0.951
HisAsp: 2.773 ± 1.087
HisGlu: 4.621 ± 1.639
HisPhe: 0.924 ± 0.69
HisGly: 3.697 ± 2.116
HisHis: 2.773 ± 1.356
HisIle: 0.924 ± 1.024
HisLys: 0.924 ± 1.006
HisLeu: 2.773 ± 1.423
HisMet: 0.0 ± 0.0
HisAsn: 2.773 ± 1.452
HisPro: 1.848 ± 1.029
HisGln: 0.0 ± 0.0
HisArg: 3.697 ± 2.121
HisSer: 0.0 ± 0.0
HisThr: 2.773 ± 1.902
HisVal: 1.848 ± 1.041
HisTrp: 0.0 ± 0.0
HisTyr: 1.848 ± 1.381
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.924 ± 0.951
IleCys: 0.924 ± 1.315
IleAsp: 3.697 ± 1.539
IleGlu: 2.773 ± 1.415
IlePhe: 7.394 ± 1.765
IleGly: 0.924 ± 0.831
IleHis: 1.848 ± 2.012
IleIle: 0.924 ± 1.006
IleLys: 6.47 ± 2.135
IleLeu: 0.924 ± 1.006
IleMet: 0.924 ± 0.69
IleAsn: 3.697 ± 1.182
IlePro: 2.773 ± 1.415
IleGln: 3.697 ± 2.552
IleArg: 2.773 ± 1.675
IleSer: 5.545 ± 2.479
IleThr: 3.697 ± 1.018
IleVal: 3.697 ± 1.182
IleTrp: 2.773 ± 1.689
IleTyr: 2.773 ± 1.689
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.697 ± 0.96
LysCys: 1.848 ± 1.442
LysAsp: 3.697 ± 2.761
LysGlu: 4.621 ± 1.768
LysPhe: 0.924 ± 0.69
LysGly: 0.924 ± 0.69
LysHis: 1.848 ± 0.835
LysIle: 0.924 ± 0.831
LysLys: 2.773 ± 1.812
LysLeu: 1.848 ± 0.835
LysMet: 0.0 ± 0.0
LysAsn: 4.621 ± 2.057
LysPro: 2.773 ± 1.287
LysGln: 2.773 ± 0.853
LysArg: 4.621 ± 3.422
LysSer: 5.545 ± 3.261
LysThr: 2.773 ± 0.919
LysVal: 6.47 ± 0.974
LysTrp: 0.0 ± 0.0
LysTyr: 2.773 ± 1.122
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.545 ± 2.348
LeuCys: 1.848 ± 1.381
LeuAsp: 5.545 ± 2.367
LeuGlu: 5.545 ± 2.542
LeuPhe: 2.773 ± 0.919
LeuGly: 5.545 ± 1.768
LeuHis: 2.773 ± 2.071
LeuIle: 0.924 ± 1.315
LeuLys: 4.621 ± 1.768
LeuLeu: 4.621 ± 2.923
LeuMet: 2.773 ± 1.626
LeuAsn: 2.773 ± 1.415
LeuPro: 3.697 ± 1.582
LeuGln: 3.697 ± 1.349
LeuArg: 6.47 ± 3.881
LeuSer: 2.773 ± 2.498
LeuThr: 8.318 ± 3.565
LeuVal: 2.773 ± 1.517
LeuTrp: 0.924 ± 0.831
LeuTyr: 3.697 ± 1.018
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.924 ± 0.831
MetCys: 0.924 ± 1.024
MetAsp: 1.848 ± 1.115
MetGlu: 1.848 ± 2.629
MetPhe: 1.848 ± 1.281
MetGly: 4.621 ± 1.292
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.848 ± 0.835
MetLeu: 2.773 ± 1.469
MetMet: 0.924 ± 0.831
MetAsn: 0.924 ± 0.831
MetPro: 0.924 ± 0.69
MetGln: 0.924 ± 0.951
MetArg: 1.848 ± 1.115
MetSer: 0.924 ± 1.006
MetThr: 0.924 ± 1.006
MetVal: 0.0 ± 0.0
MetTrp: 0.924 ± 0.69
MetTyr: 0.924 ± 0.831
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.621 ± 1.705
AsnCys: 0.0 ± 0.0
AsnAsp: 0.924 ± 0.69
AsnGlu: 0.924 ± 0.831
AsnPhe: 0.0 ± 0.0
AsnGly: 1.848 ± 1.442
AsnHis: 4.621 ± 2.346
AsnIle: 7.394 ± 2.466
AsnLys: 3.697 ± 1.889
AsnLeu: 3.697 ± 2.26
AsnMet: 1.848 ± 1.616
AsnAsn: 2.773 ± 1.575
AsnPro: 3.697 ± 1.295
AsnGln: 1.848 ± 1.387
AsnArg: 3.697 ± 1.569
AsnSer: 9.242 ± 3.778
AsnThr: 2.773 ± 1.122
AsnVal: 2.773 ± 2.071
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.924 ± 0.69
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.773 ± 2.165
ProCys: 2.773 ± 1.686
ProAsp: 1.848 ± 0.835
ProGlu: 0.0 ± 0.0
ProPhe: 1.848 ± 1.041
ProGly: 0.924 ± 0.69
ProHis: 2.773 ± 1.423
ProIle: 3.697 ± 1.56
ProLys: 4.621 ± 1.985
ProLeu: 3.697 ± 2.058
ProMet: 1.848 ± 1.662
ProAsn: 2.773 ± 1.452
ProPro: 0.924 ± 0.951
ProGln: 4.621 ± 2.207
ProArg: 5.545 ± 1.955
ProSer: 4.621 ± 2.056
ProThr: 5.545 ± 2.457
ProVal: 3.697 ± 1.914
ProTrp: 0.0 ± 0.0
ProTyr: 1.848 ± 0.835
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.773 ± 1.336
GlnCys: 0.924 ± 0.831
GlnAsp: 0.924 ± 1.315
GlnGlu: 3.697 ± 1.914
GlnPhe: 1.848 ± 1.041
GlnGly: 2.773 ± 1.282
GlnHis: 0.924 ± 0.69
GlnIle: 7.394 ± 3.17
GlnLys: 0.924 ± 1.024
GlnLeu: 0.924 ± 1.315
GlnMet: 0.924 ± 0.951
GlnAsn: 2.773 ± 1.535
GlnPro: 2.773 ± 2.248
GlnGln: 3.697 ± 1.345
GlnArg: 0.0 ± 0.0
GlnSer: 3.697 ± 1.133
GlnThr: 2.773 ± 1.535
GlnVal: 1.848 ± 1.115
GlnTrp: 0.924 ± 0.69
GlnTyr: 0.924 ± 0.831
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.621 ± 2.691
ArgCys: 0.924 ± 0.69
ArgAsp: 1.848 ± 1.662
ArgGlu: 1.848 ± 1.276
ArgPhe: 4.621 ± 3.197
ArgGly: 2.773 ± 0.853
ArgHis: 2.773 ± 1.469
ArgIle: 2.773 ± 0.853
ArgLys: 1.848 ± 1.115
ArgLeu: 7.394 ± 1.593
ArgMet: 1.848 ± 1.577
ArgAsn: 1.848 ± 1.06
ArgPro: 3.697 ± 1.67
ArgGln: 0.924 ± 0.69
ArgArg: 12.015 ± 5.359
ArgSer: 10.166 ± 1.88
ArgThr: 2.773 ± 1.122
ArgVal: 5.545 ± 1.331
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.848 ± 1.286
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.697 ± 1.133
SerCys: 2.773 ± 1.933
SerAsp: 2.773 ± 0.853
SerGlu: 0.0 ± 0.0
SerPhe: 4.621 ± 2.057
SerGly: 3.697 ± 1.889
SerHis: 2.773 ± 1.611
SerIle: 4.621 ± 1.779
SerLys: 4.621 ± 1.679
SerLeu: 7.394 ± 2.198
SerMet: 2.773 ± 1.53
SerAsn: 7.394 ± 2.814
SerPro: 5.545 ± 1.584
SerGln: 3.697 ± 1.696
SerArg: 5.545 ± 3.697
SerSer: 9.242 ± 4.095
SerThr: 6.47 ± 3.708
SerVal: 1.848 ± 1.281
SerTrp: 0.924 ± 0.69
SerTyr: 2.773 ± 1.415
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.773 ± 0.919
ThrCys: 1.848 ± 1.442
ThrAsp: 0.924 ± 1.315
ThrGlu: 2.773 ± 2.294
ThrPhe: 3.697 ± 2.079
ThrGly: 3.697 ± 1.409
ThrHis: 1.848 ± 1.06
ThrIle: 7.394 ± 2.818
ThrLys: 1.848 ± 1.381
ThrLeu: 4.621 ± 5.399
ThrMet: 0.924 ± 0.69
ThrAsn: 2.773 ± 1.517
ThrPro: 7.394 ± 3.608
ThrGln: 2.773 ± 1.322
ThrArg: 0.924 ± 0.831
ThrSer: 1.848 ± 1.577
ThrThr: 1.848 ± 1.387
ThrVal: 3.697 ± 2.14
ThrTrp: 0.924 ± 1.006
ThrTyr: 2.773 ± 1.477
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.924 ± 0.69
ValCys: 0.0 ± 0.0
ValAsp: 5.545 ± 1.809
ValGlu: 0.0 ± 0.0
ValPhe: 1.848 ± 1.041
ValGly: 0.924 ± 0.831
ValHis: 1.848 ± 1.427
ValIle: 6.47 ± 2.399
ValLys: 4.621 ± 2.303
ValLeu: 4.621 ± 1.908
ValMet: 3.697 ± 2.902
ValAsn: 1.848 ± 1.381
ValPro: 4.621 ± 1.095
ValGln: 1.848 ± 1.06
ValArg: 4.621 ± 2.056
ValSer: 2.773 ± 1.164
ValThr: 2.773 ± 1.902
ValVal: 1.848 ± 0.835
ValTrp: 0.0 ± 0.0
ValTyr: 2.773 ± 1.517
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.773 ± 2.071
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.924 ± 0.831
TrpPhe: 0.0 ± 0.0
TrpGly: 1.848 ± 0.835
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 1.848 ± 1.115
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.924 ± 0.69
TrpArg: 2.773 ± 1.356
TrpSer: 0.0 ± 0.0
TrpThr: 0.924 ± 1.006
TrpVal: 0.924 ± 1.006
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.924 ± 0.69
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.773 ± 1.517
TyrCys: 0.924 ± 1.315
TyrAsp: 1.848 ± 1.281
TyrGlu: 0.924 ± 1.006
TyrPhe: 1.848 ± 1.115
TyrGly: 1.848 ± 0.835
TyrHis: 1.848 ± 1.029
TyrIle: 1.848 ± 1.041
TyrLys: 1.848 ± 0.835
TyrLeu: 6.47 ± 2.876
TyrMet: 1.848 ± 1.055
TyrAsn: 2.773 ± 1.164
TyrPro: 0.924 ± 0.69
TyrGln: 0.0 ± 0.0
TyrArg: 1.848 ± 1.662
TyrSer: 3.697 ± 1.889
TyrThr: 0.0 ± 0.0
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.924 ± 0.951
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1083 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
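A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is only an illustration under assumptions: the helper names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all choices made for this example, not details confirmed by the page.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_permille(sequences):
    """Frequencies in per mille over overlapping pairs within each sequence."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the set size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return stdev(estimates)  # assumption: error = sample standard deviation


# Toy proteome: three hypothetical sequences in one-letter code.
toy = ["MKVLA", "MKRR", "AAKR"]
print(bootstrap_error(toy, "MK"))
print(bootstrap_error(toy, "WW"))  # a pair that never occurs
```

A dipeptide that is absent from every protein has zero frequency in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.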

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski