Amino acid dipepetide frequency for Pteromalus puparum negative-strand RNA virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.624AlaAla: 5.624 ± 1.89
0.256AlaCys: 0.256 ± 0.132
4.09AlaAsp: 4.09 ± 0.513
3.834AlaGlu: 3.834 ± 1.099
2.045AlaPhe: 2.045 ± 0.812
5.112AlaGly: 5.112 ± 0.915
0.511AlaHis: 0.511 ± 0.263
5.624AlaIle: 5.624 ± 0.678
1.278AlaLys: 1.278 ± 1.002
7.157AlaLeu: 7.157 ± 1.992
2.045AlaMet: 2.045 ± 1.209
2.301AlaAsn: 2.301 ± 0.545
3.834AlaPro: 3.834 ± 1.411
3.067AlaGln: 3.067 ± 1.035
2.556AlaArg: 2.556 ± 1.533
5.112AlaSer: 5.112 ± 1.841
3.834AlaThr: 3.834 ± 0.755
4.346AlaVal: 4.346 ± 0.54
2.045AlaTrp: 2.045 ± 0.477
1.789AlaTyr: 1.789 ± 0.385
0.0AlaXaa: 0.0 ± 0.0
Cys
1.278CysAla: 1.278 ± 0.705
0.256CysCys: 0.256 ± 0.132
0.511CysAsp: 0.511 ± 0.237
0.767CysGlu: 0.767 ± 0.395
0.767CysPhe: 0.767 ± 0.259
0.767CysGly: 0.767 ± 0.264
0.256CysHis: 0.256 ± 0.132
1.022CysIle: 1.022 ± 0.473
0.767CysLys: 0.767 ± 0.264
1.534CysLeu: 1.534 ± 0.579
0.0CysMet: 0.0 ± 0.0
0.767CysAsn: 0.767 ± 0.264
0.767CysPro: 0.767 ± 0.316
0.511CysGln: 0.511 ± 0.263
1.534CysArg: 1.534 ± 0.81
0.767CysSer: 0.767 ± 0.264
0.767CysThr: 0.767 ± 0.601
0.256CysVal: 0.256 ± 0.132
0.511CysTrp: 0.511 ± 0.263
1.278CysTyr: 1.278 ± 0.323
0.0CysXaa: 0.0 ± 0.0
Asp
3.067AspAla: 3.067 ± 1.167
1.022AspCys: 1.022 ± 0.473
2.045AspAsp: 2.045 ± 1.031
3.067AspGlu: 3.067 ± 1.99
2.301AspPhe: 2.301 ± 0.689
1.789AspGly: 1.789 ± 0.43
2.045AspHis: 2.045 ± 0.72
3.323AspIle: 3.323 ± 0.577
3.579AspLys: 3.579 ± 0.852
6.902AspLeu: 6.902 ± 1.813
0.767AspMet: 0.767 ± 0.395
1.789AspAsn: 1.789 ± 0.712
4.346AspPro: 4.346 ± 1.27
2.556AspGln: 2.556 ± 0.816
3.067AspArg: 3.067 ± 0.743
2.556AspSer: 2.556 ± 1.465
2.301AspThr: 2.301 ± 0.847
2.301AspVal: 2.301 ± 0.49
0.511AspTrp: 0.511 ± 0.229
0.511AspTyr: 0.511 ± 0.229
0.0AspXaa: 0.0 ± 0.0
Glu
4.09GluAla: 4.09 ± 0.895
1.789GluCys: 1.789 ± 0.715
2.301GluAsp: 2.301 ± 1.028
5.879GluGlu: 5.879 ± 2.467
1.534GluPhe: 1.534 ± 0.565
2.045GluGly: 2.045 ± 0.509
1.789GluHis: 1.789 ± 0.591
4.09GluIle: 4.09 ± 1.439
3.323GluLys: 3.323 ± 2.154
6.646GluLeu: 6.646 ± 1.013
1.022GluMet: 1.022 ± 0.759
2.556GluAsn: 2.556 ± 0.732
1.534GluPro: 1.534 ± 0.901
5.879GluGln: 5.879 ± 1.543
1.534GluArg: 1.534 ± 0.706
4.09GluSer: 4.09 ± 1.015
2.812GluThr: 2.812 ± 0.758
3.323GluVal: 3.323 ± 1.565
0.767GluTrp: 0.767 ± 0.351
2.301GluTyr: 2.301 ± 0.552
0.0GluXaa: 0.0 ± 0.0
Phe
1.022PheAla: 1.022 ± 0.526
1.022PheCys: 1.022 ± 0.242
1.278PheAsp: 1.278 ± 0.264
1.022PheGlu: 1.022 ± 0.309
0.767PhePhe: 0.767 ± 0.395
2.045PheGly: 2.045 ± 0.812
0.256PheHis: 0.256 ± 0.132
2.045PheIle: 2.045 ± 0.498
3.323PheLys: 3.323 ± 0.839
2.812PheLeu: 2.812 ± 0.75
1.022PheMet: 1.022 ± 0.341
1.022PheAsn: 1.022 ± 0.526
1.278PhePro: 1.278 ± 0.79
2.556PheGln: 2.556 ± 0.524
2.301PheArg: 2.301 ± 0.82
2.301PheSer: 2.301 ± 0.679
3.067PheThr: 3.067 ± 0.78
2.045PheVal: 2.045 ± 1.053
0.0PheTrp: 0.0 ± 0.0
1.534PheTyr: 1.534 ± 0.381
0.0PheXaa: 0.0 ± 0.0
Gly
5.879GlyAla: 5.879 ± 1.033
0.256GlyCys: 0.256 ± 0.132
2.301GlyAsp: 2.301 ± 0.845
4.346GlyGlu: 4.346 ± 0.937
2.045GlyPhe: 2.045 ± 0.986
4.09GlyGly: 4.09 ± 0.727
1.789GlyHis: 1.789 ± 0.761
2.045GlyIle: 2.045 ± 0.915
1.789GlyLys: 1.789 ± 1.591
6.135GlyLeu: 6.135 ± 1.152
3.067GlyMet: 3.067 ± 0.519
1.534GlyAsn: 1.534 ± 0.726
3.323GlyPro: 3.323 ± 1.183
2.301GlyGln: 2.301 ± 0.575
2.301GlyArg: 2.301 ± 0.475
4.09GlySer: 4.09 ± 1.078
3.834GlyThr: 3.834 ± 1.358
5.624GlyVal: 5.624 ± 1.103
1.278GlyTrp: 1.278 ± 0.449
1.278GlyTyr: 1.278 ± 0.449
0.0GlyXaa: 0.0 ± 0.0
His
0.767HisAla: 0.767 ± 0.395
0.767HisCys: 0.767 ± 0.395
1.789HisAsp: 1.789 ± 0.654
1.022HisGlu: 1.022 ± 0.716
1.534HisPhe: 1.534 ± 0.564
0.511HisGly: 0.511 ± 0.263
0.767HisHis: 0.767 ± 0.395
2.301HisIle: 2.301 ± 0.86
0.511HisLys: 0.511 ± 0.237
2.045HisLeu: 2.045 ± 0.467
0.511HisMet: 0.511 ± 0.253
0.511HisAsn: 0.511 ± 0.263
1.789HisPro: 1.789 ± 0.784
1.534HisGln: 1.534 ± 0.565
2.045HisArg: 2.045 ± 1.053
1.022HisSer: 1.022 ± 0.341
1.789HisThr: 1.789 ± 0.443
0.511HisVal: 0.511 ± 0.229
0.256HisTrp: 0.256 ± 0.132
1.278HisTyr: 1.278 ± 0.319
0.0HisXaa: 0.0 ± 0.0
Ile
4.346IleAla: 4.346 ± 1.23
0.767IleCys: 0.767 ± 0.259
2.301IleAsp: 2.301 ± 0.353
2.812IleGlu: 2.812 ± 0.566
1.278IlePhe: 1.278 ± 0.484
3.323IleGly: 3.323 ± 1.082
1.022IleHis: 1.022 ± 0.242
2.045IleIle: 2.045 ± 0.688
3.579IleLys: 3.579 ± 1.041
8.691IleLeu: 8.691 ± 1.631
1.022IleMet: 1.022 ± 0.242
2.556IleAsn: 2.556 ± 0.809
2.556IlePro: 2.556 ± 0.645
3.067IleGln: 3.067 ± 1.363
4.346IleArg: 4.346 ± 1.464
6.391IleSer: 6.391 ± 0.653
5.879IleThr: 5.879 ± 0.586
3.579IleVal: 3.579 ± 0.744
1.022IleTrp: 1.022 ± 0.379
1.534IleTyr: 1.534 ± 0.421
0.0IleXaa: 0.0 ± 0.0
Lys
3.067LysAla: 3.067 ± 0.926
1.278LysCys: 1.278 ± 0.471
2.045LysAsp: 2.045 ± 0.782
3.067LysGlu: 3.067 ± 1.337
1.789LysPhe: 1.789 ± 0.566
4.346LysGly: 4.346 ± 1.216
1.022LysHis: 1.022 ± 0.344
4.857LysIle: 4.857 ± 1.135
2.301LysLys: 2.301 ± 2.032
6.135LysLeu: 6.135 ± 1.115
1.022LysMet: 1.022 ± 0.639
2.045LysAsn: 2.045 ± 0.623
1.278LysPro: 1.278 ± 0.602
1.789LysGln: 1.789 ± 0.599
3.067LysArg: 3.067 ± 0.787
3.834LysSer: 3.834 ± 0.794
2.045LysThr: 2.045 ± 0.509
1.789LysVal: 1.789 ± 0.695
0.0LysTrp: 0.0 ± 0.0
1.022LysTyr: 1.022 ± 0.344
0.0LysXaa: 0.0 ± 0.0
Leu
7.669LeuAla: 7.669 ± 1.201
1.278LeuCys: 1.278 ± 0.447
5.368LeuAsp: 5.368 ± 1.632
8.436LeuGlu: 8.436 ± 1.671
3.834LeuPhe: 3.834 ± 0.999
7.157LeuGly: 7.157 ± 0.883
2.301LeuHis: 2.301 ± 0.689
6.135LeuIle: 6.135 ± 0.664
5.368LeuLys: 5.368 ± 1.923
15.337LeuLeu: 15.337 ± 2.64
1.789LeuMet: 1.789 ± 0.627
3.834LeuAsn: 3.834 ± 0.68
6.902LeuPro: 6.902 ± 1.259
4.857LeuGln: 4.857 ± 1.166
6.646LeuArg: 6.646 ± 1.827
7.924LeuSer: 7.924 ± 1.704
10.225LeuThr: 10.225 ± 0.88
9.714LeuVal: 9.714 ± 0.923
1.022LeuTrp: 1.022 ± 0.314
1.789LeuTyr: 1.789 ± 0.418
0.0LeuXaa: 0.0 ± 0.0
Met
1.022MetAla: 1.022 ± 0.341
0.256MetCys: 0.256 ± 0.277
2.556MetAsp: 2.556 ± 0.545
1.789MetGlu: 1.789 ± 0.385
1.022MetPhe: 1.022 ± 0.458
1.278MetGly: 1.278 ± 0.293
0.511MetHis: 0.511 ± 0.263
1.534MetIle: 1.534 ± 0.81
0.256MetLys: 0.256 ± 0.132
1.022MetLeu: 1.022 ± 0.344
1.022MetMet: 1.022 ± 0.458
1.022MetAsn: 1.022 ± 0.341
0.767MetPro: 0.767 ± 0.482
0.767MetGln: 0.767 ± 0.395
0.511MetArg: 0.511 ± 0.263
2.045MetSer: 2.045 ± 0.729
2.045MetThr: 2.045 ± 0.682
1.534MetVal: 1.534 ± 0.564
0.256MetTrp: 0.256 ± 0.132
1.278MetTyr: 1.278 ± 0.484
0.0MetXaa: 0.0 ± 0.0
Asn
2.556AsnAla: 2.556 ± 1.448
0.767AsnCys: 0.767 ± 0.259
1.789AsnAsp: 1.789 ± 0.784
1.789AsnGlu: 1.789 ± 0.443
0.767AsnPhe: 0.767 ± 0.395
1.022AsnGly: 1.022 ± 0.344
1.022AsnHis: 1.022 ± 0.344
2.045AsnIle: 2.045 ± 0.758
1.278AsnLys: 1.278 ± 0.449
4.09AsnLeu: 4.09 ± 1.179
0.511AsnMet: 0.511 ± 0.229
1.534AsnAsn: 1.534 ± 0.567
2.812AsnPro: 2.812 ± 0.659
2.556AsnGln: 2.556 ± 0.671
2.045AsnArg: 2.045 ± 0.648
2.812AsnSer: 2.812 ± 0.54
2.045AsnThr: 2.045 ± 0.986
1.789AsnVal: 1.789 ± 2.256
1.022AsnTrp: 1.022 ± 0.526
1.789AsnTyr: 1.789 ± 0.443
0.0AsnXaa: 0.0 ± 0.0
Pro
2.301ProAla: 2.301 ± 0.49
0.767ProCys: 0.767 ± 0.316
2.556ProAsp: 2.556 ± 0.859
3.323ProGlu: 3.323 ± 1.094
1.022ProPhe: 1.022 ± 0.309
4.857ProGly: 4.857 ± 1.368
1.022ProHis: 1.022 ± 0.734
2.301ProIle: 2.301 ± 0.552
4.346ProLys: 4.346 ± 1.245
5.624ProLeu: 5.624 ± 1.092
0.256ProMet: 0.256 ± 0.132
1.789ProAsn: 1.789 ± 0.872
4.601ProPro: 4.601 ± 2.39
1.789ProGln: 1.789 ± 0.24
2.556ProArg: 2.556 ± 0.655
6.391ProSer: 6.391 ± 2.074
3.323ProThr: 3.323 ± 0.841
2.812ProVal: 2.812 ± 0.485
0.767ProTrp: 0.767 ± 0.395
2.045ProTyr: 2.045 ± 0.354
0.0ProXaa: 0.0 ± 0.0
Gln
3.323GlnAla: 3.323 ± 1.497
0.511GlnCys: 0.511 ± 0.554
2.556GlnAsp: 2.556 ± 0.574
3.834GlnGlu: 3.834 ± 0.539
1.278GlnPhe: 1.278 ± 0.705
2.045GlnGly: 2.045 ± 0.611
2.045GlnHis: 2.045 ± 0.812
2.812GlnIle: 2.812 ± 0.745
1.534GlnLys: 1.534 ± 0.789
7.669GlnLeu: 7.669 ± 1.316
1.022GlnMet: 1.022 ± 0.485
1.022GlnAsn: 1.022 ± 0.242
1.278GlnPro: 1.278 ± 0.264
3.067GlnGln: 3.067 ± 0.85
1.022GlnArg: 1.022 ± 0.379
3.067GlnSer: 3.067 ± 0.871
2.556GlnThr: 2.556 ± 0.605
2.812GlnVal: 2.812 ± 1.443
0.767GlnTrp: 0.767 ± 0.259
2.301GlnTyr: 2.301 ± 0.339
0.0GlnXaa: 0.0 ± 0.0
Arg
2.301ArgAla: 2.301 ± 0.566
0.256ArgCys: 0.256 ± 0.132
1.534ArgAsp: 1.534 ± 0.586
4.346ArgGlu: 4.346 ± 0.82
3.323ArgPhe: 3.323 ± 1.123
3.579ArgGly: 3.579 ± 1.185
0.767ArgHis: 0.767 ± 0.316
2.812ArgIle: 2.812 ± 0.772
2.812ArgLys: 2.812 ± 0.552
6.646ArgLeu: 6.646 ± 1.451
2.045ArgMet: 2.045 ± 0.812
2.045ArgAsn: 2.045 ± 0.782
2.556ArgPro: 2.556 ± 0.97
2.556ArgGln: 2.556 ± 0.751
3.579ArgArg: 3.579 ± 0.757
3.323ArgSer: 3.323 ± 1.612
5.368ArgThr: 5.368 ± 2.209
3.834ArgVal: 3.834 ± 0.876
1.022ArgTrp: 1.022 ± 0.564
1.278ArgTyr: 1.278 ± 0.449
0.0ArgXaa: 0.0 ± 0.0
Ser
4.601SerAla: 4.601 ± 0.86
0.511SerCys: 0.511 ± 0.263
3.579SerAsp: 3.579 ± 1.476
2.812SerGlu: 2.812 ± 0.75
1.789SerPhe: 1.789 ± 0.591
6.391SerGly: 6.391 ± 0.763
1.534SerHis: 1.534 ± 0.529
5.112SerIle: 5.112 ± 1.02
4.09SerLys: 4.09 ± 1.312
8.436SerLeu: 8.436 ± 1.366
2.045SerMet: 2.045 ± 0.354
2.556SerAsn: 2.556 ± 0.907
4.346SerPro: 4.346 ± 0.459
1.789SerGln: 1.789 ± 0.881
4.601SerArg: 4.601 ± 1.257
6.902SerSer: 6.902 ± 2.175
5.112SerThr: 5.112 ± 3.329
3.067SerVal: 3.067 ± 0.579
1.278SerTrp: 1.278 ± 0.264
2.812SerTyr: 2.812 ± 0.595
0.0SerXaa: 0.0 ± 0.0
Thr
6.391ThrAla: 6.391 ± 0.977
0.767ThrCys: 0.767 ± 0.287
5.112ThrAsp: 5.112 ± 1.109
2.812ThrGlu: 2.812 ± 0.767
1.534ThrPhe: 1.534 ± 0.689
3.834ThrGly: 3.834 ± 0.743
1.278ThrHis: 1.278 ± 0.323
4.857ThrIle: 4.857 ± 2.104
1.534ThrLys: 1.534 ± 0.567
7.924ThrLeu: 7.924 ± 0.317
1.534ThrMet: 1.534 ± 0.706
2.556ThrAsn: 2.556 ± 0.493
4.857ThrPro: 4.857 ± 0.826
2.045ThrGln: 2.045 ± 0.354
5.879ThrArg: 5.879 ± 1.559
4.857ThrSer: 4.857 ± 1.174
5.879ThrThr: 5.879 ± 2.637
3.834ThrVal: 3.834 ± 1.369
1.022ThrTrp: 1.022 ± 0.526
2.556ThrTyr: 2.556 ± 0.631
0.0ThrXaa: 0.0 ± 0.0
Val
3.067ValAla: 3.067 ± 0.595
1.789ValCys: 1.789 ± 0.627
3.579ValAsp: 3.579 ± 0.793
3.323ValGlu: 3.323 ± 0.422
2.301ValPhe: 2.301 ± 0.679
2.301ValGly: 2.301 ± 0.646
1.789ValHis: 1.789 ± 0.418
4.346ValIle: 4.346 ± 0.823
3.067ValLys: 3.067 ± 0.48
8.18ValLeu: 8.18 ± 1.306
1.022ValMet: 1.022 ± 0.437
2.556ValAsn: 2.556 ± 0.823
3.067ValPro: 3.067 ± 0.75
1.278ValGln: 1.278 ± 0.775
3.579ValArg: 3.579 ± 1.396
3.323ValSer: 3.323 ± 0.976
4.346ValThr: 4.346 ± 2.097
4.346ValVal: 4.346 ± 1.383
1.022ValTrp: 1.022 ± 0.344
1.022ValTyr: 1.022 ± 0.341
0.0ValXaa: 0.0 ± 0.0
Trp
2.045TrpAla: 2.045 ± 0.477
0.256TrpCys: 0.256 ± 0.34
1.534TrpAsp: 1.534 ± 0.565
0.511TrpGlu: 0.511 ± 0.263
0.767TrpPhe: 0.767 ± 0.316
0.767TrpGly: 0.767 ± 0.628
0.256TrpHis: 0.256 ± 0.132
0.511TrpIle: 0.511 ± 0.263
1.022TrpLys: 1.022 ± 0.341
1.022TrpLeu: 1.022 ± 0.473
0.0TrpMet: 0.0 ± 0.0
1.022TrpAsn: 1.022 ± 0.526
0.256TrpPro: 0.256 ± 0.132
0.767TrpGln: 0.767 ± 0.264
0.511TrpArg: 0.511 ± 0.538
0.767TrpSer: 0.767 ± 0.264
1.534TrpThr: 1.534 ± 0.34
0.511TrpVal: 0.511 ± 0.263
0.256TrpTrp: 0.256 ± 0.132
0.511TrpTyr: 0.511 ± 0.263
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.045TyrAla: 2.045 ± 0.558
0.511TyrCys: 0.511 ± 0.237
1.022TyrAsp: 1.022 ± 0.389
0.511TyrGlu: 0.511 ± 0.229
1.022TyrPhe: 1.022 ± 0.242
2.045TyrGly: 2.045 ± 1.053
1.278TyrHis: 1.278 ± 0.705
2.301TyrIle: 2.301 ± 0.41
2.045TyrLys: 2.045 ± 0.498
3.579TyrLeu: 3.579 ± 0.692
0.511TyrMet: 0.511 ± 0.263
1.022TyrAsn: 1.022 ± 0.344
2.301TyrPro: 2.301 ± 0.575
1.789TyrGln: 1.789 ± 0.662
2.301TyrArg: 2.301 ± 0.789
1.789TyrSer: 1.789 ± 0.715
2.301TyrThr: 2.301 ± 1.016
1.534TyrVal: 1.534 ± 0.529
0.0TyrTrp: 0.0 ± 0.0
2.556TyrTyr: 2.556 ± 0.696
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3913 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski