This is shown in Figure 2d of the paper; see below for a sample attention mask.

Using those attention matrices with fewer parameters then allows the model to handle inputs with a longer sequence length.
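As a minimal sketch of what such a sparse attention mask can look like, the snippet below builds a local (banded) mask in which each position only attends to neighbors within a fixed window. The window size and sequence length here are illustrative choices, not values from the paper:

```python
import numpy as np

def local_attention_mask(seq_len: int, window: int) -> np.ndarray:
    """Boolean mask: position i may attend to position j iff |i - j| <= window."""
    idx = np.arange(seq_len)
    return np.abs(idx[:, None] - idx[None, :]) <= window

# Each row shows which positions one query token can attend to.
mask = local_attention_mask(seq_len=8, window=2)
print(mask.astype(int))
```

Because each row has at most `2 * window + 1` nonzero entries, the attention cost grows linearly with sequence length instead of quadratically, which is what makes longer inputs feasible.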